Latency measures how long a feature makes the user wait — from a spoken word to its caption, or a frame to its blurred background. Real-time calls feel broken above roughly 100-200 ms, so every model in the path must fit inside a strict time budget.
Definition
The delay between input and result, usually in milliseconds. In real-time video, the budget for an AI step is tiny — often under 100 ms end to end.
Latency measures how long a feature makes the user wait — from a spoken word to its caption, or a frame to its blurred background. Real-time calls feel broken above roughly 100-200 ms, so every model in the path must fit inside a strict time budget.
Also known as
delay, lag