A latency budget assigns each stage — capture, model, network, render — a slice of the allowed delay, forcing trade-offs so the whole pipeline feels instant. Designing to a budget is how teams keep AI in a live call from making it lag.
Definition
The total delay a real-time feature is allowed, split across each step so the sum stays under the limit — often around 100 ms end to end for live video.
A latency budget assigns each stage — capture, model, network, render — a slice of the allowed delay, forcing trade-offs so the whole pipeline feels instant. Designing to a budget is how teams keep AI in a live call from making it lag.
Also known as
latency budgeting