The context window is the model's working memory limit. A large window lets you feed long transcripts or many frames in a single call; exceed it and you must summarise, chunk, or retrieve. Window size strongly affects what video tasks are feasible and what they cost.

