A large language model (LLM) is trained on very large text corpora to predict the next token, and from that objective it learns to produce fluent text, answers, and code. In video products it summarises transcripts, drives agents, and, when paired with a vision encoder, serves as the language half of a vision-language model (VLM).