Text-to-video models turn a prompt into moving footage, handling motion, lighting, and increasingly sound. They enable b-roll, concepts, and ads without filming, while consistency across shots and physical realism remain the hard problems.
Definition
Generating a video clip from a written description. The headline capability of tools like Sora, Veo, Runway, and Kling.
Text-to-video models turn a prompt into moving footage, handling motion, lighting, and increasingly sound. They enable b-roll, concepts, and ads without filming, while consistency across shots and physical realism remain the hard problems.
Also known as
T2V, text to video