Lip-sync models reshape the mouth and jaw of a video face to fit new speech, used for dubbing into other languages and for AI presenters. Quality depends on natural mouth shapes and exact timing; visible mismatch breaks the illusion fast.
Definition
Making a face's mouth movements match a given audio track, so a talking-head video looks like it is really speaking the words. Core to AI avatars and dubbing.
Lip-sync models reshape the mouth and jaw of a video face to fit new speech, used for dubbing into other languages and for AI presenters. Quality depends on natural mouth shapes and exact timing; visible mismatch breaks the illusion fast.
Also known as
lip synchronization, talking head