Text-to-speech converts written words into a lifelike voice. Quality now rivals human recordings, and streaming variants start speaking before the whole text is ready, which matters for live assistants. Voice cloning is a sensitive extension of TTS.