ASR, diarization, TTS, voice cloning, noise suppression, echo cancellation, live translation — the speech stack of a modern video product, vendor by vendor and model by model. Ten lessons with the pricing tables vendors don't publish and the consent engineering the NO FAKES Act now requires.