Client-Side ASR (Whisper In The Browser) Engineering Cheat Sheet

One-page reference covering: untangling the names ('faster-whisper' is a server-side CTranslate2 library, NOT a browser engine; the real browser engines are whisper.cpp via WebAssembly and Transformers.js via WebGPU, plus the built-in Web Speech API, which sends audio to Google by default); the in-browser pipeline (microphone -> 16 kHz mono -> Silero VAD gate -> Whisper -> caption, all in the tab); WebAssembly vs WebGPU (CPU vs GPU, ~100x, and the cross-origin isolation headers COOP and COEP needed for Wasm threads); the cost ledger (model download of 31-182 MB quantized, memory, battery); the real-time-factor check (an RTF below 1.0 keeps up with live audio); Moonshine for tight live loops; and the client-vs-server decision table.
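The real-time-factor check mentioned above can be sketched in a few lines. This is a minimal illustration, not code from the cheat sheet: it assumes RTF is defined as processing time divided by audio duration, and the function names (`audioDurationMs`, `realTimeFactor`, `keepsUpLive`) are hypothetical helpers introduced here. The 16 kHz default matches the mono input rate in the pipeline.

```typescript
// Hypothetical helpers for illustration; names are not from any library.

// Duration of a mono PCM buffer in milliseconds.
// Default sample rate is 16 kHz, the rate the pipeline feeds to Whisper.
function audioDurationMs(sampleCount: number, sampleRate: number = 16000): number {
  if (sampleRate <= 0) throw new RangeError("sampleRate must be positive");
  return (sampleCount / sampleRate) * 1000;
}

// Assumed definition: RTF = time spent transcribing / length of the audio.
function realTimeFactor(processingMs: number, audioMs: number): number {
  if (audioMs <= 0) throw new RangeError("audioMs must be positive");
  return processingMs / audioMs;
}

// An RTF below 1.0 means transcription finishes faster than audio arrives.
function keepsUpLive(processingMs: number, audioMs: number): boolean {
  return realTimeFactor(processingMs, audioMs) < 1.0;
}
```

In a browser one would typically measure `processingMs` by wrapping the transcription call in two `performance.now()` readings, then compare against the duration of the chunk that was transcribed. A device that takes 1.5 s to transcribe a 3 s chunk has an RTF of 0.5 and keeps up; one that takes 4 s does not.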


Specialist software house for video, real-time and AI products. Founded 2005. 50 in-house engineers.

+1 (914) 775-5855
New York · USA
© Fora Soft, 2005-2026