WebGPU lets JavaScript drive the graphics card for heavy parallel work, including neural-network inference, inside an ordinary web page. That can make in-browser inference, such as background blur or client-side transcription, fast enough to run in real time without a server round-trip.
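
To make "driving the graphics card from JavaScript" concrete, here is a minimal sketch of a WebGPU compute pass that doubles every element of a `Float32Array`. It is an illustration rather than a reference implementation: the names `doubleOnGpu`, `workgroupCount`, and `shaderSource`, and the workgroup size of 64, are assumptions chosen for this example. The function returns `null` when WebGPU is unavailable instead of falling back to the CPU.

```javascript
// Illustrative workgroup size; 64 is an assumption, not a requirement.
const WORKGROUP_SIZE = 64;

// WGSL compute shader: each invocation doubles one array element.
const shaderSource = `
@group(0) @binding(0) var<storage, read_write> data: array<f32>;

@compute @workgroup_size(${WORKGROUP_SIZE})
fn main(@builtin(global_invocation_id) id: vec3<u32>) {
  // Guard against invocations past the end of the array.
  if (id.x < arrayLength(&data)) {
    data[id.x] = data[id.x] * 2.0;
  }
}`;

// Number of workgroups needed to cover n elements.
function workgroupCount(n) {
  return Math.ceil(n / WORKGROUP_SIZE);
}

// Hypothetical helper: runs the shader over a Float32Array.
// Resolves to a new Float32Array, or null if WebGPU is unavailable.
async function doubleOnGpu(input) {
  if (typeof navigator === "undefined" || !navigator.gpu) return null;
  const adapter = await navigator.gpu.requestAdapter();
  if (!adapter) return null;
  const device = await adapter.requestDevice();

  // Storage buffer the shader reads and writes in place.
  const storage = device.createBuffer({
    size: input.byteLength,
    usage: GPUBufferUsage.STORAGE | GPUBufferUsage.COPY_SRC | GPUBufferUsage.COPY_DST,
  });
  device.queue.writeBuffer(storage, 0, input);

  // Separate mappable buffer for reading results back on the CPU.
  const readback = device.createBuffer({
    size: input.byteLength,
    usage: GPUBufferUsage.MAP_READ | GPUBufferUsage.COPY_DST,
  });

  const pipeline = device.createComputePipeline({
    layout: "auto",
    compute: {
      module: device.createShaderModule({ code: shaderSource }),
      entryPoint: "main",
    },
  });
  const bindGroup = device.createBindGroup({
    layout: pipeline.getBindGroupLayout(0),
    entries: [{ binding: 0, resource: { buffer: storage } }],
  });

  // Record the compute pass and the copy, then submit both to the GPU queue.
  const encoder = device.createCommandEncoder();
  const pass = encoder.beginComputePass();
  pass.setPipeline(pipeline);
  pass.setBindGroup(0, bindGroup);
  pass.dispatchWorkgroups(workgroupCount(input.length));
  pass.end();
  encoder.copyBufferToBuffer(storage, 0, readback, 0, input.byteLength);
  device.queue.submit([encoder.finish()]);

  // Wait for the GPU, then copy the mapped bytes before unmapping.
  await readback.mapAsync(GPUMapMode.READ);
  const result = new Float32Array(readback.getMappedRange().slice(0));
  readback.unmap();
  return result;
}
```

In a browser with WebGPU enabled, `await doubleOnGpu(new Float32Array([1, 2, 3]))` would be called from a module script or an async function. Neural-network inference follows the same pattern at larger scale: weights and activations live in storage buffers, and each layer is one or more dispatched compute passes.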