What is this course about?+
AI for video engineering is the practice of wiring AI into a real video product — and the AI tools you build with along the way. The course runs two tracks: Track A covers AI inside the product (object detection on a stream, speech-to-text in a call, generative b-roll, agents that review footage), and Track B covers the AI-assisted tools video engineers actually use to ship faster. Both are taught model by model, wired into a real pipeline, never derived from scratch.
Who is this course for?+
Engineers who already know either ML or video and need to ship AI features in a video product. Both halves are welcome: ML engineers with no video background, and video engineers with no ML background. Product leads scoping an AI feature will get the architecture and cost framing they need without the math.
Do I need a machine-learning background?+
No. The course doesn't re-derive backprop, transformers, or generic LLMs — excellent resources already do that, and we link to them. We focus on what's missing on the open web: how to wire a model into a video pipeline, what it costs, and where it breaks in production.
What models and tools does it cover?+
The ones that ship in 2026 — YOLO v8–v12, SAM 2, Grounding DINO, PaddleOCR, Whisper and WhisperX, Pyannote, ElevenLabs, LLaVA and Qwen-VL, Sora, Runway, Kling, CogVideoX, LangGraph, CrewAI, vLLM, Triton, and LiveKit Agents — always wired into a real video product with quality and cost gates, not catalogued for academic completeness.
Is the code runnable and open source?+
Yes. Lessons ship with runnable code, MIT-licensed, on public GitHub, with a maintainer keeping the repos working for 24 months. Each lesson is signed by an author and carries a visible last-updated date; the fastest-moving phases (multimodal and generative) are refreshed quarterly.
How is this different from a generic ML or dev-tools course?+
Every lesson passes two tests: it's about AI in video engineering specifically, and the information is genuinely hard to find on the open web. Generic ML theory, generic Cursor tutorials, and vendor docs are linked and cut. What's left is Fora Soft engineering recipes, 2026 frontier-model assessments that don't exist elsewhere, and production failure-mode walkthroughs.