Learning course · Updated June 2026

Video surveillance & VMS engineering

How a modern surveillance system actually works — IP cameras, ONVIF and RTSP, VMS and NVR, edge and cloud video analytics, storage and retention, and privacy compliance. A practical course from Fora Soft engineers, from the camera sensor to the operator's screen.

Every chapter starts with a question and ends with a build decision. Standards cited by name and profile. Privacy tied to named law. No vendor brochures.

8 chapters       63 articles        120+ glossary terms       ~25 hrs total reading

Outcomes

What you'll be able to ship.

Eight blocks that take you from a single IP camera to a federated, compliant, multi-site VMS. By the end, you can specify, build, and operate a video surveillance system that holds up under load — and stands up to an audit.

01

Choose the right recorder and deployment

VMS vs NVR vs DVR, on-prem vs cloud vs hybrid. Pick the architecture that fits your camera count, retention needs, and budget.

02

Integrate any camera, any vendor

ONVIF Profile S/G/T/M, RTSP/RTP on the wire, WS-Discovery onboarding at scale, and the vendor SDKs you reach for when ONVIF stops.

03

Put analytics where they belong

Edge, edge-server, or cloud. Budget the compute cost per camera and hit the latency and accuracy target for each analytic.

04

Specify the analytics stack

Object detection, tracking and re-ID, face recognition, LPR/ANPR, behavioral and anomaly detection, search-by-event, and false-alarm tuning.

05

Size storage and retention correctly

The bitrate-by-camera-by-retention math, H.264 vs H.265, storage tiers, and federation across many sites — the line item most projects underestimate.

06

Ship a compliant system

GDPR (Reg. (EU) 2016/679) and BIPA, consent and notice, face masking and redaction, and lawful retention limits — built in, not bolted on.

Syllabus

The full course in eight chapters

Every chapter is self-contained. Read in order, or jump straight to the block you need — from the camera-to-client anatomy to vertical reference architectures.

01

Video Surveillance Foundations

What surveillance is, end to end. VMS vs NVR vs DVR, IP vs analog, the camera-to-client anatomy, deployment models, and the cost model.

Beginner8 articles · ~3 hrs
Read

02

Standards & Interoperability

The interoperability layer. ONVIF for engineers, Profile S/G/T/M, RTSP/RTP on the wire, PSIA, camera SDKs, and onboarding at scale.

beginner8 articles · ~3 hrs
Read

03

Edge + Cloud AI

Where analytics run, and what it costs. Edge vs cloud, on-camera AI, edge servers, the hybrid pattern, and latency/accuracy by tier.

intermediate8 articles · ~3 hrs
Read

04

Video Analytics

What the system can detect and search. Object detection, tracking and re-ID, face recognition, LPR/ANPR, behavioral and anomaly detection.

intermediate9 articles · ~9 hrs
Read

05

Storage & Scale

The retention math every project gets wrong first. Recording strategies, storage tiers, retention policy, federation, and capacity planning.

intermediate8 articles · ~3 hrs
Read

06

Privacy & Compliance

The compliance engine. Privacy by design, GDPR for video, BIPA and US biometric law, consent and notice, face masking, and lawful retention.

Advanced8 articles · ~3 hrs
Read

07

Vendor Matrix

The buy side, honestly compared. Milestone, Genetec, Avigilon, Eagle Eye, Spot AI, Ambient.ai, and the custom-vs-off-the-shelf decision.

Advanced7 articles · ~2.5 hrs
Read

08

Reference Architectures

Vertical blueprints that end in a cost estimate. Retail, perimeter, city, smart building, industrial, scope-to-cost, and the deployment checklist.

Advanced7 articles · ~3 hrs
Read

Ship video surveillance at production scale

Talk to the engineers who build it. Fora Soft helps teams integrate ONVIF camera fleets, place video analytics on the right tier, size storage, and ship VMS products that hold up — and stay compliant — in the field.

Reference

The vocabulary of video surveillance

120+ terms with crisp definitions, aliases, and links to deep dives. From VMS, NVR, and ONVIF to LPR/ANPR, federation, and BIPA — the full A–Z is one click away.

VMS

Video Management System. The software platform that ingests, records, and manages many cameras and recorders across servers — beyond what a single NVR or DVR appliance can do.

NVR

Network Video Recorder. Records the digital streams of IP cameras over a network. The recorder tier beneath a VMS; software NVRs blur the line.

ONVIF

Open Network Video Interface Forum. The standard that lets cameras and VMS platforms from different vendors interoperate — discovery, streaming, PTZ, and metadata.

RTSP

Real-Time Streaming Protocol. The session protocol an IP camera speaks to deliver its RTP video stream into a VMS or NVR.

Video analytics

Software that turns raw video into searchable events — object detection, tracking, license-plate and face recognition, and behavioral and anomaly detection.

Edge AI

On-camera or edge-server analytics that run close to the camera, cutting bandwidth and latency and keeping raw footage local for privacy.

Written and maintained by

The author.

Nikolay Sapunov, CEO at Fora Soft

Nikolay Sapunov

CEO at Fora Soft

Leads a software studio specialising in video-centric products — streaming platforms, WebRTC apps, video conferencing, computer vision, and AI-driven video tools. Writes this course so product and engineering teams can reason clearly about cameras, ONVIF and RTSP, edge and cloud video analytics, storage, and the privacy trade-offs behind every surveillance architecture decision.

FAQ

Frequently asked questions.

What is a VMS, and how is it different from an NVR and a DVR?

A DVR (digital video recorder) records analog cameras over coax and is the legacy tier. An NVR (network video recorder) records the digital streams of IP cameras over a network. A VMS (video management system) is the software platform above both — it manages many cameras, recorders, and servers, adds analytics, access control, and multi-site federation, and scales far beyond a single appliance. Most modern builds are VMS-led with IP cameras.

What is ONVIF, and what does it standardize?

ONVIF (Open Network Video Interface Forum) is the standard that lets IP cameras, NVRs, and VMS platforms from different vendors work together. It standardizes the interfaces — device discovery via WS-Discovery, video and audio streaming, PTZ control, and analytics metadata — through Profiles S, G, T, and M. It does not standardize analytics quality or every vendor feature; for those you fall back to a camera SDK. ONVIF is a baseline, not a ceiling.

Should video analytics run on the edge or in the cloud?

It depends on latency, bandwidth, and privacy. Edge analytics — on the camera or an edge server — react in milliseconds, cut upload bandwidth, and keep raw footage local, but the compute is limited. Cloud analytics offer elastic GPU power and easier model updates, at the cost of bandwidth, recurring spend, and data exposure. Most production systems are hybrid: lightweight detection at the edge filters what the cloud analyzes in depth.

How much does a video surveillance system cost?

There is no single figure — cost scales with camera count, resolution and bitrate, retention period, on-prem versus cloud, and how much analytics compute you run. The big line items are cameras and mounting, storage (the part most projects underestimate), VMS and analytics licenses, servers or cloud, integration, and the ongoing run-rate. Model it per camera per month, not as a one-time number. Chapter 1 ships a cost-model worksheet.

How do you calculate surveillance storage and retention?

Storage follows simple arithmetic: per-camera bitrate (Mbps) ÷ 8 × 3,600 × recording hours per day × retention days × number of cameras gives gigabytes. H.265 roughly halves the bitrate of H.264 for the same quality, and motion-only recording cuts it further. Retention is usually set by policy or law — Chapter 6 covers the legal caps. Chapter 5 ships a storage and retention worksheet.

What does GDPR require for video surveillance, and is face recognition legal?

Under GDPR (Regulation (EU) 2016/679), CCTV needs a lawful basis, clear signage and notice, data minimization, defined retention limits, and often a Data Protection Impact Assessment for large-scale monitoring. Face recognition adds biometric rules — in Illinois, BIPA requires informed written consent before capturing a faceprint, with a private right of action. Rules vary by region, so design for masking, consent, and lawful retention. This is engineering guidance, not legal advice.

Need to ship surveillance or VMS product, not just understand it?

Fora Soft has built real-time video, audio, and AI products since 2005 — WebRTC, LiveKit, generative pipelines, and AI agents at scale. Tell us what you’re building and we’ll send a real engineer your way.

Specialist software house for video, real-time and AI products. Founded 2005. 50 in-house engineers.

+1 (914) 775-5855
New York · USA
© Fora Soft, 20052026
Describe your project and we will get in touch
Enter your message
Enter your email
Enter your name

By submitting data in this form, you agree with the Personal Data Processing Policy.

Your message has been sent successfully
We will contact you soon
Message not sent. Please try again.