Mean Opinion Score (MOS) is the single headline number a subjective test produces: the arithmetic mean of the ratings a panel of viewers gave one clip, almost always on the five-point Absolute Category Rating scale where 5 is excellent and 1 is bad. If N viewers rate a clip, the MOS is the sum of their ratings divided by N. The term came from telephony (ITU-T P.800.1) before video borrowed it. A MOS is only as meaningful as the scale it rides on, and that scale is ordinal, so the categories are ordered but the gaps between them are not guaranteed equal; a 4 is not twice as good as a 2. Crucially, a bare average is not yet a measurement. You compute the MOS only after screening unreliable viewers and you must attach a 95% confidence interval and state the method, scale, and conditions. Every objective metric (PSNR, SSIM, VMAF) is trained against MOS or DMOS.

