Auditory masking

Definition

The effect where a loud sound hides a quieter one nearby in frequency or time. Lossy codecs exploit it: masked detail can be removed and never heard.

Masking is the perceptual effect that lossy compression depends on: a loud sound makes quieter sounds near it inaudible, either in frequency (a strong tone hides softer tones close to it, also called simultaneous masking) or in time (a loud transient hides sounds just before and after it, with the effect lasting longer after the transient than before it). The auditory system does not register the masked components, so a codec can quantize them coarsely or discard them entirely without audible loss. A psychoacoustic model is, at its core, a masking calculator: it estimates a moment-by-moment threshold below which detail can be thrown away. Masking explains why lossy codecs can discard most of the data and still sound transparent at adequate bitrates: much of what they remove was already inaudible.
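The idea of a masking threshold can be made concrete with a small sketch. The Python below is an illustration only, not the model used by any particular codec: it assumes a single tonal masker, a triangular spreading function on the Bark scale, and made-up constants (27 dB/Bark on the low side, 10 dB/Bark on the high side, and a 10 dB offset between the masker level and the peak of the threshold). The function names are hypothetical. It also ignores the absolute threshold of hearing and temporal masking.

```python
import math


def hz_to_bark(f_hz):
    """Approximate critical-band rate (Bark) using the Zwicker-Terhardt formula."""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)


def masked_threshold_db(masker_hz, masker_db, probe_hz,
                        lower_slope=27.0, upper_slope=10.0, offset_db=10.0):
    """Estimate the level (dB) below which a probe tone is hidden by the masker.

    The slopes (dB per Bark) and the offset are illustrative assumptions,
    not values taken from a standard.
    """
    dz = hz_to_bark(probe_hz) - hz_to_bark(masker_hz)
    # Masking spreads further toward higher frequencies, so the threshold
    # falls off more gently above the masker than below it.
    drop = upper_slope * dz if dz >= 0 else lower_slope * -dz
    return masker_db - offset_db - drop


def is_masked(masker_hz, masker_db, probe_hz, probe_db):
    """True if the probe lies below the estimated masking threshold."""
    return probe_db < masked_threshold_db(masker_hz, masker_db, probe_hz)


if __name__ == "__main__":
    # An 80 dB tone at 1 kHz against a 40 dB probe, near and far in frequency.
    print(is_masked(1000.0, 80.0, 1100.0, 40.0))
    print(is_masked(1000.0, 80.0, 4000.0, 40.0))
```

Under these assumed constants, the 40 dB probe at 1100 Hz falls below the threshold of the 80 dB masker and counts as masked, while the same probe at 4 kHz is too far away in frequency to be hidden. A real psychoacoustic model combines the contributions of many maskers per frame and adapts the slopes to masker level and tonality.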

Also known as

frequency masking, temporal masking