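LLM-as-judge asks a capable model to score or compare outputs against explicit criteria, giving evaluation that is faster and cheaper than human review and easy to rerun on transcripts, summaries, or video answers. It depends on carefully written prompts and on regular spot-checks against human ratings, since the judge can be biased (for example toward longer answers, or toward whichever response is shown first) or simply wrong.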
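
As a rough illustration of the loop described above, the following Python sketch builds a rubric prompt, parses a numeric score from the judge's reply, and measures how often the judge agrees with human ratings. Everything named here is an assumption made for the example rather than part of any particular library: the prompt wording, the 1 to 5 scale, the "Score: N" reply format, and the `call_model` callable, which stands in for whatever client actually sends a prompt to a model and returns its text.

```python
import re
from typing import Callable, Optional, Sequence

# Hypothetical prompt template; the wording and the 1-5 scale are assumptions.
JUDGE_PROMPT = """You are grading a candidate answer.
Criteria: {criteria}

Question:
{question}

Candidate answer:
{answer}

Reply with one line of the form "Score: N", where N is an integer from 1 to 5."""


def build_prompt(criteria: str, question: str, answer: str) -> str:
    """Fill the rubric template for one item."""
    return JUDGE_PROMPT.format(criteria=criteria, question=question, answer=answer)


def parse_score(reply: str, low: int = 1, high: int = 5) -> Optional[int]:
    """Extract the score, or return None if it is missing or out of range."""
    match = re.search(r"Score:\s*(\d+)", reply)
    if match is None:
        return None
    score = int(match.group(1))
    return score if low <= score <= high else None


def judge(call_model: Callable[[str], str], criteria: str,
          question: str, answer: str) -> Optional[int]:
    """Score one answer. `call_model` is a caller-supplied function
    (hypothetical here) that sends a prompt to a model and returns text."""
    return parse_score(call_model(build_prompt(criteria, question, answer)))


def agreement_rate(judge_scores: Sequence[Optional[int]],
                   human_scores: Sequence[int],
                   tolerance: int = 0) -> float:
    """Fraction of items where judge and human scores differ by at most
    `tolerance`. Items the judge failed to score (None) are skipped."""
    pairs = [(j, h) for j, h in zip(judge_scores, human_scores) if j is not None]
    if not pairs:
        return 0.0
    return sum(abs(j - h) <= tolerance for j, h in pairs) / len(pairs)


if __name__ == "__main__":
    # Stand-in for a real model call, so the sketch runs offline.
    def fake_model(prompt: str) -> str:
        return "Reasoning: mostly accurate.\nScore: 4"

    judged = [judge(fake_model, "factual accuracy", "What is shown?", "A cat.")]
    print(judged, agreement_rate(judged, [4]))
```

Returning `None` for an unparseable or out-of-range reply, instead of guessing a score, keeps malformed judge output visible. A low agreement rate on the human-labelled sample is a signal to revise the prompt or criteria before trusting the judge at scale.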