Grounding DINO links language and detection: you type a description and it draws boxes around matching objects, even classes it was never explicitly trained on. This removes the need to collect data and retrain every time you want to detect something new.

