RAG retrieves the most relevant passages or clips from a knowledge store, then hands them to the model as context for its answer. It curbs hallucination, keeps responses current, and lets a model use private data it was never trained on.