What is RAG, and when should you use it?
A plain-English guide to retrieval-augmented generation — what it is, what problem it solves, and when it's the right tool (and when it isn't).
ChatGPT is impressive — but it doesn’t know anything about your company. It hasn’t read your employee handbook, your product specs, your customer support tickets, or the email thread you sent last Tuesday. Ask it about any of those, and it will either say “I don’t know,” or — worse — make something up.
That gap is what RAG solves. RAG stands for Retrieval-Augmented Generation, which is a fancy way of saying: before the AI answers, it looks things up in your data first.
The librarian analogy
Imagine your company’s knowledge as a giant library. ChatGPT on its own is like a well-read friend who has never set foot in your library. Useful, but not for your specific questions.
RAG turns that friend into a librarian. When you ask a question, the librarian walks into the stacks, pulls the most relevant books off the shelf, reads the relevant pages, and then answers — and tells you where the information came from.
That’s RAG. The AI is the same; what changes is what it’s allowed to read before answering.
When RAG is the right tool
Reach for RAG when any of these are true:
- Your data is private. Internal documents, contracts, customer records — anything that isn’t (and shouldn’t be) on the public internet.
- Your data changes. Pricing, inventory, policies, ongoing projects. A model trained six months ago doesn’t know about last week’s policy update; a RAG system that reads from your live documents does.
- You need citations. Legal, medical, finance, compliance — anywhere “the AI said so” isn’t a good enough answer. RAG can point at the exact page it pulled from.
- You want fewer hallucinations. When the AI is forced to answer from real source material, it makes things up far less often.
When RAG is the wrong tool
RAG isn’t the answer to everything. Skip it if:
- You want the AI to be creative (marketing copy, brainstorming, fiction). Retrieval doesn’t help here.
- The knowledge is general and already in the model. “Explain how interest rates work” doesn’t need to read your library.
- Your “knowledge base” is three documents and a Notion page. Small static corpuses might just fit directly in the prompt.
Common use cases
A few places RAG works well:
- Internal Q&A assistants — employees ask “how do I file expenses?” and get a grounded answer with a link to the source.
- Customer support copilots — read manuals, past tickets, and policy docs before drafting a reply.
- Document analysis — ask questions across a stack of contracts, reports, or research papers.
- Sales enablement — answer reps’ questions about pricing, competitors, or product details from internal sources.
What RAG doesn’t fix
A common misunderstanding: RAG is sometimes pitched as the answer to all enterprise AI problems. It isn’t.
RAG doesn’t fix bad source documents. If your knowledge base is contradictory, outdated, or vague, the AI will faithfully reflect that. Garbage in, articulate garbage out.
RAG also doesn’t fix wrong questions. If users ask things your data doesn’t cover, the system needs to know how to say “I don’t have that” — which is its own engineering problem.
How to know it worked
A working RAG system answers correctly, cites real sources, and refuses gracefully when it doesn’t know. That sounds simple, but it’s the difference between a system you can trust and one that confidently lies twice a day.
If you’re considering RAG, that’s the bar: not “does the demo look good?” but “does it still look good when someone asks the awkward edge-case version of the question that real users will ask on day one?”
Further reading:
- Different ways to build a RAG system — naive, hybrid, reranked, multi-vector. Which one fits which problem.
- Agentic RAG: when retrieval meets agents — what happens when the AI decides when and how to search.
- From RAG demo to production: the checklist nobody gave you — what you actually have to solve before launch.
Need help putting an LLM system into production?
Get in touch