ELI5: How does RAG work

18 views Mar 12, 2026 2 min read

RAG is like asking a friend (the Large Language Model) a question, but first, that friend quickly looks up relevant information in a big notebook (the Knowledge Base) before answering.

Imagine you want to know "What's the best place to get pizza?" Your friend, let's call him Lenny (the LLM), doesn't just give you a random answer. Instead, he uses RAG like this:

Retrieval: Lenny first checks his "pizza notebook" (Knowledge Base). This notebook has information about all the pizza places in town, like their menus, prices, and reviews. He quickly finds the pages about pizza places near you.
Augmentation: Lenny takes the information from his notebook (like "Luigi's has 5-star reviews and a deal on pepperoni pizza tonight") and combines it with your question. Now he has all the ingredients to give you a great answer.
Generation: Finally, Lenny uses all this information to answer your question in a helpful way. He might say, "Based on my pizza notebook, Luigi's is the best place to get pizza tonight because they have 5-star reviews and a special on pepperoni!"

Without the pizza notebook, Lenny might just guess a random pizza place, or give you outdated information. RAG helps Lenny give you a more accurate and helpful answer because he has the right information at his fingertips. So, RAG makes Lenny a much better friend to ask about pizza! It helps him give better answers by first finding relevant information before answering your question.

How was this explanation?

Follow-Up Questions

Still curious? Ask a follow-up!

Test Your Understanding

Take a quick quiz and challenge your friends!

📧 Get this explanation by email

Receive this explanation in your inbox, plus get weekly simple explanations of trending topics!

Want to learn more?

Ask another question and get a simple explanation!

Ask a New Question