Wiring Retrieval Into the Prompt
Answer with grounded, cited context.
Putting the Pieces Together
You can retrieve relevant chunks and call an LLM. Now you connect them: feed the retrieved text into the prompt as context.
The Augmented Prompt
A RAG prompt has three parts: an instruction, the retrieved context, and the user question. The model reads all three together.
All lessons in this course
- Why LLMs Need Retrieval
- Chunking and Embedding Documents
- Vector Search With a Vector Store
- Wiring Retrieval Into the Prompt