Vector Databases: Pinecone, Weaviate & pgvector · Lesson

Contextual Information Retrieval

Implement strategies for retrieving the most relevant context from your vector store to augment LLM prompts.

What is Context Retrieval?

In a Retrieval-Augmented Generation (RAG) system, the Large Language Model (LLM) needs relevant information to generate accurate responses.

Contextual Information Retrieval is the process of finding and fetching this relevant data from your vector database.
It's the bridge that connects the user's query to the knowledge stored in your specialized data.

When a user asks a question, several steps happen to get the right context:

The user's question (query) is converted into a vector embedding.
This query embedding is sent to the vector database.
The vector database searches for stored document embeddings that are most similar to the query embedding.
The text chunks associated with these similar embeddings are retrieved and sent to the LLM.