This is a multi-agent system that answers questions by routing each query to the most appropriate knowledge source: a vector database containing LangChain documentation, or Wikipedia for general-knowledge questions.
The system uses a LangGraph workflow to create a decision tree that:
- Receives a user question
- Routes it to the most appropriate data source
- Retrieves relevant information
- Returns the results
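The four steps above can be sketched without any framework. In the real project LangGraph wires these as graph nodes joined by a conditional edge; here a plain dict maps the router's answer to a node function, and every function body is an illustrative stand-in:

```python
def route_question(state: dict) -> str:
    # The real router asks an LLM; a keyword check stands in here.
    return "vectorstore" if "langchain" in state["question"].lower() else "wiki_search"

def retrieve(state: dict) -> dict:
    # Stand-in for querying the vector database.
    return {**state, "documents": ["<chunks from the vector database>"]}

def wiki_search(state: dict) -> dict:
    # Stand-in for querying Wikipedia.
    return {**state, "documents": ["<snippets from Wikipedia>"]}

NODES = {"vectorstore": retrieve, "wiki_search": wiki_search}

def answer(question: str) -> dict:
    state = {"question": question}   # receive a user question
    node = route_question(state)     # route to the appropriate source
    return NODES[node](state)        # retrieve and return the final state
```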
**Vector Database Agent**
- Purpose: Answers questions about LangChain concepts
- Implementation:
  - Uses Astra DB as a vector database
  - Documents from LangChain websites are split into 500-token chunks
  - Embeddings are created using HuggingFace's "all-MiniLM-L6-v2" model
  - Stored in Astra DB in a table named "multi_agent"
  - Retrieved using a retriever interface for semantic search
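The embed-store-retrieve pipeline can be sketched with toy stand-ins: a bag-of-words vector replaces the all-MiniLM-L6-v2 embedding and an in-memory list replaces the Astra DB table. Only the shape of the retrieval (similarity search over stored vectors) mirrors the real system:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; the real system uses all-MiniLM-L6-v2.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class ToyVectorStore:
    def __init__(self):
        self.rows = []  # (vector, chunk) pairs; Astra DB stores these in a table

    def add(self, chunk: str):
        self.rows.append((embed(chunk), chunk))

    def as_retriever(self, k: int = 2):
        # Returns a callable that performs the top-k similarity search,
        # mirroring the retriever interface described above.
        def retrieve(query: str):
            qv = embed(query)
            ranked = sorted(self.rows, key=lambda r: cosine(qv, r[0]), reverse=True)
            return [chunk for _, chunk in ranked[:k]]
        return retrieve
```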
**Wikipedia Search Agent**
- Purpose: Answers general knowledge questions
- Implementation:
  - Uses `WikipediaAPIWrapper` to interface with the Wikipedia API
  - Configured to return the top 2 most relevant results
  - Results are limited to 200 characters per article for conciseness
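The two settings above (top 2 results, 200 characters each) amount to a slice and a truncation. In this sketch, `fetch_articles` is a hypothetical search function standing in for the real `WikipediaAPIWrapper` call:

```python
def wiki_search_sketch(query, fetch_articles, top_k=2, max_chars=200):
    # fetch_articles is a hypothetical stand-in for the Wikipedia API call.
    articles = fetch_articles(query)[:top_k]    # keep the top-k results
    return [a[:max_chars] for a in articles]    # truncate for conciseness
```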
**Router**
- Purpose: Determines which knowledge source to use for each question
- Implementation:
  - Uses a Groq LLM with the llama3-8b-8192 model
  - Structured output using Pydantic models ensures consistent decisions
  - System prompt instructs the LLM on routing logic:
    - LangChain questions → Vector Database
    - General knowledge questions → Wikipedia
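The structured-output contract can be shown with a dependency-free sketch; the project itself uses a Pydantic model for the same job. The point is that the router may answer with exactly one of the two node names, nothing else:

```python
from dataclasses import dataclass
from typing import Literal, get_args

DataSource = Literal["vectorstore", "wiki_search"]

@dataclass
class RouteQuery:
    datasource: str

    def __post_init__(self):
        # Pydantic would enforce this constraint automatically on parse.
        if self.datasource not in get_args(DataSource):
            raise ValueError(f"invalid datasource: {self.datasource!r}")
```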
- Question Input: User submits a question
- Routing: The `route_question` function:
  - Extracts the question from the state
  - Uses the `question_router` to determine the appropriate data source
  - Returns either "vectorstore" or "wiki_search" as the next node
- Retrieval:
  - If routed to "vectorstore": the `retrieve` function queries the vector database
  - If routed to "wiki_search": the `wiki_search` function queries Wikipedia
- Output: The retrieved documents are returned as the final state
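The routing step can be sketched on its own: it extracts the question from the state, asks the router for a structured decision, and returns the next node's name. `FakeRouter` is a stand-in for the Groq-backed structured-output router:

```python
class FakeRouter:
    # Stand-in for the Groq LLM; real decisions come from a model call.
    def invoke(self, question: str):
        class Decision:
            datasource = ("vectorstore" if "langchain" in question.lower()
                          else "wiki_search")
        return Decision()

def route_question(state: dict, question_router=FakeRouter()) -> str:
    question = state["question"]                  # extract from the state
    decision = question_router.invoke(question)   # structured routing decision
    return decision.datasource                    # "vectorstore" | "wiki_search"
```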
We use a retriever interface to interact with the vector database because:
- It provides a standardized way to query vector stores
- It abstracts away the underlying vector database implementation
- It allows for easy swapping of vector stores
- It handles the similarity search operations efficiently
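The swap-ability argument can be made concrete: any object exposing the same retrieval method can back the graph, so changing vector stores touches no calling code. Both retrievers below are illustrative stand-ins, and the method name is an assumption modeled on LangChain's retriever interface:

```python
from typing import Protocol

class Retriever(Protocol):
    def get_relevant_documents(self, query: str) -> list: ...

class AstraRetriever:
    # Stand-in; the real version runs a similarity search against Astra DB.
    def get_relevant_documents(self, query):
        return [f"astra hit for {query!r}"]

class InMemoryRetriever:
    def get_relevant_documents(self, query):
        return [f"local hit for {query!r}"]

def retrieve_node(state: dict, retriever: Retriever) -> dict:
    # The node never cares which store sits behind the interface.
    return {**state, "documents": retriever.get_relevant_documents(state["question"])}
```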
The system uses a `GraphState` TypedDict to manage and pass state between components:
- `question`: The user's original question
- `documents`: The retrieved documents
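The schema itself is small; field names below match the document, and the sample state is illustrative:

```python
from typing import List, TypedDict

class GraphState(TypedDict):
    question: str          # the user's original question
    documents: List[str]   # the retrieved documents

# Each node receives a GraphState and returns an updated one.
state: GraphState = {"question": "What is LangChain?", "documents": []}
```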
- Documents are chunked using token-based splitting rather than character-based
- This ensures consistency across different languages and text types
- The tiktoken encoder aligns with how LLMs process text
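Token-based splitting can be sketched with a whitespace tokenizer standing in for the tiktoken encoder; the real splitter counts tiktoken tokens against the 500-token chunk size described above:

```python
def split_by_tokens(text: str, chunk_size: int = 500) -> list:
    tokens = text.split()  # stand-in for tiktoken encoding
    return [" ".join(tokens[i:i + chunk_size])
            for i in range(0, len(tokens), chunk_size)]
```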
