This project builds a RAG pipeline using:
- BGE-M3 / BM42 for embeddings
- Qdrant as the vector store
- Ollama for local LLM generation
- LangChain for orchestration
- RAGAS for evaluation
Download the dataset from Kaggle and place the two .csv files into `data/`.
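As a quick sanity check, a short sketch like the following can confirm the files landed in `data/` and expose the 'content' column used later for chunking (the exact CSV filenames are not assumed here):

```python
# Sketch: verify the Kaggle CSVs are in data/ and have a 'content' column.
from pathlib import Path

import pandas as pd

csv_files = sorted(Path("data").glob("*.csv"))
print(csv_files)  # expect the two Kaggle .csv files here

df = pd.read_csv(csv_files[0])
print(df.columns)            # the pipeline chunks the 'content' column
print(df["content"].head())  # preview of the text that will be chunked
```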
Install Ollama, then pull the required models:
ollama pull gemma3:4b
ollama pull gemma3:1b
ollama pull bge-m3
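To confirm the models were pulled correctly, a quick smoke test through LangChain's Ollama integration might look like this (a sketch; assumes the `langchain-ollama` package is installed and the Ollama daemon is running):

```python
# Sketch: smoke-test the pulled Ollama models via LangChain.
from langchain_ollama import ChatOllama, OllamaEmbeddings

llm = ChatOllama(model="gemma3:4b")          # generation model
print(llm.invoke("Say hello in one word.").content)

embedder = OllamaEmbeddings(model="bge-m3")  # dense embedding model
vector = embedder.embed_query("test sentence")
print(len(vector))  # BGE-M3 produces 1024-dimensional dense vectors
```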
Follow the Qdrant Quickstart to run Qdrant locally: start the Qdrant server in Docker before running any Python code that connects to it.
mkdir -p data/qdrant_storage
docker run -d --name qdrant \
  -p 6333:6333 \
  -p 6334:6334 \
  -v "$(pwd)/data/qdrant_storage:/qdrant/storage" \
  qdrant/qdrant
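Once the container is up (and the Python environment from the next step is active), a minimal connectivity check with `qdrant-client` could look like this (a sketch; assumes the default local ports):

```python
# Sketch: confirm the local Qdrant server is reachable.
from qdrant_client import QdrantClient

client = QdrantClient(url="http://localhost:6333")
print(client.get_collections())  # empty collection list on a fresh instance
```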
Create and activate the conda environment:
conda env create -f environment.yml
conda activate npr
All chunks, embeddings, the Qdrant storage, retrievals, augmented generations, and evaluation files are available at:
Run the notebooks (01, 02, 03, 04, ...) manually, one after the other.
The RAG pipeline consists of two main phases: indexing and retrieval/generation.
Indexing Phase:
- We preprocess a CSV file and split the 'content' column into text chunks using various strategies: sentence-level, paragraph-level, overlapping, semantic (with similarity threshold), and article-wise.
- Chunks are embedded using dense (BGE-M3) and/or sparse (BM42) models.
- The resulting embeddings are stored in a Qdrant vector database, supporting dense, sparse, or hybrid search (a minimal indexing sketch follows this list).
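As a concrete illustration of the indexing phase, here is a dense-only sketch using overlapping chunks; the CSV filename, collection name, and chunk sizes are illustrative assumptions, and the sparse/hybrid (BM42) path and the other chunking strategies are omitted for brevity:

```python
# Sketch: chunk the 'content' column, embed with BGE-M3, and upsert into Qdrant.
import pandas as pd
from langchain_ollama import OllamaEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter
from qdrant_client import QdrantClient, models

COLLECTION = "npr_chunks"  # illustrative collection name

df = pd.read_csv("data/articles.csv")  # illustrative file name
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = [c for text in df["content"].dropna() for c in splitter.split_text(text)]

embedder = OllamaEmbeddings(model="bge-m3")
vectors = embedder.embed_documents(chunks)

client = QdrantClient(url="http://localhost:6333")
if client.collection_exists(COLLECTION):
    client.delete_collection(collection_name=COLLECTION)
client.create_collection(
    collection_name=COLLECTION,
    vectors_config=models.VectorParams(size=1024, distance=models.Distance.COSINE),
)
client.upsert(
    collection_name=COLLECTION,
    points=[
        models.PointStruct(id=i, vector=vec, payload={"text": chunk})
        for i, (chunk, vec) in enumerate(zip(chunks, vectors))
    ],
)
```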
Retrieval & Generation Phase:
- A user submits a query.
- It is embedded using the corresponding model (the same embedding model is used for indexing and retrieval).
- A similarity search retrieves the top-k matching chunks from Qdrant.
- Optional reranking improves result relevance.
- The top results are sent to an LLM, which generates the final answer.
The system allows experimentation with chunking methods, embedding types, and hybrid retrieval approaches; a matching retrieval-and-generation sketch follows.
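The sketch below mirrors the indexing sketch above (dense-only, no reranking); the collection name and prompt wording are assumptions, and it relies on `query_points` from a recent `qdrant-client` release:

```python
# Sketch: embed a query, retrieve top-k chunks from Qdrant, and generate an answer.
from langchain_ollama import ChatOllama, OllamaEmbeddings
from qdrant_client import QdrantClient

COLLECTION = "npr_chunks"  # must match the collection used at indexing time

client = QdrantClient(url="http://localhost:6333")
embedder = OllamaEmbeddings(model="bge-m3")   # same embedding model as at indexing time
llm = ChatOllama(model="gemma3:4b")

query = "What did the article say about ...?"  # placeholder question
query_vector = embedder.embed_query(query)

# Top-k similarity search over the stored dense vectors.
hits = client.query_points(collection_name=COLLECTION, query=query_vector, limit=5).points
context = "\n\n".join(hit.payload["text"] for hit in hits)

# Send the retrieved context plus the question to the LLM.
prompt = (
    "Answer the question using only the context below.\n\n"
    f"Context:\n{context}\n\n"
    f"Question: {query}"
)
print(llm.invoke(prompt).content)
```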