This application provides a chat interface to interact with your documents using a locally running Small Language Model (SLM). It works entirely offline.
- Offline SLM: Uses `Phi-3-mini-4k-instruct` via `llama-cpp-python`.
- RAG (Retrieval-Augmented Generation): Uses `ChromaDB` and `SentenceTransformers` to index and retrieve relevant document chunks (see the retrieval sketch after this list).
- FastAPI Backend: Handles document ingestion and chat inference.
- Streamlit Frontend: Provides a user-friendly chat interface.
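For the curious, the retrieval side of the RAG pipeline boils down to embedding document chunks and querying the vector store. The snippet below is a minimal sketch rather than the backend's actual code; the embedding model name, collection name, and storage path are assumptions.

```python
# Sketch of the ingest/retrieve flow; model name, collection name, and path are assumed.
import chromadb
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")      # assumed embedding model
client = chromadb.PersistentClient(path="chroma_db")    # assumed storage location
collection = client.get_or_create_collection("documents")

def ingest(chunks: list[str]) -> None:
    # Embed each chunk and store it alongside its text with a stable id.
    embeddings = embedder.encode(chunks).tolist()
    collection.add(
        ids=[f"chunk-{i}" for i in range(len(chunks))],
        documents=chunks,
        embeddings=embeddings,
    )

def retrieve(question: str, k: int = 3) -> list[str]:
    # Embed the question and return the k most similar chunks as context.
    query_embedding = embedder.encode([question]).tolist()
    result = collection.query(query_embeddings=query_embedding, n_results=k)
    return result["documents"][0]
```

The retrieved chunks are then placed into the prompt that is sent to the SLM.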
- Python 3.10+
- Basic build tools (for `llama-cpp-python` compilation)
- Create a Virtual Environment (Recommended):

  ```bash
  python3 -m venv venv
  source venv/bin/activate
  ```

- Install Dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Download Model:

  ```bash
  python download_model.py
  ```
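The download step fetches the GGUF weights from Hugging Face. If you are curious what `download_model.py` does, it is essentially a single `huggingface_hub` call; the sketch below is an assumption about its contents, and the real script (and its target directory) may differ.

```python
# Hypothetical equivalent of download_model.py; repo and filename mirror backend/config.py,
# the local_dir is an assumption.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="microsoft/Phi-3-mini-4k-instruct-gguf",  # Config.MODEL_REPO
    filename="Phi-3-mini-4k-instruct-q4.gguf",        # Config.MODEL_FILENAME
    local_dir="models",                               # assumed download directory
)
print(f"Model downloaded to {model_path}")
```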
You can use the helper script to start both backend and frontend:

```bash
./start.sh
```

Or run them manually:

Backend:

```bash
uvicorn backend.main:app --reload --port 8000
```

Frontend:

```bash
streamlit run frontend/app.py --server.port 8501
```

- Open the Streamlit app (usually http://localhost:8501).
- Upload a PDF or Text file in the sidebar.
- Click "Ingest Document".
- Ask questions in the chat interface.
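The same ingest-then-ask workflow can also be driven against the FastAPI backend directly, which is handy for scripting or debugging. The endpoint paths and payload fields below are illustrative assumptions, not the documented API; check the interactive docs FastAPI serves at http://localhost:8000/docs for the actual routes.

```python
# Hypothetical direct calls to the backend; endpoint names and fields are assumptions.
import requests

BASE_URL = "http://localhost:8000"

# Upload and ingest a document (assumed endpoint).
with open("report.pdf", "rb") as f:
    resp = requests.post(f"{BASE_URL}/ingest", files={"file": f})
resp.raise_for_status()

# Ask a question about the ingested content (assumed endpoint and payload shape).
resp = requests.post(f"{BASE_URL}/chat", json={"question": "What is the report about?"})
resp.raise_for_status()
print(resp.json())
```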
See `archdocs/architecture.md` for C4 model diagrams.
The application uses `backend/config.py` to manage settings. You can easily swap the SLM model by updating this file.
- Edit `backend/config.py`:

  ```python
  class Config:
      MODEL_REPO = "microsoft/Phi-3-mini-4k-instruct-gguf"
      MODEL_FILENAME = "Phi-3-mini-4k-instruct-q4.gguf"
      PROMPT_TEMPLATE = "phi3"  # Options: phi3, chatml, llama2
  ```

- Download New Model:

  ```bash
  python download_model.py
  ```

- Restart Application:

  ```bash
  ./stop.sh
  ./start.sh
  ```
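For context on what the `PROMPT_TEMPLATE` option controls: the backend wraps your question (plus retrieved context) in a model-specific chat template before handing it to `llama-cpp-python`. The sketch below shows the general idea; the exact template strings, file paths, and parameters used by the backend are assumptions.

```python
# Hypothetical sketch of model loading and prompting with llama-cpp-python.
# Paths, parameters, and template strings are assumptions, not the backend's exact code.
from llama_cpp import Llama

llm = Llama(
    model_path="models/Phi-3-mini-4k-instruct-q4.gguf",  # assumed download location
    n_ctx=4096,      # Phi-3-mini-4k-instruct has a 4k context window
    verbose=False,
)

# Each PROMPT_TEMPLATE option corresponds to a different chat-formatting convention.
TEMPLATES = {
    "phi3": "<|user|>\n{prompt}<|end|>\n<|assistant|>\n",
    "chatml": "<|im_start|>user\n{prompt}<|im_end|>\n<|im_start|>assistant\n",
    "llama2": "[INST] {prompt} [/INST]",
}

prompt = TEMPLATES["phi3"].format(prompt="Summarize the ingested document.")
output = llm(prompt, max_tokens=256, stop=["<|end|>"])
print(output["choices"][0]["text"])
```

If you switch to a model that expects a different chat format, update `PROMPT_TEMPLATE` to match, otherwise the model may produce malformed or truncated answers.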