A Retrieval-Augmented Generation (RAG) system for documents, texts, and articles using a local Large Language Model (LLM).
This project implements a RAG system that stores documents on a Raspberry Pi (PC A: low performance, but with enough storage for the full document collection) and performs computations on a high-performance, GPU-equipped PC (PC B). The system supports semantic search over the stored documents and retrieval of the most relevant ones for a user query, using the hkunlp/instructor-large model to generate embeddings.
- Document Storage: Store PDFs and text files on a Raspberry Pi (PC A).
- Semantic Search: Perform similarity searches using embeddings.
- Document Retrieval: Retrieve and download relevant documents (PDF or text) based on queries.
- Local Processing: All computations are performed locally without relying on external services.
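The semantic-search feature above reduces to ranking stored document embeddings by cosine similarity against the query embedding. A minimal NumPy sketch with toy 3-dimensional vectors (instructor-large actually produces 768-dimensional ones):

```python
import numpy as np

def cosine_similarity(query, docs):
    """Cosine similarity between a query vector and each row of a matrix."""
    query = query / np.linalg.norm(query)
    docs = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    return docs @ query

# Toy corpus embeddings (4 documents, 3 dimensions for illustration only).
doc_embeddings = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.7, 0.7, 0.0],
    [0.0, 0.0, 1.0],
])
query = np.array([1.0, 0.1, 0.0])

scores = cosine_similarity(query, doc_embeddings)
ranked = np.argsort(-scores)   # highest similarity first
print(ranked[0])               # → 0: document 0 is closest to the query
```

Annoy accelerates exactly this ranking by approximating the nearest neighbours instead of scoring every document.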
```
Dual RAG Paper System/
├── server/
│   ├── app.py                    # Flask server: search endpoint and document delivery
│   └── transform_pdf_text.sh     # Converts stored PDFs to plain text
├── processing/
│   ├── embedding_generation.py   # Generates embeddings for the documents
│   ├── vector_search.py          # Builds the Annoy similarity index
│   └── call_server.py            # Client that queries the server
└── README.md
```
Requirements on PC A (Raspberry Pi):

- Python 3
- Flask (`pip install flask`)
- NumPy (`pip install numpy`)
- Annoy (`pip install annoy`)
- Poppler Utils (`sudo apt-get install poppler-utils`)
Requirements on PC B (GPU machine):

- Python 3
- PyTorch (`pip install torch`)
- Sentence Transformers (`pip install -U sentence-transformers`)
- NumPy (`pip install numpy`)
- Annoy (`pip install annoy`)
- Requests (`pip install requests`)
Setup on PC A (Raspberry Pi):

```shell
git clone https://github.com/frrobledo/RAG_paper_search.git
cd RAG_paper_search/server
pip install flask numpy annoy
sudo apt-get install poppler-utils
```
- Place your PDF files in `~/documents/PDF/`.
- Run the script to convert PDFs to text: `./transform_pdf_text.sh`
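For reference, a hypothetical sketch of what `transform_pdf_text.sh` might do, assuming it simply walks `~/documents/PDF/` and calls poppler's `pdftotext` on each file (the actual script in the repo may differ):

```shell
#!/usr/bin/env bash
# Hypothetical sketch (assumption, not the repo's script): convert every PDF
# under PDF_DIR to a sibling .txt file using poppler's pdftotext.
PDF_DIR="${PDF_DIR:-$HOME/documents/PDF}"
for pdf in "$PDF_DIR"/*.pdf; do
    [ -e "$pdf" ] || continue          # glob matched nothing: skip cleanly
    pdftotext "$pdf" "${pdf%.pdf}.txt" # add -layout to preserve column layout
done
```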
- Start the Flask server: `python app.py`

Setup on PC B (GPU machine):

```shell
git clone https://github.com/frrobledo/RAG_paper_search.git
cd RAG_paper_search/processing
pip install -U sentence-transformers numpy annoy requests
```
- Generate Embeddings: `python embedding_generation.py`
- Build Annoy Index: `python vector_search.py`
- Transfer Files to PC A: Copy `embeddings.npy`, `doc_ids.npy`, and `annoy_index.ann` to the Raspberry Pi:

  ```shell
  scp embeddings.npy doc_ids.npy annoy_index.ann pi@<raspberry_pi_ip>:/home/pi/
  ```

- Query the System: `python call_server.py`
- Start the Flask Server on PC A: `cd server && python app.py`
- Run the Client on PC B: `cd processing && python call_server.py`
- Enter Your Query: When prompted, input your search query. The script retrieves and saves the most relevant documents and shows each document's cosine similarity to the query.
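A hedged sketch of the request `call_server.py` plausibly sends to the Flask server on PC A; the endpoint path and JSON field names are assumptions, not the repository's actual API:

```python
# Hypothetical query payload (field names and endpoint are assumptions).
query_embedding = [0.1, 0.2, 0.3]   # in practice: the instructor-large query vector

payload = {
    "embedding": query_embedding,
    "num_results": 5,               # same 'num_results' knob the README exposes
}

# With the server running on PC A, the call itself would look like:
#   import requests
#   resp = requests.post("http://<raspberry_pi_ip>:5000/search", json=payload)
#   resp.raise_for_status()
```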
- Adjusting the Number of Results: In `call_server.py`, change `'num_results': 5` to the number of documents you want to retrieve.
- Changing the Instruction for Embeddings: In `embedding_generation.py` and `call_server.py`, modify the instruction given to the model to better suit your documents.
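Instructor models take an `[instruction, text]` pair per input, which is what the configuration item above changes. A sketch of that format (the instruction strings below are assumptions; tune them to your corpus):

```python
# Sketch of the pair format hkunlp/instructor-large expects. The instruction
# strings are assumed examples, not the ones hard-coded in this repo.
doc_instruction = "Represent the scientific document for retrieval:"
query_instruction = "Represent the question for retrieving supporting documents:"

docs = ["Attention is all you need.", "Deep residual learning for images."]
doc_pairs = [[doc_instruction, d] for d in docs]

# With the model loaded, encoding would look like:
#   from InstructorEmbedding import INSTRUCTOR
#   model = INSTRUCTOR("hkunlp/instructor-large")
#   embeddings = model.encode(doc_pairs)   # one vector per [instruction, text] pair
```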
- Out of Memory Errors: If you encounter memory errors on PC B, reduce `batch_size` in `embedding_generation.py`.
- Connection Issues: Ensure that both PCs are on the same network and that the IP addresses are correctly specified.
- File Not Found Errors: Verify that the documents exist in the specified directories on the Raspberry Pi.
Contributions are welcome! Please open an issue or submit a pull request.