A fully offline, privacy-focused voice assistant that runs entirely on your local machine. This project integrates lightweight, high-performance AI models to provide a fluid voice-to-voice chat experience without sending any data to the cloud.
- 100% Offline: No internet connection required after initial setup.
- Low Latency: Optimized for local execution using `llama.cpp` and `vosk`.
- High-Quality TTS: Uses Kokoro for natural-sounding speech synthesis.
- Smart Conversation: Powered by Ministral 3B (or any GGUF model) for intelligent responses.
- Resource Efficient: Designed to run smoothly on consumer hardware (Apple Silicon/CPU).
- Privacy First: Your voice and data never leave your device.
- Language: Python 3.10+
- Speech-to-Text (STT): Vosk (Lightweight, offline speech recognition).
- Large Language Model (LLM): Llama.cpp (Running Ministral 3B GGUF).
- Text-to-Speech (TTS): Kokoro (High-quality offline TTS).
- Audio Handling: `sounddevice`, `soundfile`.
Ensure you have the following installed on your system:
- Python 3.10 or higher.
- Git.
- System Audio Libraries (required for `sounddevice` and `espeak-ng`):
  - macOS (Homebrew): `brew install portaudio espeak-ng`
  - Linux (Ubuntu/Debian): `sudo apt-get install -y libportaudio2 espeak-ng`
1. Clone the repository:

   ```bash
   git clone https://github.com/ZaidK07/Offline-Voice-Chatbot
   cd Offline-Voice-Chatbot
   ```

2. Create and activate a virtual environment:

   ```bash
   python -m venv .venv
   source .venv/bin/activate  # On Windows: .venv\Scripts\activate
   ```

3. Install Python dependencies:

   ```bash
   pip install -r requirements.txt
   ```
You need to download the models manually as they are too large to include in the repository.
A. Speech-to-Text (Vosk)
- Create a `models` directory if it doesn't exist.
- Download the vosk-model-small-en-us-0.15 model from Vosk Models.
- Extract it into `models/` so the path is `models/vosk-model-small-en-us-0.15`.
B. LLM (Ministral 3B)
- Download a GGUF quantized version of Ministral 3B (e.g., `Ministral-3B-Instruct-v0.1.Q4_K_M.gguf`).
- Recommended source: search for "Ministral GGUF" on Hugging Face.
- Place the `.gguf` file directly inside the `models/` directory.
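Once both models are in place, a quick path check can save a confusing first-run error. This is a hypothetical helper (not part of the project's code); it assumes the `models/` layout described above, and only checks that some `.gguf` file exists since the exact filename depends on the quantization you downloaded.

```python
from pathlib import Path

def missing_model_paths(root: str = ".") -> list[str]:
    """Return expected model paths that are absent under the project root."""
    root_path = Path(root)
    missing = []
    vosk_dir = root_path / "models" / "vosk-model-small-en-us-0.15"
    if not vosk_dir.is_dir():
        missing.append(str(vosk_dir))
    # The GGUF filename varies by quantization, so accept any .gguf file.
    if not list((root_path / "models").glob("*.gguf")):
        missing.append(str(root_path / "models" / "*.gguf"))
    return missing

if __name__ == "__main__":
    problems = missing_model_paths()
    print("Models OK" if not problems else f"Missing: {problems}")
```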
Directory structure verification:

```
project_root/
├── main.py
├── speech_to_text.py
├── text_to_speech.py
├── gen_ai_model.py
└── models/
    ├── vosk-model-small-en-us-0.15/           <-- Folder containing Vosk files
    └── Ministral-3B-Instruct-v0.1.Q4_K_M.gguf <-- Your GGUF model file
```
You can tweak settings in the following files:
- `gen_ai_model.py`: change `n_ctx` (context size) or `n_gpu_layers` (GPU offloading).
- `text_to_speech.py`: change the `voice` variable (e.g., `'af_sarah'`, `'am_michael'`).
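For reference, these tunables typically appear in the `Llama` constructor call from `llama-cpp-python`. This is an illustrative config fragment, not the project's exact code; the model path and values are examples to adapt to your hardware.

```python
from llama_cpp import Llama

# Illustrative values only; tune for your machine.
llm = Llama(
    model_path="models/Ministral-3B-Instruct-v0.1.Q4_K_M.gguf",  # your GGUF file
    n_ctx=4096,       # context window in tokens; larger uses more RAM
    n_gpu_layers=-1,  # -1 offloads all layers (Metal on Apple Silicon); 0 = CPU only
)
```

A larger `n_ctx` lets the bot remember more of the conversation at the cost of memory; `n_gpu_layers=0` is the safe fallback if GPU offloading misbehaves.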
- Start the chatbot:

  ```bash
  python main.py
  ```
- Interaction:
- Wait for the initialization message ("I am online...").
- Speak clearly into your microphone.
- The bot will listen, process your request, and speak back.
- Note: The microphone is disabled while the bot is speaking to prevent it from hearing itself.
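The listen-process-speak flow above can be sketched as a single turn function. The callables here are stand-ins for the project's `speech_to_text`, `gen_ai_model`, and `text_to_speech` modules; the names are hypothetical, not the actual API.

```python
def run_turn(listen, generate, speak):
    """One conversational turn: transcribe, respond, play back."""
    text = listen()          # STT: block until an utterance is transcribed
    if not text:
        return None          # silence or noise; the caller loops again
    reply = generate(text)   # LLM produces a response
    speak(reply)             # TTS playback; the mic stays muted meanwhile
    return reply
```

Keeping the microphone muted for the duration of `speak` is what prevents the echo loop mentioned in the note above.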
Since this is an interactive, hardware-dependent project, automated testing is limited.

- Manual test: run `main.py` and verify:
  - Initialization logs appear without error.
  - Microphone input is detected (transcribed text appears on screen).
  - The LLM generates a coherent response.
  - Audio plays back clearly.
Contributions are welcome!
- Fork the repository.
- Create a feature branch (`git checkout -b feature/AmazingFeature`).
- Commit your changes (`git commit -m 'Add some AmazingFeature'`).
- Push to the branch (`git push origin feature/AmazingFeature`).
- Open a Pull Request.
Distributed under the MIT License. See LICENSE for more information.