Isschat - Enterprise RAG chatbot

A chatbot that provides semantic search and conversational AI capabilities for Confluence knowledge bases using advanced RAG (Retrieval-Augmented Generation) technology.

Features

Core RAG capabilities

Document ingestion: Automated Confluence space crawling and content extraction
Hierarchical chunking: Structure-preserving document segmentation for optimal context retrieval
Weaviate integration: Fast vector search with cosine similarity and HNSW indexing
Semantic query processing: Query reformulation and coreference resolution using LLM
Flexible data handling: Adaptive chunking strategies and support for multiple data sources

Chatbot intelligence

Persistent history: Session-aware conversations with memory across interactions
Multi-turn dialogue: Natural conversation flow with context preservation
Multilingual support: Optimized for French and English enterprise use cases
Response generation: Coherent answers synthesized from retrieved knowledge

User interfaces

Streamlit web app: Interactive chat interface
Streamlit Evaluation dashboard: Multi-category testing (retrieval, generation, business value, robustness)
Command line interface (CLI): Complete system management and querying capabilities

Enterprise features

Azure AD authentication: Secure access with enterprise domain validation
Cloud storage integration: Azure Blob Storage support for scalable deployments
Secret management: Azure Key Vault integration for secure credential handling
Environment support: Configurable settings for development, staging, and production
Feedback integration: User input collection for continuous model improvement

Monitoring & analytics

Evaluation dashboard: Multi-category testing (retrieval, generation, business value, robustness)
Performance dashboard: Real-time system metrics and usage analytics
Admin dashboard (in development): Backend management and monitoring tools
CI/CD support: Integrated testing pipelines and automated deployment workflows
Comprehensive logging: Detailed system activity tracking and debugging support

Installation

Prerequisites

Python 3.12+

Install uv package manager:

curl -LsSf https://astral.sh/uv/install.sh | sh

Configuration

Clone the repository

git clone https://github.com/Isskar/Isschat.git
cd Isschat

Install dependencies
```
uv sync
```

Configure environment variables Copy .env.example to .env file at root with:

# Required - Confluence Configuration
CONFLUENCE_API_KEY=your_confluence_api_key
CONFLUENCE_SPACE_NAME=https://your_company.atlassian.net
CONFLUENCE_EMAIL_ADDRESS=firstname.lastname@your_company.com

# Required - LLM Configuration
OPENROUTER_API_KEY=your_openrouter_api_key

# Optional - Advanced Configuration
LLM_MODEL=google/gemini-2.5-flash-lite-preview-06-17
EMBEDDINGS_MODEL=intfloat/multilingual-e5-small
CHUNK_SIZE=1000
SEARCH_K=3

# Optional - Semantic Features
USE_SEMANTIC_FEATURES=true
SEMANTIC_RERANKING_ENABLED=true

# Optional - Azure Integration (for production)
USE_AZURE_STORAGE=false
AZURE_STORAGE_ACCOUNT=your_storage_account
AZURE_BLOB_CONTAINER_NAME=your_container
KEY_VAULT_URL=https://your-keyvault.vault.azure.net/

Get your API keys from:

Confluence API: https://id.atlassian.com/manage-profile/security/api-tokens
OpenRouter API: https://openrouter.ai/

Launch

Web interface

Install dependencies
```
uv sync
```
Launch Streamlit app
```
uv run streamlit run src/webapp/app.py
```
Ask your question to Isschat

Command line interface (CLI)

Isschat provides a CLI tool for managing and querying your knowledge base:

Available commands

Status check: Check system components and configuration

uv run -m src.cli.main status [--verbose] [--component config|ingestion|rag|all]

Data ingestion: Build or update the vector database from Confluence

uv run -m src.cli.main ingest [--source confluence] [--force-rebuild] [--verbose]

Interactive chat: Start a chat session without the web interface
```
uv run -m src.cli.main chat [--user-id cli_user]
```
Direct query: Query the vector database with detailed results
```
uv run -m src.cli.main query -q "your question" [options]
```

Query command options

-q, --query: Your search query (required)
-k, --top-k: Number of chunks to retrieve (default: 5)
-s, --score-threshold: Minimum similarity score (default: 0.0)
-v, --verbose: Show detailed chunk information
--show-metadata: Display document metadata
--show-content: Display chunk content (default: true)
--show-stats: Display statistics about sources and scores
--no-llm: Skip LLM generation and only show retrieved chunks

Example usage

# Check system status and configuration
uv run -m src.cli.main status --verbose

# Ingest data from Confluence
uv run -m src.cli.main ingest --source confluence --verbose

# Start interactive chat session
uv run -m src.cli.main chat

# Query with detailed information
uv run -m src.cli.main query -q "How to configure authentication?" -k 3 --show-metadata --show-stats

# Query without LLM generation (retrieval only)
uv run -m src.cli.main query -q "project management" --no-llm --show-stats

Evaluation system

Run comprehensive RAG evaluation:

# View evaluation dashboard
uv run streamlit run rag_evaluation/evaluation_dashboard.py

# Run all evaluation categories
uv run rag_evaluation/run_evaluation.py

# Run specific evaluation category
uv run rag_evaluation/run_evaluation.py --category retrieval

Architecture

The system is built with a modular, enterprise-grade architecture supporting both local and cloud deployment:

Isschat/
├── src/
│   ├── cli/                    # Command-line interface
│   │   ├── commands/          # CLI commands (status, ingest, chat, query)
│   │   └── main.py            # CLI entry point
│   ├── config/                # Configuration management
│   │   ├── settings.py        # Main configuration with environment support
│   │   ├── secrets.py         # Secret management (Azure Key Vault)
│   │   └── keyvault.py        # Azure Key Vault integration
│   ├── core/                  # Core abstractions and interfaces
│   │   ├── documents.py       # Document models
│   │   ├── exceptions.py      # Custom exceptions
│   │   └── interfaces.py      # Abstract interfaces
│   ├── embeddings/            # Embedding service
│   │   ├── models.py          # Embedding models
│   │   └── service.py         # Embedding service implementation
│   ├── ingestion/             # Data ingestion pipeline
│   │   ├── base_pipeline.py   # Abstract ingestion framework
│   │   ├── confluence_pipeline.py # Confluence-specific ingestion
│   │   ├── connectors/        # Data source connectors
│   │   └── processors/        # Document processing (chunking, filtering)
│   ├── rag/                   # RAG pipeline implementation
│   │   ├── pipeline.py        # Standard RAG pipeline
│   │   ├── semantic_pipeline.py # Semantic-enhanced RAG pipeline
│   │   ├── reformulation_service.py # LLM-based query reformulation
│   │   └── tools/             # RAG tools (retrieval, generation)
│   ├── storage/               # Storage abstraction
│   │   ├── storage_factory.py # Storage factory (local/Azure)
│   │   ├── azure_storage.py   # Azure Blob Storage
│   │   └── local_storage.py   # Local file storage
│   ├── vectordb/              # Vector database abstraction
│   │   ├── interface.py       # Vector database interface
│   │   ├── weaviate_client.py # Weaviate implementation
│   │   └── factory.py         # Vector database factory
│   └── webapp/                # Web application
│       ├── app.py             # Main Streamlit application
│       ├── auth/              # Authentication (Azure AD)
│       ├── components/        # UI components
│       └── pages/             # Multi-page application
├── rag_evaluation/            # Comprehensive evaluation framework
│   ├── core/                  # Evaluation core (LLM judge, base evaluator)
│   ├── evaluators/            # Specialized evaluators
│   ├── config/                # Evaluation configuration and test datasets
│   └── evaluation_dashboard.py # Evaluation dashboard
├── tests/                     # Test suite
├── .env.example              # Configuration template
├── pyproject.toml            # Project configuration (uv package manager)
├── Dockerfile                # Container deployment
└── README.md                 # This documentation

Production deployment

Azure cloud deployment

For production deployment with Azure integration:

# Azure Storage Configuration
USE_AZURE_STORAGE=true
AZURE_STORAGE_ACCOUNT=your_storage_account_name
AZURE_BLOB_CONTAINER_NAME=your_container_name

# Azure Key Vault for Secret Management
KEY_VAULT_URL=https://your-keyvault.vault.azure.net/

# Azure AD Authentication (for web app)
AZURE_CLIENT_ID=your_azure_app_client_id
AZURE_CLIENT_SECRET=your_azure_app_client_secret
AZURE_TENANT_ID=your_azure_tenant_id

Docker deployment

Build and run with Docker:

# Build the container
docker build -t isschat .

# Run with environment variables
docker run -d \
  --name isschat \
  -p 8501:8501 \
  --env-file .env \
  isschat

# Run with volume mounting for local data
docker run -d \
  --name isschat \
  -p 8501:8501 \
  -v $(pwd)/data:/app/data \
  --env-file .env \
  isschat

Local development

For local development, leave Azure settings disabled:

USE_AZURE_STORAGE=false

Testing

Run the test suite:

# Install test dependencies
uv sync --extra test

# Run tests
uv run pytest tests/ -v

# Run tests with coverage
uv run pytest tests/ --cov=src --cov-report=html

License

This project is licensed under the MIT License - see the LICENSE file for details.

Developed by Nicolas Lambropoulos

Name		Name	Last commit message	Last commit date
Latest commit History 501 Commits
.dvc		.dvc
.github/workflows		.github/workflows
.streamlit		.streamlit
Images		Images
rag_evaluation		rag_evaluation
src		src
tests		tests
.dockerignore		.dockerignore
.dvcignore		.dvcignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
data_flow.mmd		data_flow.mmd
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Isschat - Enterprise RAG chatbot

Table of contents

Features

Core RAG capabilities

Chatbot intelligence

User interfaces

Enterprise features

Monitoring & analytics

Installation

Prerequisites

Configuration

Launch

Web interface

Command line interface (CLI)

Available commands

Query command options

Example usage

Evaluation system

Architecture

Production deployment

Azure cloud deployment

Docker deployment

Local development

Testing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

Isskar/Isschat

Folders and files

Latest commit

History

Repository files navigation

Isschat - Enterprise RAG chatbot

Table of contents

Features

Core RAG capabilities

Chatbot intelligence

User interfaces

Enterprise features

Monitoring & analytics

Installation

Prerequisites

Configuration

Launch

Web interface

Command line interface (CLI)

Available commands

Query command options

Example usage

Evaluation system

Architecture

Production deployment

Azure cloud deployment

Docker deployment

Local development

Testing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages