SELENE

Privacy-first menopause assistant: local symptom tracking, RAG-backed chat, and clinician-style summaries powered by MedGemma.

Overview

Runs fully on-device: user data stays under data/user_data/; LLM calls target a local Ollama endpoint.
Core flows: Daily Attune logging, chat with RAG + safety guardrails, clinical insight reports with PDF export.

Key Features

Daily Attune: capture rest/internal weather/clarity + notes; validated saves with backups.
Chat: contextualized queries, Chroma RAG, past-session recall, streaming MedGemma responses.
Clinical summary: deterministic stats/patterns/risk + single MedGemma call; PDF export via xhtml2pdf.
Local knowledge base: Chroma collections (medical_docs, chat_history) with SentenceTransformer embeddings.
Safety: deterministic risk flags, conservative prompts, low temperature, offline defaults.

Architecture (brief)

RAG + LLM orchestration: med_logic.py
Context building: context_builder.py (chat) and context_builder_multi_agent.py (reports)
Deterministic analysis: deterministic_analysis.py
Persistence: data_manager.py, chat_db.py
Insights reporting: insights_generator.py, UI in views/clinical.py
Streamlit views: views/home.py, views/pulse.py, views/chat.py, views/clinical.py
Configuration: settings.py
Full details: docs/technical_reference.md

Prerequisites

Python 3.11+
Ollama running locally with model MedAIBase/MedGemma1.5:4b pulled
Basic build deps for scientific stack (numpy/scipy/pandas) and xhtml2pdf; install via requirements.txt

Quick Start

# 1. Clone the repository
git clone https://github.com/innacampo/selene.git
cd selene/selene

# 2. Create venv and install
python3 -m venv med_env
source med_env/bin/activate
pip install -e ".[dev]"

# 3. Pull model in Ollama (once)
ollama pull MedAIBase/MedGemma1.5:4b

# 4. Launch app
streamlit run app.py

Or use the setup script: ./scripts/setup_project.sh

Repository Structure

selene/
├── app.py                    # Thin entry point (from selene.ui.app import main)
├── src/selene/               # Installable Python package
│   ├── core/                 # Business logic (med_logic, context, analysis)
│   ├── storage/              # Persistence (data_manager, chat_db)
│   └── ui/                   # Streamlit UI & views
├── tests/                    # Test suite (pytest)
├── scripts/                  # Utility scripts (setup, KB management)
├── docs/                     # Documentation and guides
├── examples/                 # Example code and usage demonstrations
├── data/                     # Data directories (mostly gitignored)
├── pyproject.toml            # Build config & metadata
└── requirements.txt          # Dependencies

See DIRECTORY_STRUCTURE.md for the full tree.

Usage

Daily Attune: enter Rest/Internal Weather/Clarity + notes → saves to pulse_history.json and invalidates caches.
Chat: ask questions; system contextualizes follow-ups, retrieves KB + prior chats, streams MedGemma output with sources.
Clinical Summary: pick a date range; generates report if ≥3 pulse entries and completeness ≥0.4; download PDF.

Data & Storage (local)

Profile: data/user_data/user_profile.json
Pulse history: data/user_data/pulse_history.json (+ backups in data/user_data/backups/)
Chroma DB: data/user_data/user_med_db (medical_docs, chat_history)
Reports (optional): data/reports/
Logs (if enabled): ../logs/selene.log (rotating)

Configuration Highlights

Paths, model names, cache TTLs in settings.py: RAG_TOP_K=2, contextualize cache 300s, RAG cache 600s, user context cache 180s.
Offline/telemetry disabled by default via envs set in settings (TRANSFORMERS_OFFLINE, HF_*_OFFLINE, CHROMA_TELEMETRY=False).
Logging defaults to DEBUG; file logging enabled by default (toggle LOG_TO_FILE).

Knowledge Base Management

Chroma collections live under data/user_data/user_med_db; embeddings via SentenceTransformer all-MiniLM-L6-v2.
Import/export and collection maintenance via scripts/update_kb_chroma.py (keeps collection IDs stable).

Safety & Guardrails

Deterministic risk scoring (recent 7–14 days) flags severe/rapid changes and concerning notes; injected into chat/report prompts for conservative language and referrals.
MedGemma calls use low temperature (≤0.2) and stop tokens; evidence sections include source headers.
Contextualization cache reduces ambiguous follow-ups; RAG returns empty-safe outputs if DB is empty.

Testing

# Run the test suite
python -m pytest tests/ -v

# With coverage
python -m pytest tests/ -v --cov=src/selene --cov-report=term-missing

Coverage focus (see tests/README.md):

test_deterministic_analysis.py - symptom mapping/stats/patterns/risk formatting
test_context_builder.py - profile/pulse context, notes/chat aggregation, completeness scoring
test_med_logic_cache.py - TTL cache behavior, eviction, stats, and cache invalidation helpers

Contributing

Please read CONTRIBUTING.md for guidelines on how to contribute.

License

This project is licensed under CC BY 4.0 - see LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SELENE

Overview

Key Features

Architecture (brief)

Prerequisites

Quick Start

Repository Structure

Usage

Data & Storage (local)

Configuration Highlights

Knowledge Base Management

Safety & Guardrails

Testing

Contributing

License

More Documentation

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github		.github
.streamlit		.streamlit
data		data
docs		docs
examples		examples
output		output
scripts		scripts
src/selene		src/selene
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DIRECTORY_STRUCTURE.md		DIRECTORY_STRUCTURE.md
LICENSE		LICENSE
QUICKSTART.md		QUICKSTART.md
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
app.py		app.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Uh oh!

License

innacampo/selene

Folders and files

Latest commit

History

Repository files navigation

SELENE

Overview

Key Features

Architecture (brief)

Prerequisites

Quick Start

Repository Structure

Usage

Data & Storage (local)

Configuration Highlights

Knowledge Base Management

Safety & Guardrails

Testing

Contributing

License

More Documentation

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages