rbright/koko

koko

Local CLI text-to-speech using Kokoro-82M for on-demand inference.

  • No daemon/background service.
  • Run directly as koko.
  • Default voice: af_heart.
  • Designed for automation workflows (including agent-completion voice notifications).
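
For the agent-notification use case, a wrapper along these lines can speak the result of any long-running command. This is an illustrative sketch, not part of koko itself: the notify_done name is made up, and the KOKO_CMD indirection exists only so the snippet can be dry-run with a stand-in like echo.

```shell
# Sketch: speak a notification when a wrapped command finishes.
# KOKO_CMD defaults to `koko`; override it (e.g. KOKO_CMD=echo) to dry-run.
notify_done() {
  "$@"
  status=$?
  if [ "$status" -eq 0 ]; then
    ${KOKO_CMD:-koko} "Task finished successfully."
  else
    ${KOKO_CMD:-koko} "Task failed with status $status."
  fi
  return "$status"
}

# Usage: notify_done make -j8
```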

Features

  • koko "Your message" default speak flow (sag-style UX)
  • -v/--voice flag to pick a voice (af_heart by default)
  • koko voices and koko -v ? voice discovery
  • Message input from args, file, or stdin
  • WAV output via --output
  • Optional playback toggle via --play/--no-play
  • Optional --summarize pre-vocalization filter (OpenAI-compatible API; local llama.cpp friendly)
  • Optional JSONC config file at ~/.config/koko/config.jsonc for defaults (including LLM settings)

Requirements

  • Python 3.13.11
  • A local audio player for playback (auto-detected):
    • Linux: pw-play, ffplay, aplay, or paplay
    • macOS: afplay

koko is configured for local-only execution by default. It will not make network requests unless you explicitly pass --no-offline.

Quickstart (development)

uv sync --dev
export KOKO_MODEL_DIR=/path/to/kokoro-82m
uv run koko "Hello from koko"

CLI Usage

koko [flags] [message...]

Common examples

# Default voice (af_heart)
koko "Agent task finished successfully."

# Explicit voice
koko --voice bf_emma "Build complete."

# Save output to WAV (no playback)
koko --no-play --output /tmp/done.wav "Pipeline completed."

# Read text from stdin
echo "Deployment completed." | koko

# Read text from file
koko --input-file ./message.txt

# Summarize noisy structured text before speech (local llama.cpp)
koko --summarize --llm-model Mistral-7B-Instruct-v0.3-Q6_K "### Build\n- ✅ tests pass\n- changed 14 files"

# List voices
koko voices
koko -v ?

Local model setup (required by default)

koko runs in offline mode by default. Place model assets on disk and point koko to them.

Download model assets (one-time):

just download-model

Or download only specific voices:

just download-model voices="af_heart,bf_emma"

Expected local model layout:

/path/to/kokoro-82m/
  config.json
  kokoro-v1_0.pth
  voices/
    af_heart.pt
    ...

Use it directly:

koko --model-dir /path/to/kokoro-82m --voice af_heart "Task complete"

You can also set a default path via environment variable:

export KOKO_MODEL_DIR=/path/to/kokoro-82m
koko "Task complete"

koko fails fast if required local assets are missing.
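
Before wiring koko into automation, it can help to confirm that layout is actually present. A minimal sketch (the check_koko_assets function name is an invention, not part of koko; the file names follow the layout shown above):

```shell
# Sketch: verify the expected Kokoro model layout on disk.
check_koko_assets() {
  dir=$1
  [ -f "$dir/config.json" ] || { echo "missing config.json"; return 1; }
  [ -f "$dir/kokoro-v1_0.pth" ] || { echo "missing kokoro-v1_0.pth"; return 1; }
  # At least one voice pack must exist; the glob stays literal if none match.
  set -- "$dir"/voices/*.pt
  [ -f "$1" ] || { echo "missing voices/*.pt"; return 1; }
  echo "model assets look complete"
}

# Usage: check_koko_assets "$KOKO_MODEL_DIR" && koko "Task complete"
```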

Key flags

  • -v, --voice — voice ID (default: af_heart)
  • -l, --lang-code — explicit language code (a,b,e,f,h,i,j,p,z)
  • -s, --speed — speech speed multiplier (> 0)
  • -o, --output — write WAV output file
  • -f, --input-file — input text file (- for stdin)
  • --device — auto (default), cpu, or cuda
  • --play/--no-play — enable or disable local playback
  • --summarize/--no-summarize — summarize input text before TTS (default: off)
  • --llm-base-url — OpenAI-compatible API URL (default: http://127.0.0.1:11434/v1)
  • --llm-model — summarization model id (default: Mistral-7B-Instruct-v0.3-Q6_K)
  • --llm-api-key — optional API key for summarization endpoint
  • --llm-timeout-seconds — summarization request timeout (> 0)
  • --llm-max-input-chars — maximum input size sent to LLM (>= 256)
  • --repo-id — Hugging Face model repo (default: hexgrad/Kokoro-82M)
  • --model-dir — local model asset directory (config.json, kokoro-v1_0.pth, voices/*.pt)
  • --offline/--no-offline — local-only mode toggle (default: --offline)

Environment defaults (via pydantic-settings)

You can set defaults with KOKO_* environment variables:

  • KOKO_REPO_ID
  • KOKO_DEFAULT_VOICE
  • KOKO_OFFLINE
  • KOKO_MODEL_DIR
  • KOKO_SUMMARIZE
  • KOKO_LLM_BASE_URL
  • KOKO_LLM_MODEL
  • KOKO_LLM_API_KEY
  • KOKO_LLM_TIMEOUT_SECONDS
  • KOKO_LLM_MAX_INPUT_CHARS
  • KOKO_CONFIG_FILE (optional override for config file path)

CLI flags still take precedence over environment defaults.
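
A typical way to use these is a shell profile fragment, so interactive invocations only need the message. The values below are placeholders, and the invocations are shown as comments:

```shell
# Hypothetical shell profile fragment setting koko defaults.
export KOKO_MODEL_DIR=/path/to/kokoro-82m
export KOKO_DEFAULT_VOICE=bf_emma
export KOKO_OFFLINE=1

# Subsequent invocations pick up the defaults:
#   koko "Nightly build finished"        # speaks with bf_emma
#   koko --voice af_heart "Quick note"   # the flag overrides KOKO_DEFAULT_VOICE
```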

Configuration file (~/.config/koko/config.jsonc)

koko also reads optional JSONC defaults from ~/.config/koko/config.jsonc. This makes it easy to manage koko settings declaratively from NixOS/Home Manager dotfiles.

  • Supports // and /* ... */ comments.
  • Supports trailing commas.
  • Use KOKO_CONFIG_FILE to point at a different path.

Example:

{
  // Keep summarize on by default
  "summarize": true,

  // Top-level keys map to settings fields
  "llm_model": "Mistral-7B-Instruct-v0.3-Q6_K",
  "llm_base_url": "http://127.0.0.1:11434/v1",

  // Optional nested section also works
  "llm": {
    "timeout_seconds": 15,
    "max_input_chars": 6000
  }
}

Effective precedence is: CLI flags > environment variables > config file > built-in defaults.

When --summarize is enabled and summarization fails, koko logs an error and exits without generating or playing audio.

Summarization behavior

  • --summarize runs before Kokoro synthesis.
  • In offline mode, summarization requires a local --llm-base-url (for example http://127.0.0.1:11434/v1).
  • Prompting targets a single short spoken summary sentence (a hard maximum of one sentence).
  • koko keeps post-processing minimal and relies on prompt instructions for conversational phrasing.
  • If summarization fails, koko exits non-zero and does not produce audio (no playback, no WAV write).
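
Because koko aborts rather than speaking unsummarized text, automation that must always produce audio can fall back explicitly. A sketch (the speak_summarized name is made up; the KOKO_CMD indirection exists only so the snippet can be dry-run with echo):

```shell
# Sketch: retry without summarization if the LLM step fails.
speak_summarized() {
  msg=$1
  ${KOKO_CMD:-koko} --summarize "$msg" || ${KOKO_CMD:-koko} --no-summarize "$msg"
}

# Usage: speak_summarized "Build finished: 14 files changed, tests pass"
```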

Tooling

This repository uses:

  • uv for dependency/runtime management
  • ruff for Python linting/formatting
  • ty for type checking
  • pytest for tests
  • prek for pre-commit hooks
  • just for task orchestration
  • Nix linting parity with nixos-config: statix, deadnix, and nixfmt --check

Run local checks:

just fmt
just fmt-nix
just lint
just test
just smoke-e2e
just smoke-llm
# optional local hook simulation:
just precommit-run
just prepush-run
# or specify a custom asset path (positional args):
just smoke-e2e /path/to/kokoro-82m /tmp/koko-smoke.wav "Koko local inference smoke test"

NixOS / Nix installation

This repo ships a flake package that exposes a koko binary. It also vendors a local pydantic-ai dependency chain in flake.nix so --summarize works in Nix-installed builds.

Run directly from the repo

nix run .#koko -- "Hello from nix"

Install to your user profile

nix profile install .#koko
koko "Installed globally via nix profile"

Add to NixOS system packages (flake-based config)

{
  inputs.koko.url = "path:/home/rbright/Projects/koko";

  outputs = { self, nixpkgs, koko, ... }: {
    nixosConfigurations.my-host = nixpkgs.lib.nixosSystem {
      system = "x86_64-linux";
      modules = [
        ({ pkgs, ... }: {
          environment.systemPackages = [ koko.packages.${pkgs.system}.koko ];
        })
      ];
    };
  };
}

Troubleshooting

  • No audio player found
    • Install one of pw-play, ffplay, aplay, or paplay (Linux), or afplay (macOS); or use --no-play --output file.wav.
  • Offline mode fails with missing assets
    • Ensure --model-dir (or KOKO_MODEL_DIR) contains config.json, kokoro-v1_0.pth, and voices/*.pt.
  • You still see Hugging Face requests/warnings
    • Ensure you are not passing --no-offline and that local assets are valid (config.json, kokoro-v1_0.pth, voices/*.pt).
  • CUDA errors
    • Force CPU mode: koko --device cpu "hello".
  • No input text provided
    • Provide message args, --input-file, or pipe stdin.
  • Summarization failed
    • Check --llm-base-url and --llm-model against your local llama.cpp server.
    • koko intentionally aborts synthesis when --summarize fails.
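
When diagnosing the "no audio player found" case, a quick loop over the players listed above shows which are on PATH (afplay applies on macOS only):

```shell
# Sketch: report which supported audio players are installed.
for p in pw-play ffplay aplay paplay afplay; do
  if command -v "$p" >/dev/null 2>&1; then
    echo "found: $p"
  fi
done
```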
