ClawLayer

A lightweight, intelligent proxy and routing layer for OpenClaw AI agents - delivering instant responses, cost savings, and deep observability into agent-LLM communication for security, development, and troubleshooting.

Purpose

ClawLayer provides a simple YAML-driven configuration to route OpenClaw agent requests intelligently:

Instant responses for greetings and routine queries via semantic matching
Zero-latency tool calls for command execution patterns via quick router (regex)
Multi-stage cascade from fast/cheap to accurate/expensive models
Transparent fallback to full LLM inference when needed
Real-time monitoring with web UI to inspect prompts, context, and responses

Key Benefits:

⚡ Quick Response: Zero-latency routing for commands, ~100ms for semantic matching vs 2-5s LLM inference
💰 Cost Savings: Real-time cost tracking shows actual savings vs always using expensive LLM
🔍 Deep Observability: Inspect full agent-LLM communication (prompts, context, responses) via web UI
🛡️ Security Guardrails: Monitor and analyze all requests/responses for security compliance
🐛 Development & Troubleshooting: Understand OpenClaw's behavior, debug issues, optimize workflows
🎯 Easy Configuration: Everything in YAML with web-based config editor - no code changes needed
🔧 Highly Customizable: Mix embedding and LLM stages, adjust thresholds, add custom routers

See Architecture for detailed system design and performance analysis.

Features

Core Capabilities

Multi-Stage Cascade Routing: Cost-optimized semantic matching with confidence-based fallback
YAML-Driven Configuration: No code changes - configure everything via config.yml
Flexible Provider System: Mix local/remote, embedding/LLM, Ollama/OpenAI providers
Semantic Routing: Embedding-based matching for greetings, summaries, and custom patterns
Fast Regex Routing: Pattern matching for commands and tool execution
Streaming Support: Full SSE streaming for both static and proxied responses

Why ClawLayer?

Lightweight: ~1800 lines total, minimal dependencies, simple architecture
Easy to Configure: Add new routes by editing YAML, no Python required
Highly Customizable:
- Adjust confidence thresholds per stage
- Mix embedding and LLM providers in cascade
- Add custom routers with simple Python interface
- Configure router priority and enable/disable per route
Understand OpenClaw: Web UI provides real-time visibility into how OpenClaw agents work
- See which routers handle which requests
- Inspect full request/response data
- Monitor routing performance and latency
- Perfect for learning, debugging, and optimizing OpenClaw workflows

Quick Start

1. Install

pip install -r requirements.txt

2. Configure

Quick Router Only (Fastest - Zero API Costs)

cp config.quickrouter.yml config.yml

Minimal config for instant responses:

# Quick router only - zero-latency pattern matching
routers:
  handlers:
    priority:
      - quick
    quick:
      enabled: true
      patterns:
        - pattern: Greet the user
          response: Hi
        - pattern: <conversation>.*</conversation>.*summarize
          response: '## Goal\n\n  No user goal provided...'
  semantic:
    priority: []

3. Run

CLI Only:

python run.py -v

With Web UI:

# Install web UI dependencies (first time only)
cd webui && npm install && cd ..

# Start both backend and web UI
./start-dev.sh

# Or start separately:
# Terminal 1: python run.py -v
# Terminal 2: cd webui && npm run dev

Access:

Backend API: http://localhost:11435
Web UI: http://localhost:3000

Stop services:

./stop-dev.sh

That's it! ClawLayer is now routing requests intelligently. See Configuration Guide for detailed setup and Web UI Guide for monitoring features.

4. Configure OpenClaw (Optional)

To use ClawLayer with OpenClaw agents:

Automatic setup:

# Preview what will be added
python run.py --inject-openclaw ~/.openclaw/openclaw.json --dry-run

# Inject ClawLayer provider into OpenClaw config
python run.py --inject-openclaw ~/.openclaw/openclaw.json

Manual setup: Add to your ~/.openclaw/openclaw.json:

{
  "models": {
    "providers": {
      "clawlayer": {
        "baseUrl": "http://localhost:11435/v1",
        "apiKey": "clawlayer-local",
        "api": "openai-completions",
        "models": [{"id": "any", "name": "ClawLayer"}]
      }
    }
  },
  "agents": {
    "defaults": {
      "models": {
        "clawlayer/any": {"alias": "clawlayer"}
      }
    }
  }
}

Then use clawlayer/any as the model in your OpenClaw agents.

Usage

# Development mode (Flask dev server)
python run.py -v

# Production mode (Gunicorn WSGI server - recommended)
./start-prod.sh

# Run with full request logging
python run.py -vv

# Run with streaming chunk logging
python run.py -vvvv

# Run tests
python -m unittest tests.test_clawlayer -v

Cost Tracking

ClawLayer automatically tracks cost savings in real-time:

Handlers (quick/echo/command): $0.00 - zero cost, instant responses
Semantic routers (embedding-based): ~$0.02/1M tokens - cheap similarity matching
LLM fallback: ~$0.50/1M tokens - full inference when needed

View cost savings via:

Web UI: http://localhost:3000 - see total savings and request distribution
API: GET /api/stats - returns cost_saved and distribution percentages

Example stats response:

{
  "requests": 1000,
  "cost_saved": 0.0234,
  "distribution": {
    "handlers_pct": 75.0,
    "semantic_pct": 20.0,
    "llm_pct": 5.0
  }
}

Production vs Development

Development mode (python run.py):

Uses Flask's built-in development server (werkzeug)
Good for testing and debugging
⚠️ Has unicode encoding limitations with emojis in responses
Not suitable for production

Production mode (./start-prod.sh):

Uses Gunicorn WSGI server
Handles unicode/emoji characters properly
Better performance and stability
Recommended for production use

API

OpenAI-compatible endpoints:

GET /v1/models
POST /v1/chat/completions

Supports both streaming and non-streaming modes.

Documentation

Architecture - System design, router priority, and performance analysis
Configuration Guide - Detailed configuration and customization examples
Web UI Guide - Web interface for monitoring and observability
Handlers - Zero-latency pattern-based routing for instant responses
Cascade Patterns - Advanced multi-stage routing configurations
Testing Guide - Python and Node.js test coverage, running tests
File Structure - Project layout and component organization

Related Projects

ClawRouter - Advanced routing with load balancing, fallback chains, and cost optimization for production deployments

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
.github/workflows		.github/workflows
clawlayer		clawlayer
docs		docs
examples		examples
tests		tests
webui		webui
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
REFACTORING_SUMMARY.md		REFACTORING_SUMMARY.md
config.example.yml		config.example.yml
config.quickrouter.yml		config.quickrouter.yml
gunicorn.conf.py		gunicorn.conf.py
requirements.txt		requirements.txt
run.py		run.py
start-dev.sh		start-dev.sh
start-prod.sh		start-prod.sh
stop-dev.sh		stop-dev.sh
stop-prod.sh		stop-prod.sh
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ClawLayer

Purpose

Features

Core Capabilities

Why ClawLayer?

Quick Start

1. Install

2. Configure

3. Run

4. Configure OpenClaw (Optional)

Usage

Cost Tracking

Production vs Development

API

Documentation

Related Projects

About

Uh oh!

Contributors

Uh oh!

Languages

neoalienson/ClawLayer

Folders and files

Latest commit

History

Repository files navigation

ClawLayer

Purpose

Features

Core Capabilities

Why ClawLayer?

Quick Start

1. Install

2. Configure

3. Run

4. Configure OpenClaw (Optional)

Usage

Cost Tracking

Production vs Development

API

Documentation

Related Projects

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages