kagent-dev/mockllm

Mock LLM Server

A simple mock LLM server for end-to-end testing. Provides request/response mocking for OpenAI and Anthropic APIs using their official SDK types.

Features

  • OpenAI Chat Completions API (streaming and non-streaming)
  • OpenAI Responses API (streaming and non-streaming, including function outputs)
  • Anthropic Messages API (non-streaming)
  • Exact and contains matching
  • Tool/function calls support
  • JSON configuration files

Architecture

  • Server: HTTP server with Gorilla mux router
  • Providers: Separate handlers for OpenAI and Anthropic
  • Matching: Linear search through mocks with exact/contains matching
  • SDK Integration: Uses official OpenAI and Anthropic SDK types directly

Key Types

The current implementation uses these core types:

Configuration

  • Config: Root configuration containing arrays of OpenAI and Anthropic mocks
  • OpenAIMock and OpenAIResponseMock: Map OpenAI requests to responses using official SDK types
  • AnthropicMock: Maps Anthropic requests to responses using official SDK types

Matching

  • MatchType: Enum for matching strategies (exact, contains)
  • OpenAIRequestMatch and OpenAIResponseRequestMatch: Define how to match OpenAI requests (match type + message)
  • AnthropicRequestMatch: Defines how to match Anthropic requests (match type + message)

API Coverage

OpenAI Chat Completions

  • Endpoint: POST /v1/chat/completions
  • Auth: Authorization: Bearer <token> (presence check only)
  • Request: openai.ChatCompletionNewParams
  • Response: openai.ChatCompletion (streaming: openai.ChatCompletionChunk)
  • Matching: Exact or contains on last message

OpenAI Responses API

  • Endpoint: POST /v1/responses
  • Auth: Authorization: Bearer <token> (presence check only)
  • Request: responses.ResponseNewParams
  • Response: responses.Response
  • Matching: Exact or contains on input field
  • Features: Supports text output and function call outputs

Anthropic Messages API

  • Endpoint: POST /v1/messages
  • Auth: x-api-key (presence check only)
  • Headers: anthropic-version required
  • Request: anthropic.MessageNewParams
  • Response: anthropic.Message
  • Matching: Exact or contains on last message

Configuration

Go Structs

config := mockllm.Config{
    OpenAI: []mockllm.OpenAIMock{
        {
            Name: "simple-response",
            Match: mockllm.OpenAIRequestMatch{
                MatchType: mockllm.MatchTypeExact,
                Message: /* openai.ChatCompletionMessageParamUnion */,
            },
            Response: /* openai.ChatCompletion */,
        },
    },
    OpenAIResponse: []mockllm.OpenAIResponseMock{
        {
            Name: "haiku-response",
            Match: mockllm.OpenAIResponseRequestMatch{
                MatchType: mockllm.MatchTypeContains,
                Input: /* responses.ResponseNewParamsInputUnion */,
            },
            Response: /* responses.Response */,
        },
    },
    Anthropic: []mockllm.AnthropicMock{/* ... */},
}

JSON Files

{
  "openai": [
    {
      "name": "initial_request",
      "match": {
        "match_type": "exact",
        "message" : {
          "content": "List all nodes in the cluster",
          "role": "user"
        }
      },
      "response": {
        "id": "chatcmpl-1",
        "object": "chat.completion",
        "created": 1677652288,
        "model": "gpt-4.1-mini",
        "choices": [
          {
            "index": 0,
            "role": "assistant",
            "message": {
              "content": "",
              "tool_calls": [
                ...
              ]
            },
            "finish_reason": "tool_calls"
          }
        ]
      }
    },
    {
      "name": "k8s_get_resources_response",
      "match": {
        "match_type": "contains",
        "message" : {
          "content": "kagent-control-plane",
          "role": "tool",
          "tool_call_id": "call_1"
        }
      },
      "response": {
        "id": "call_1",
        "object": "chat.completion.tool_message",
        "created": 1677652288,
        "model": "gpt-4.1-mini",
        "choices": [
          ...
        ]
      }
    }
  ],
  "openai_response": [
    /* ... */
  ],
  "anthropic": [
    /* ... */
  ]
}

Matching Algorithm

Simple linear search through mocks:

  1. Parse incoming request into appropriate SDK type
  2. Iterate through provider-specific mocks in order
  3. For each mock, check if the match criteria are met:
    • Exact: JSON comparison of the last message
    • Contains: String contains check on message content (OpenAI only)
  4. Return the response from the first matching mock
  5. Return 404 if no match found

Response Types

  • Non-streaming: JSON responses using SDK types
  • Streaming: Server-Sent Events (SSE) for Chat Completions and Responses API
  • Uses official SDK response types directly

Usage

config := mockllm.Config{/* mocks */}
server := mockllm.NewServer(config)
baseURL, err := server.Start(context.Background())
if err != nil {
    log.Fatal(err)
}
defer server.Stop(context.Background())

// Use baseURL for API calls in tests
client := openai.NewClient(
    option.WithBaseURL(baseURL+"/v1/"),
    option.WithAPIKey("test-key"),
)

Project Structure

  • server.go — HTTP server, routing, lifecycle
  • types.go — Configuration types
  • openai.go — OpenAI handler (Chat Completions + Responses)
  • anthropic.go — Anthropic handler
  • server_test.go — Integration tests
  • testdata/ — Test fixtures

Dependencies

  • github.com/openai/openai-go/v3
  • github.com/anthropics/anthropic-sdk-go
  • github.com/gorilla/mux

Limitations

  • Simple matching only (exact/contains on last message/input)
  • Does not mock hosted tool calls (e.g. OpenAI file search, code execution), reasoning, or MCP calls
  • No stateful conversation tracking
  • No latency simulation
  • No error injection
