openapi-batch

openapi-batch is a small Python library for running batches of LLM requests reliably.

It provides:

  • Async submission by default (you don’t block while the batch runs)
  • Durable state in SQLite (track progress, resume inspection after restarts)
  • Retries + partial failure handling
  • Native batch support where providers offer it (OpenAI, Gemini)
  • Provider adapters (no gateway required)
  • Callbacks for progress, per-item completion, and job completion

Install

pip install openapi-batch

Provider extras:

pip install openapi-batch[openai]
pip install openapi-batch[gemini]
pip install openapi-batch[openai,gemini]

Quick start

Async batch with callbacks (recommended)

from openapi_batch import BatchClient, JobStatus

def on_progress(job, p):
    # p is store.Progress (emulated) or NativeProgress (native)
    if hasattr(p, "ok"):
        # emulated progress
        print(f"[progress] {p.ok}/{p.total} ok, {p.dead} failed")
    else:
        # native polling progress
        print(f"[progress] status={p.status}, elapsed={p.elapsed_s}s")

def on_item(job, item_id, result):
    print(f"[item done] {item_id}")

def on_complete(job):
    print(f"[done] status={job.status()}")
    results = job.results_dict()
    print("results:", results)

client = BatchClient(
    provider="openai",
    api_key="...",
    default_model="gpt-4o-mini",
)

job = client.map(
    mode="native",  # native | emulated | auto
    items=[
        {"item_id": "a", "input": {"prompt": "Return OK"}},
        {"item_id": "b", "input": {"prompt": "Return YES"}},
    ],
    on_progress=on_progress,
    on_item=on_item,
    on_complete=on_complete,
    callback_executor="thread",  # don’t block the worker
)

print("submitted job:", job.job_id)

# The process continues running while progress is printed.
# If this is a short-lived script, you can optionally wait:
job.wait()

The call returns immediately. Processing happens in the background.


Run callbacks in a thread pool

If your callbacks do I/O (write to DB, publish to queue, HTTP calls), run them in a thread pool:

job = client.map(
    items=items,
    on_progress=on_progress,
    on_item=on_item,
    on_complete=on_complete,
    callback_executor="thread",
    callback_workers=8,
)
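
If the work inside a callback is blocking, the thread pool keeps it off the batch worker. As a purely illustrative example, an on_item callback that persists each result to disk might look like the sketch below; the output directory and the JSON serialization are assumptions, not part of the library:

import json
from pathlib import Path

OUT_DIR = Path("batch_results")  # illustrative output location
OUT_DIR.mkdir(exist_ok=True)

def on_item(job, item_id, result):
    # Runs in the callback thread pool, so blocking file I/O here does not
    # hold up the batch worker. One file per item keeps concurrent callbacks
    # from contending on a shared handle.
    (OUT_DIR / f"{item_id}.json").write_text(json.dumps(result, default=str))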

Blocking mode (useful for scripts/tests)

job = client.map(items=items, async_submit=False)
job.wait()

results = job.results_dict()
print(results)

Concepts

Job

A batch execution with a stable job_id. Stored in SQLite.

job.status()
job.progress()
job.info()
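
For illustration, the same accessors can be used directly on the handle returned by client.map(); what each call returns in the comments below is an assumption based on the quick-start example, not a documented contract:

print("id:      ", job.job_id)      # stable identifier, persisted in SQLite
print("status:  ", job.status())    # current lifecycle state
print("progress:", job.progress())  # e.g. ok/total/dead counts in emulated mode
print("info:    ", job.info())      # job metadata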

Item

One request in the batch, identified by item_id. If you don’t provide it, a deterministic ID is generated.
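
Items use the same shape as in the quick start. In the sketch below (prompts are placeholders), the second item omits item_id, so the library generates a deterministic ID for it:

items = [
    # Explicit item_id: results_dict() will be keyed by "summarize-1".
    {"item_id": "summarize-1", "input": {"prompt": "Summarize the following text."}},
    # No item_id: a deterministic ID is generated for this item.
    {"input": {"prompt": "Reply with a single word."}},
]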

Result mapping

Results are returned as a dict keyed by item_id.

results = job.results_dict()
ok = results["a"]      # ResultOk
err = results["b"]     # ResultErr
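
A sketch of iterating the mapping and branching on the result type; the isinstance check assumes ResultOk is importable from the package alongside BatchClient, which is an assumption mirroring the JobStatus import above:

from openapi_batch import ResultOk  # assumed import location

results = job.results_dict()

for item_id, result in results.items():
    if isinstance(result, ResultOk):
        print(item_id, "ok:", result)
    else:                               # ResultErr: the item failed
        print(item_id, "failed:", result)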

Modes

  • emulated: concurrency-controlled requests (works for any provider adapter)
  • native: provider batch APIs (OpenAI, Gemini)
  • auto: uses native if available, otherwise emulated
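
Because mode is just an argument to client.map (as in the quick start), switching strategies is a one-line change; with auto the library uses the provider's batch API when one exists and falls back to emulation otherwise:

# Native batch API if the provider has one, otherwise the emulated,
# concurrency-controlled path.
job = client.map(mode="auto", items=items)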

Logging

Enable lightweight progress logs:

export OPENAPI_BATCH_LOG=1
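
The same flag can be set from Python for the current process, assuming the library reads the environment variable when the batch runs:

import os

# Equivalent to `export OPENAPI_BATCH_LOG=1`, scoped to this process.
os.environ["OPENAPI_BATCH_LOG"] = "1"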

Providers

Currently included:

  • OpenAI
  • Gemini
  • local_echo (tests)

No gateway required — pass the provider to BatchClient.
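
Switching providers only changes the constructor arguments; the model name below is illustrative, not a recommendation:

client = BatchClient(
    provider="gemini",
    api_key="...",
    default_model="gemini-1.5-flash",  # illustrative model name
)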


Testing

Unit tests:

pytest

Integration tests (real APIs, opt-in, may incur cost):

export OPENAI_API_KEY=...
export GEMINI_API_KEY=...
pytest -m integration
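
As a sketch of a self-contained unit test, the local_echo adapter listed above can exercise the batch path without network access; the constructor arguments passed to it here are placeholders (whether it needs an api_key or model at all is an assumption):

# test_local_echo.py
from openapi_batch import BatchClient

def test_local_echo_roundtrip():
    client = BatchClient(provider="local_echo", api_key="unused", default_model="echo")
    job = client.map(
        items=[{"item_id": "a", "input": {"prompt": "ping"}}],
        async_submit=False,
    )
    job.wait()
    assert "a" in job.results_dict()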

What this library does not try to do

  • Prompt abstraction
  • Workflow orchestration
  • Hiding provider semantics

It focuses only on batch execution, durability, and developer experience.


License

MIT
