@Yash3561
Why are these changes needed?

The current LLMCallEvent and LLMStreamEndEvent capture token counts but lack the temporal telemetry required for production-grade agent monitoring. To optimize agentic workflows and enforce Service Level Agreements (SLAs), developers need to measure:

  1. TTFT (Time To First Token): Critical for evaluating user experience in streaming agents.
  2. TPS (Tokens Per Second): Essential for benchmarking throughput across different inference providers.
  3. End-to-End Latency: Required to identify bottlenecks in complex multi-agent orchestration loops.
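The three metrics above can all be derived from a few time.perf_counter() timestamps. A minimal sketch of the arithmetic (function and variable names are illustrative, not taken from the PR):

```python
import time


def compute_metrics(start: float, first_token_at: float, end: float,
                    completion_tokens: int) -> dict:
    """Derive TTFT, TPS, and end-to-end latency from perf_counter() timestamps.

    `start` is taken just before the request is sent, `first_token_at` when
    the first content chunk arrives, and `end` when the response completes.
    """
    latency_ms = (end - start) * 1000.0
    ttft_ms = (first_token_at - start) * 1000.0
    generation_s = end - first_token_at
    # Guard against division by zero for instantaneous (e.g. cached) responses.
    tokens_per_second = (
        completion_tokens / generation_s if generation_s > 0 else None
    )
    return {
        "latency_ms": latency_ms,
        "ttft_ms": ttft_ms,
        "tokens_per_second": tokens_per_second,
    }
```

For example, a request that starts at t=0, streams its first token at t=0.5s, and finishes at t=2.5s with 100 completion tokens yields a TTFT of 500 ms, an end-to-end latency of 2500 ms, and 50 tokens per second over the generation window.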

This PR implements high-precision timing using time.perf_counter() to provide these metrics without breaking backward compatibility.
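Backward compatibility follows from making the new fields optional with None defaults, so existing code that constructs or consumes these events without the new fields keeps working unchanged. A simplified sketch (field names match the PR; the class shape and pre-existing fields shown are illustrative, not the actual autogen definitions):

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LLMStreamEndEvent:
    # Pre-existing fields (illustrative subset).
    prompt_tokens: int
    completion_tokens: int
    # New optional telemetry fields: defaulting to None means callers
    # written before this change construct the event exactly as before.
    latency_ms: Optional[float] = None
    tokens_per_second: Optional[float] = None
    ttft_ms: Optional[float] = None
```

An event created the old way, e.g. LLMStreamEndEvent(prompt_tokens=10, completion_tokens=20), simply reports None for the telemetry fields, which downstream consumers can check before logging.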

Related issue number

Closes #5790 (Add latency and token per second stats to LLMCallEvent).

Checks

  • I've included any doc changes needed for https://microsoft.github.io/autogen/. (No public documentation changes required for this internal telemetry upgrade).
  • I've added tests (if relevant) corresponding to the changes introduced in this PR.
  • I've made sure all auto checks have passed (Ran black and ruff linting).

Technical Details

  • Event Refactor: Added latency_ms, tokens_per_second, and ttft_ms optional fields to LLMCallEvent and LLMStreamEndEvent in logging.py.
  • Client Integration: Integrated timing logic into OpenAIChatCompletionClient and AzureAIChatCompletionClient.
  • Streaming Logic: Captured ttft_ms by measuring the interval between request initiation and the first yielded chunk containing content or tool calls.
  • Precision: Utilized time.perf_counter() to ensure monotonic, high-resolution measurements immune to system clock adjustments.
  • Testing: Implemented regression tests in python/packages/autogen-core/tests/test_logging_events.py verifying both data presence and safe handling of optional fields.
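The streaming logic described above can be sketched as a wrapper around the provider's chunk iterator: TTFT is recorded at the first chunk that actually carries content, and total latency when the stream is exhausted. All names here are illustrative, not the actual client internals:

```python
import time
from typing import AsyncIterator


async def timed_stream(chunks: AsyncIterator[str],
                       metrics: dict) -> AsyncIterator[str]:
    """Yield chunks unchanged while recording TTFT and total latency.

    `chunks` stands in for the provider's streaming response. A chunk counts
    as the "first token" only if it is non-empty (in the real client: contains
    content or tool calls), so empty keep-alive chunks do not skew TTFT.
    """
    start = time.perf_counter()
    async for chunk in chunks:
        if "ttft_ms" not in metrics and chunk:
            metrics["ttft_ms"] = (time.perf_counter() - start) * 1000.0
        yield chunk
    # Stream exhausted: record end-to-end latency. In the real client these
    # values would be attached to the emitted LLMStreamEndEvent.
    metrics["latency_ms"] = (time.perf_counter() - start) * 1000.0
```

Because the wrapper only observes timestamps and re-yields each chunk, it adds no buffering and does not alter what the caller receives.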

