feat: add conversation recorder extension #1950

Roei-Bracha · 2026-01-06T18:50:05Z

Conversation Recorder Extension

This PR adds the `conversation_recorder` extension, which records both user and agent audio during a session, mixes the two streams into a single WAV file, and saves that file to a configurable storage backend.

Changes Overview

The PR introduces a new extension located at ai_agents/agents/ten_packages/extension/conversation_recorder/. Below is a breakdown of the included files:

| File | Description |
| --- | --- |
| `extension.py` | Core extension logic. Manages the recording lifecycle, listens for `on_user_joined`/`on_user_left` events, and coordinates audio mixing and storage. |
| `audio_mixer.py` | Handles mixing multiple PCM audio streams into one. Includes support for real-time resampling (e.g., matching 16kHz user audio with 24kHz agent audio). |
| `storage.py` | Provides an abstraction for storage backends. Supports Local Filesystem, Google Cloud Storage (GCS), and S3-compatible storage. |
| `README.md` | Comprehensive documentation on configuration, features, and graph integration. |
| `manifest.json` | Extension metadata and definition of all configuration properties. |
| `requirements.txt` | Lists dependencies: `numpy`, `scipy`, `google-cloud-storage`, and `boto3`. |
| `addon.py` / `__init__.py` | Necessary boilerplate for TEN Agent extension registration and loading. |
| `property.json` | Default values for the extension's properties. |
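
To illustrate what the resample-then-mix step in `audio_mixer.py` involves, here is a minimal sketch. It is not the code from this PR: the PR lists `numpy` and `scipy` as dependencies, while this sketch uses only the standard library and plain linear interpolation so it can run anywhere. The function names `resample_linear` and `mix` are hypothetical, and the audio is assumed to be 16-bit mono PCM.

```python
import array


def resample_linear(samples, src_rate, dst_rate):
    """Resample 16-bit mono PCM by linear interpolation (illustrative only)."""
    if src_rate == dst_rate or len(samples) == 0:
        return array.array("h", samples)
    out_len = len(samples) * dst_rate // src_rate
    last = len(samples) - 1
    out = array.array("h")
    for i in range(out_len):
        # Position of output sample i on the input time axis.
        pos = i * src_rate / dst_rate
        lo = min(int(pos), last)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(int(round(samples[lo] * (1 - frac) + samples[hi] * frac)))
    return out


def mix(a, b):
    """Sum two 16-bit streams sample by sample, clipping to the int16 range.

    The shorter stream is treated as silence once it runs out.
    """
    out = array.array("h")
    for i in range(max(len(a), len(b))):
        total = (a[i] if i < len(a) else 0) + (b[i] if i < len(b) else 0)
        out.append(max(-32768, min(32767, total)))
    return out


# Example: bring 16 kHz user audio up to the 24 kHz agent rate, then mix.
user_16k = array.array("h", [0, 1000, 2000, 3000])
agent_24k = array.array("h", [100, 100, 100, 100, 100, 100])
mixed = mix(resample_linear(user_16k, 16000, 24000), agent_24k)
```

Linear interpolation is the simplest choice and introduces some aliasing; a polyphase or windowed-sinc resampler gives cleaner results at the cost of an extra dependency.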

Key Features

- Multi-Source Audio Mixing: Combines user and agent audio frames into a single WAV recording.
- Graceful Shutdown: Implements signal handlers (`SIGTERM`, `SIGINT`) and `atexit` hooks so that recordings are flushed and closed even if the agent process is interrupted.
- Pluggable Storage: Native support for uploading recordings directly to GCS or S3 buckets, or saving them locally.
- Event-Driven Recording: Can be configured to start automatically on agent startup or to be triggered by specific commands such as `on_user_joined`.
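
The graceful-shutdown behaviour can be sketched roughly as follows. This is an illustration of the general pattern (signal handlers plus an `atexit` hook around an idempotent finalizer), not the implementation in this PR. The class name `RecorderFinalizer` and the `flush` callback are hypothetical. It assumes CPython, where signal handlers run in the main thread and `signal.signal` must be called from the main thread.

```python
import atexit
import signal


class RecorderFinalizer:
    """Runs a flush callback at most once, whichever shutdown path fires first."""

    def __init__(self, flush):
        self._flush = flush  # hypothetical callback that closes the WAV file
        self._done = False

    def finalize(self):
        """Flush the recording; return True only on the call that did the work."""
        if self._done:
            return False
        self._done = True
        self._flush()
        return True

    def install(self):
        """Register the finalizer for normal exit, SIGTERM, and SIGINT."""
        atexit.register(self.finalize)
        for sig in (signal.SIGTERM, signal.SIGINT):
            previous = signal.getsignal(sig)
            signal.signal(sig, self._make_handler(previous))

    def _make_handler(self, previous):
        def handler(signum, frame):
            self.finalize()
            if callable(previous):
                # Chain to the earlier Python-level handler (for SIGINT this
                # is normally the one that raises KeyboardInterrupt).
                previous(signum, frame)
            else:
                # The default disposition would have ended the process.
                raise SystemExit(128 + signum)
        return handler
```

Typical use would be `RecorderFinalizer(recorder.close).install()` during startup, where `recorder.close` stands in for whatever finalizes the file. Note that `SIGKILL` cannot be caught, so no handler can protect a recording against it.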

How to use

Integrate the extension into your graph by:

1. Connecting the `pcm_frame` outputs from audio sources (e.g., `streamid_adapter` and `v2v`) to the `conversation_recorder` node.
2. Connecting the `on_user_joined` and `on_user_left` commands from your RTC extension to the recorder.
3. Configuring the `storage_type` and relevant credentials in the graph properties.
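
As a rough picture of how a `storage_type` setting might select a backend, here is a hedged sketch of a small storage abstraction. It is not taken from `storage.py`. The class names, the `make_storage` factory, the values `"local"`, `"s3"`, and `"gcs"`, and the keys `local_dir`, `bucket`, and `endpoint_url` are all assumptions made for illustration; see the extension's `README.md` and `manifest.json` for the real property names. Credentials are assumed to come from the environment, as is usual for `boto3` and `google-cloud-storage`.

```python
import shutil
from pathlib import Path


class LocalStorage:
    """Copies the finished WAV file into a base directory."""

    def __init__(self, base_dir):
        self.base_dir = Path(base_dir)

    def save(self, local_path, key):
        dest = self.base_dir / key
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.copyfile(local_path, dest)
        return str(dest)


class S3Storage:
    """Uploads to S3 or an S3-compatible endpoint via boto3."""

    def __init__(self, bucket, endpoint_url=None):
        self.bucket = bucket
        self.endpoint_url = endpoint_url

    def save(self, local_path, key):
        import boto3  # imported lazily so local-only setups do not need it

        client = boto3.client("s3", endpoint_url=self.endpoint_url)
        client.upload_file(str(local_path), self.bucket, key)
        return f"s3://{self.bucket}/{key}"


class GCSStorage:
    """Uploads to Google Cloud Storage via google-cloud-storage."""

    def __init__(self, bucket):
        self.bucket = bucket

    def save(self, local_path, key):
        from google.cloud import storage  # lazy import, as above

        blob = storage.Client().bucket(self.bucket).blob(key)
        blob.upload_from_filename(str(local_path))
        return f"gs://{self.bucket}/{key}"


def make_storage(config):
    """Pick a backend from a config dict (all keys here are hypothetical)."""
    kind = config.get("storage_type", "local")
    if kind == "local":
        return LocalStorage(config.get("local_dir", "recordings"))
    if kind == "s3":
        return S3Storage(config["bucket"], config.get("endpoint_url"))
    if kind == "gcs":
        return GCSStorage(config["bucket"])
    raise ValueError(f"unknown storage_type: {kind!r}")
```

Keeping every backend behind the same `save(local_path, key)` method means the recorder only has to write a local WAV file and hand it off, regardless of where it ends up.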

feat: add conversation recorder extension

866c4e2

Roei-Bracha requested review from halajohn and plutoless as code owners January 6, 2026 18:50

Roei-Bracha and others added 2 commits January 6, 2026 21:05

refactor: fix linting

2692ec4

Merge branch 'main' into conversation-recorder

9267745
