Add Mid-Generation Streaming Support by cvaz1306 · Pull Request #124 · devnen/Chatterbox-TTS-Server

cvaz1306 · 2026-02-19T23:49:34Z

Summary

This PR adds a high-performance streaming architecture to the TTS server. Users can now hear and see audio as it is generated, chunk by chunk, which significantly reduces "Time to First Sound" (TTFS) for long texts. The PR also includes critical stability fixes for audio degradation and digital noise.

Key Features

A new /tts/stream API endpoint yields each audio chunk as soon as it is generated, using an asynchronous generator.
A new UI control lets users switch between standard batch generation and live streaming mode.
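
To illustrate the chunk-by-chunk idea behind the endpoint, here is a minimal sketch of an asynchronous generator that yields audio per sentence. It is not the code from this PR: `split_into_chunks`, `stream_tts`, and `fake_synthesize` are hypothetical names, and the sentence-based splitting is an assumption. The real endpoint would call the TTS engine in place of the stand-in synthesizer.

```python
import asyncio
import re
from typing import AsyncIterator, Callable, List


def split_into_chunks(text: str) -> List[str]:
    """Hypothetical splitter: break text after sentence-ending punctuation."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def fake_synthesize(sentence: str) -> bytes:
    """Stand-in for the real TTS call (assumption): returns placeholder bytes."""
    return sentence.encode("utf-8")


async def stream_tts(
    text: str, synthesize: Callable[[str], bytes]
) -> AsyncIterator[bytes]:
    """Yield audio for each chunk as soon as it has been synthesized."""
    for sentence in split_into_chunks(text):
        # Run the blocking synthesis in a worker thread so the event loop
        # stays responsive while audio is being generated.
        audio = await asyncio.to_thread(synthesize, sentence)
        yield audio


async def main() -> None:
    async for chunk in stream_tts("First sentence. Second one!", fake_synthesize):
        print(len(chunk), "bytes ready")


if __name__ == "__main__":
    asyncio.run(main())
```

If the server is built on FastAPI (an assumption; the PR text does not say), a generator like this could be wrapped in `fastapi.responses.StreamingResponse` so that each chunk is sent to the client as it is produced.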

Testing Instructions

1. Restart the server.
2. Enter a long paragraph of text (3-4 sentences).
3. Ensure "Stream Audio" is checked.
4. Click "Generate Speech".
5. Verify: Audio should start playing within seconds.
6. Verify: Pressing the "Pause" button during generation stops the audio, and playback can be resumed.
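
The "starts playing within seconds" check can also be approximated outside the UI by timing the first chunk of a streamed response. The helper below is a hedged sketch, not part of this PR: `measure_first_chunk` is a hypothetical name, and it works on any iterable of byte chunks. The request body expected by /tts/stream is not described here, so the example uses a simulated stream.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def measure_first_chunk(
    chunks: Iterable[bytes],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], int, int]:
    """Consume a chunk stream and report (seconds_to_first_chunk, count, bytes).

    seconds_to_first_chunk is None if the stream produced no chunks.
    """
    start = clock()
    first_at: Optional[float] = None
    count = 0
    total_bytes = 0
    for chunk in chunks:
        if first_at is None:
            # Time elapsed until the first chunk arrived.
            first_at = clock() - start
        count += 1
        total_bytes += len(chunk)
    return first_at, count, total_bytes


def simulated_stream(n: int, delay: float) -> Iterator[bytes]:
    """Hypothetical stand-in for a streamed HTTP response body."""
    for _ in range(n):
        time.sleep(delay)  # pretend each chunk takes `delay` seconds to generate
        yield b"\x00\x00"


if __name__ == "__main__":
    ttfs, count, size = measure_first_chunk(simulated_stream(3, 0.01))
    print(f"first chunk after {ttfs:.3f}s, {count} chunks, {size} bytes")
```

With the `requests` library, the iterator returned by `response.iter_content(chunk_size=...)` on a request made with `stream=True` could be passed in place of `simulated_stream`.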

Add streaming support

428bd62

cvaz1306 changed the title ~~Add Mid-Generation Streaming Support and Real-Time Visualization~~ Add Mid-Generation Streaming Support Feb 19, 2026

cvaz1306 wants to merge 1 commit into devnen:main from cvaz1306:main
