feat: optimizations for strix halo by 0xrushi · Pull Request #125 · devnen/Chatterbox-TTS-Server

0xrushi · 2026-02-24T13:59:02Z

Renamed Docker and compose files to clearly target AMD Strix Halo hardware and updated references accordingly.

Added bfloat16 optimization for the T3 model with autocast support, improving token generation speed by ~40% while keeping S3Gen in float32 for stability.
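The mixed-precision split described above can be sketched as follows. This is a minimal illustration and not the code from this PR: `token_model` and `vocoder` are hypothetical stand-ins for T3 and S3Gen, and the sketch uses only `torch.autocast`, whereas the actual change may also convert the T3 weights.

```python
import torch
from torch import nn

# Hypothetical stand-ins: the real T3 and S3Gen modules are not shown in this PR text.
token_model = nn.Linear(16, 32)  # plays the role of T3 (token generation)
vocoder = nn.Linear(32, 8)       # plays the role of S3Gen (kept in float32)


def synthesize(x, use_bf16=True):
    """Run the token stage under bfloat16 autocast and the vocoder stage in float32."""
    with torch.no_grad():
        # Only the token stage runs inside the autocast region.
        with torch.autocast(
            device_type=x.device.type, dtype=torch.bfloat16, enabled=use_bf16
        ):
            tokens = token_model(x)
        # Outside the region: cast back up so the vocoder sees float32 inputs.
        audio = vocoder(tokens.float())
    return tokens, audio


x = torch.randn(2, 16)
tokens, audio = synthesize(x)
print(tokens.dtype, audio.dtype)
```

Keeping the cast to float32 outside the autocast region is what confines the reduced precision to the first stage.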

Introduced a voice conditioning cache to skip redundant voice encoding, saving 2–5 seconds on repeat requests.
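One way such a cache could work is sketched below, assuming the conditioning is keyed by a hash of the reference audio bytes. The class name `VoiceConditioningCache`, the `encode_fn` callback, and the LRU eviction policy are illustrative assumptions, not names or behavior taken from this PR.

```python
import hashlib
from collections import OrderedDict


class VoiceConditioningCache:
    """Small LRU cache that maps reference-audio content to its encoded conditioning."""

    def __init__(self, encode_fn, max_entries=8):
        self._encode_fn = encode_fn      # hypothetical: the expensive voice encoder
        self._max_entries = max_entries
        self._entries = OrderedDict()

    def get(self, audio_bytes):
        # Key on content rather than file name, so a replaced file is re-encoded.
        key = hashlib.sha256(audio_bytes).hexdigest()
        if key in self._entries:
            self._entries.move_to_end(key)  # mark as most recently used
            return self._entries[key]
        conditioning = self._encode_fn(audio_bytes)
        self._entries[key] = conditioning
        if len(self._entries) > self._max_entries:
            self._entries.popitem(last=False)  # evict the least recently used entry
        return conditioning
```

With this shape, a repeat request for the same voice returns the stored conditioning without calling the encoder again.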
Added a configurable TTS_BF16 environment variable with automatic hardware detection and backward-compatible fallbacks across GPUs and CPU.
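A rough sketch of how the `TTS_BF16` setting might be resolved is shown here. The accepted values (`auto`, `1`/`0`, and so on), the default of `auto`, and the `hardware_supports_bf16` probe are assumptions made for illustration; the PR text does not spell them out.

```python
import os

# Assumed spellings for explicit on/off; anything else is treated as "auto".
_TRUE = {"1", "true", "yes", "on"}
_FALSE = {"0", "false", "no", "off"}


def resolve_bf16(env=None, hardware_supports_bf16=lambda: False):
    """Decide whether to use bfloat16 based on TTS_BF16 and a hardware probe."""
    env = os.environ if env is None else env
    raw = env.get("TTS_BF16", "auto").strip().lower()
    if raw in _FALSE:
        return False  # explicit opt-out always wins
    supported = hardware_supports_bf16()
    if raw in _TRUE:
        # Explicit opt-in is honored only when the hardware can do it;
        # otherwise fall back to float32 instead of failing.
        return supported
    return supported  # "auto" or unrecognized value: follow the probe
```

In a PyTorch setup the probe could be something like `torch.cuda.is_bf16_supported`, guarded by `torch.cuda.is_available()`, though whether the PR detects hardware this way is not stated.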

Patched torch 2.9+ dtype issues and tuned Strix Halo ROCm settings, delivering 50% faster inference with zero breaking changes.

Before:

After:

0xrushi added 2 commits February 24, 2026 08:51

strix halo optimizations

2610936

new reqs

62ff3f1

0xrushi wants to merge 2 commits into devnen:main from 0xrushi:fix/strixhalo
