From e18d985a2e0516cf83f9b8d25efda65e95b1126a Mon Sep 17 00:00:00 2001
From: Alok Kumar <kumaralok344@gmail.com>
Date: Tue, 11 Nov 2025 12:18:06 +0530
Subject: [PATCH 1/2] Add multilingual TTS support with
 ChatterboxMultilingualTTS

- Upgrade to ChatterboxMultilingualTTS from GitHub (supports 23 languages)
- Add MPS device compatibility fixes for Apple Silicon Macs
- Fix attention implementation issues with SDPA on MPS devices
- Update config to enable multilingual support with English as default language
- Enhance UI with model selection and language controls
- Add graceful fallback for missing multilingual support
- Update server API to handle language_id parameter
- Fix torch.load map_location for non-CUDA devices
- Add comprehensive documentation for multilingual setup

Now working with multilingual model on Mac, supporting 23 languages including Hindi, Arabic, Chinese, French, German, etc. Default language set to English while maintaining full multilingual capabilities.
---
 .python-version       |    1 +
 UI_CHANGES_SUMMARY.md |  151 +++
 UPGRADE_FOR_HINDI.md  |  224 ++++
 config.py             |    5 +-
 config.yaml           |   52 +-
 engine.py             |  127 ++-
 main.py               |    6 +
 models.py             |    6 +-
 pyproject.toml        |    7 +
 requirements.txt      |    2 +-
 run.bash              |    1 +
 server.py             |   11 +
 ui/index.html         | 1436 ++++++++++++++++--------
 ui/script.js          | 2458 +++++++++++++++++++++++++----------------
 uv.lock               |    8 +
 15 files changed, 3050 insertions(+), 1445 deletions(-)
 create mode 100644 .python-version
 create mode 100644 UI_CHANGES_SUMMARY.md
 create mode 100644 UPGRADE_FOR_HINDI.md
 create mode 100644 main.py
 create mode 100644 pyproject.toml
 create mode 100755 run.bash
 create mode 100644 uv.lock

diff --git a/.python-version b/.python-version
new file mode 100644
index 0000000..c8cfe39
--- /dev/null
+++ b/.python-version
@@ -0,0 +1 @@
+3.10
diff --git a/UI_CHANGES_SUMMARY.md b/UI_CHANGES_SUMMARY.md
new file mode 100644
index 0000000..531e7b9
--- /dev/null
+++ b/UI_CHANGES_SUMMARY.md
@@ -0,0 +1,151 @@
+# UI Changes for Multilingual Support
+
+## Summary
+Added UI controls to allow users to select between the multilingual and English-only TTS models, along with improved language selection and status indicators.
+
+## Changes Made
+
+### 1. Navigation Bar (index.html)
+**Added Model Status Badge** - Shows which model is currently loaded
+- Location: Next to the title in the navigation bar
+- Displays: "🌐 Multilingual (23 Languages)" or "🇬🇧 English Only"
+- Updates automatically based on server configuration
+- Provides visual feedback about the active model
+
+### 2. Generation Parameters Section (index.html)
+**Added Model Type Selector** - Dropdown to choose TTS model
+- Location: Just before the Language selector in the Generation Parameters
+- Options:
+  - "Multilingual (23 Languages)" - Supports 23 languages including Hindi
+  - "English Only" - Smaller model, English only
+- Note: Warns users that server restart is required to apply changes
+
+**Updated Language Selector**
+- Changed default selection from "en" (English) to "hi" (Hindi)
+- Added language codes in parentheses for clarity (e.g., "Hindi (hi)")
+- Updated help text to be more informative
+- Now properly syncs with the config.yaml default language setting
+
+### 3. JavaScript Updates (script.js)
+
+#### Model Type Handling
+- Added `modelTypeSelect` element reference
+- Loads current model type from server config on page load
+- Sets dropdown value based on `config.model.use_multilingual`
+
+#### Save Generation Parameters
+- Extended to save both generation parameters AND model type selection
+- Saves `model.use_multilingual` boolean value to server config
+- Shows warning notification when model type changes
+- Detects if restart is needed and notifies user
+
+#### Model Status Badge Updates
+- Badge text and icon update on page load based on config
+- Shows multilingual status with globe emoji (🌐) or English with flag (🇬🇧)
+- Tooltip provides detailed information about loaded model
+
+#### Change Detection & Warnings
+- Detects when user changes model type from current config
+- Shows persistent notification reminding user to:
+  1. Click "Save Generation Parameters"
+  2. Restart the server for changes to take effect
+- Prevents confusion about why model didn't change
+
+## User Workflow
+
+### To Switch Models:
+1. Open the UI in browser
+2. Look at the Generation Parameters section
+3. Find "TTS Model" dropdown
+4. Select desired model:
+   - "Multilingual (23 Languages)" - for Hindi and other languages
+   - "English Only" - for English-only use case
+5. Click "Save Generation Parameters" button
+6. Notice the warning about server restart
+7. Click "Restart Server" button (or manually restart)
+8. Refresh the page
+9. Verify the model status badge shows the new model
+
+### To Use Hindi (Default):
+1. Model is already set to Multilingual
+2. Language is already set to Hindi (hi)
+3. Simply enter Hindi text and generate
+
+### To Use Other Languages:
+1. Ensure Model Type is "Multilingual"
+2. Select desired language from Language dropdown
+3. Enter text in that language
+4. Generate speech
+
+## Visual Indicators
+
+### Model Status Badge (Top Navigation)
+```
+Current State          Badge Display
+------------------     -------------------------------
+Multilingual loaded -> "🌐 Multilingual (23 Languages)"
+English-only loaded -> "🇬🇧 English Only"
+```
+
+### Model Type Selector (Form)
+```
+Config Value                      Dropdown Shows
+--------------------------------  ---------------------------
+use_multilingual: true        ->  "Multilingual (23 Languages)"
+use_multilingual: false       ->  "English Only"
+```
+
+### Language Selector (Form)
+- Shows all 23 supported languages
+- Hindi (hi) is selected by default
+- Language codes shown for clarity
+
+## Benefits
+
+1. **User-Friendly**: Clear visual feedback about which model is loaded
+2. **Flexible**: Easy switching between multilingual and English-only models
+3. **Informative**: Warnings and tooltips guide users through the process
+4. **Consistent**: UI state syncs with server configuration
+5. **Safe**: Clear warnings about server restart requirements
+
+## Technical Details
+
+### Config Values Saved
+```json
+{
+  "model": {
+    "use_multilingual": true  // or false
+  },
+  "generation_defaults": {
+    "language": "hi",  // or any of 23 supported codes
+    "temperature": 0.8,
+    "exaggeration": 0.5,
+    // ... other params
+  }
+}
+```
+
+### Supported Language Codes
+ar, da, de, el, en, es, fi, fr, he, **hi**, it, ja, ko, ms, nl, no, pl, pt, ru, sv, sw, tr, zh
+
+### Files Modified
+1. `ui/index.html` - Added model selector, updated language selector, added status badge
+2. `ui/script.js` - Added model handling logic, status updates, change detection
+3. `config.yaml` - Set default to multilingual model and Hindi language
+4. `config.py` - Added default multilingual setting to DEFAULT_CONFIG
+5. `engine.py` - Added multilingual model support
+6. `server.py` - Added language_id parameter handling
+
+## Testing Checklist
+
+- [ ] Model status badge shows correct model on page load
+- [ ] Model type selector reflects current config
+- [ ] Language selector defaults to Hindi
+- [ ] Changing model type shows notification
+- [ ] Save button updates config.yaml correctly
+- [ ] Restart button triggers server restart
+- [ ] After restart, new model is loaded
+- [ ] Badge updates after model change
+- [ ] Hindi text generates proper speech (not noise)
+- [ ] Other languages work correctly
+- [ ] English still works when English-only model selected
diff --git a/UPGRADE_FOR_HINDI.md b/UPGRADE_FOR_HINDI.md
new file mode 100644
index 0000000..b40200b
--- /dev/null
+++ b/UPGRADE_FOR_HINDI.md
@@ -0,0 +1,224 @@
+# Upgrade Guide: Enable Hindi & Multilingual Support
+
+## Current Status
+
+Your Chatterbox TTS server is currently running with:
+- **chatterbox-tts version**: 0.1.2 (English-only)
+- **Model**: English-only TTS model
+- **Languages**: English only
+
+## To Enable Hindi & 22 Other Languages
+
+The code has been updated to support multilingual TTS, but you need to upgrade the `chatterbox-tts` package to access the multilingual model.
+
+### Step 1: Upgrade chatterbox-tts Package
+
+```bash
+# Stop the server first (Ctrl+C if running)
+
+# Activate your virtual environment
+source .venv/bin/activate
+
+# Upgrade to the latest version
+pip install --upgrade chatterbox-tts
+
+# Or install a specific version if available
+pip install chatterbox-tts>=0.1.4
+```
+
+### Step 2: Verify the Upgrade
+
+```bash
+# Check the installed version
+pip show chatterbox-tts
+
+# Verify multilingual support is available
+python -c "from chatterbox import ChatterboxMultilingualTTS; print('Multilingual support: Available')" || echo "Multilingual support: Not yet available"
+```
+
+### Step 3: Enable Multilingual in Configuration
+
+Edit `config.yaml`:
+
+```yaml
+model:
+  repo_id: ResembleAI/chatterbox
+  use_multilingual: true  # Change from false to true
+
+generation_defaults:
+  temperature: 0.8
+  exaggeration: 0.5
+  cfg_weight: 0.5
+  seed: 0
+  speed_factor: 1.0
+  language: hi  # Change from 'en' to 'hi' for Hindi
+```
+
+### Step 4: Restart the Server
+
+```bash
+./run.bash
+```
+
+Or manually:
+
+```bash
+source .venv/bin/activate
+python server.py
+```
+
+### Step 5: Verify Hindi Support
+
+Check the server logs for:
+```
+Successfully loaded Multilingual TTS model on mps. Supports 23 languages including Hindi.
+```
+
+Open the web UI and you should see:
+- Model badge showing: "🌐 Multilingual (23 Languages)"
+- Language selector with Hindi (hi) as default
+
+## Current Fallback Behavior
+
+The code has been designed to gracefully handle the missing multilingual model:
+
+1. **If multilingual is requested but not available**:
+   - Server logs a warning
+   - Automatically falls back to English-only model
+   - Server continues to run normally
+
+2. **If you try to generate Hindi audio**:
+   - Server logs a warning: "Language 'hi' requested but multilingual model not available"
+   - Generates English audio instead
+   - No errors or crashes
+
+## Supported Languages (After Upgrade)
+
+Once upgraded, your server will support these 23 languages:
+
+| Code | Language | Code | Language | Code | Language |
+|------|----------|------|----------|------|----------|
+| ar | Arabic | da | Danish | de | German |
+| el | Greek | en | English | es | Spanish |
+| fi | Finnish | fr | French | he | Hebrew |
+| **hi** | **Hindi** | it | Italian | ja | Japanese |
+| ko | Korean | ms | Malay | nl | Dutch |
+| no | Norwegian | pl | Polish | pt | Portuguese |
+| ru | Russian | sv | Swedish | sw | Swahili |
+| tr | Turkish | zh | Chinese | | |
+
+## Troubleshooting
+
+### Issue: Multilingual import still fails after upgrade
+
+**Solution**: The multilingual model might be in a different package or version:
+
+```bash
+# Check if there's a separate multilingual package
+pip search chatterbox-multilingual
+
+# Or check the Chatterbox GitHub for latest installation instructions
+# https://github.com/resemble-ai/chatterbox
+```
+
+### Issue: Model download fails
+
+**Solution**: The multilingual model is larger (~2-3GB). Ensure you have:
+- Sufficient disk space
+- Stable internet connection
+- Access to Hugging Face (not blocked by firewall)
+
+### Issue: Server shows "Multilingual model requested but not available"
+
+This means the package doesn't have the multilingual class yet. Check:
+
+```bash
+# Verify the package contents
+python -c "import chatterbox; print(dir(chatterbox))"
+
+# Look for ChatterboxMultilingualTTS in the output
+```
+
+### Issue: Still generating English audio for Hindi text
+
+Possible causes:
+1. Multilingual model not loaded (check logs)
+2. Config still has `use_multilingual: false`
+3. Model badge still shows "English Only"
+
+**Solution**: Follow all upgrade steps again and restart server.
+
+## Alternative: Use Latest GitHub Version
+
+If PyPI doesn't have the latest multilingual version yet:
+
+```bash
+# Install directly from GitHub
+pip uninstall chatterbox-tts -y
+pip install git+https://github.com/resemble-ai/chatterbox.git
+
+# Or clone and install locally
+git clone https://github.com/resemble-ai/chatterbox.git
+cd chatterbox
+pip install -e .
+```
+
+## Rollback to English-Only
+
+If you prefer to use only English:
+
+1. Edit `config.yaml`:
+   ```yaml
+   model:
+     use_multilingual: false
+   
+   generation_defaults:
+     language: en
+   ```
+
+2. Restart server
+
+The server will use the smaller, faster English-only model.
+
+## Code Changes Summary
+
+The following changes were made to support gradual migration:
+
+### engine.py
+- Added conditional import for multilingual model
+- Falls back gracefully if multilingual not available
+- Warns user to upgrade when multilingual is requested
+- Uses English-only model as fallback
+
+### config.yaml
+- Added `model.use_multilingual` setting (currently `false`)
+- Language default set to `en` (will be `hi` after upgrade)
+
+### UI
+- Model selector dropdown to switch between models
+- Status badge showing currently loaded model
+- Language selector for all 23 languages
+- Automatic warnings when changes require restart
+
+## Benefits After Upgrade
+
+✅ **Full Hindi Support** - Generate natural-sounding Hindi speech
+✅ **22 Additional Languages** - Support for multiple languages
+✅ **Better Quality** - Improved voice quality for non-English languages
+✅ **No Accent Issues** - Native language synthesis without English accent
+✅ **UI Ready** - All UI controls already in place
+
+## Next Steps
+
+1. Upgrade `chatterbox-tts` package
+2. Update `config.yaml` settings
+3. Restart server
+4. Test Hindi generation
+5. Enjoy multilingual TTS! 🎉
+
+---
+
+**Need Help?**
+- Check server logs for detailed error messages
+- Review [Chatterbox GitHub](https://github.com/resemble-ai/chatterbox)
+- Check [Chatterbox Multilingual Demo](https://huggingface.co/spaces/ResembleAI/Chatterbox-Multilingual-TTS)
diff --git a/config.py b/config.py
index 2fdea6f..91f9ac8 100644
--- a/config.py
+++ b/config.py
@@ -50,6 +50,7 @@
     },
     "model": {  # Added section for model source configuration
         "repo_id": "ResembleAI/chatterbox",  # Default Hugging Face repository ID for the model
+        "use_multilingual": True,  # Use multilingual model for 23 languages support (default: True)
     },
     "tts_engine": {
         "device": "auto",  # TTS processing device: 'auto', 'cuda', 'mps', or 'cpu'.
@@ -72,11 +73,11 @@
     },
     "generation_defaults": {  # Default parameters for TTS audio generation.
         "temperature": 0.8,  # Controls randomness: lower is more deterministic.
-        "exaggeration": 0.5,  # Controls expressiveness or exaggeration in speech.
+        "exaggeration": 0.5,  # Controls expressiveness or exaggeration in speech. Range: 0.5 (subdued) to 3.0 (very dramatic).
         "cfg_weight": 0.5,  # Classifier-Free Guidance weight, influences adherence to prompt/style.
         "seed": 0,  # Random seed for generation. 0 often means random or engine default.
         "speed_factor": 1.0,  # Controls the speed of the generated speech.
-        "language": "en",  # Default language for TTS.
+        "language": "en",  # Default language for TTS. Supported: ar (Arabic), da (Danish), de (German), el (Greek), en (English), es (Spanish), fi (Finnish), fr (French), he (Hebrew), hi (Hindi), it (Italian), ja (Japanese), ko (Korean), ms (Malay), nl (Dutch), no (Norwegian), pl (Polish), pt (Portuguese), ru (Russian), sv (Swedish), sw (Swahili), tr (Turkish), zh (Chinese). Note: Quality may vary by language.
     },
     "audio_output": {  # Settings related to the format of generated audio.
         "format": "wav",  # Output audio format (e.g., 'wav', 'mp3').
diff --git a/config.yaml b/config.yaml
index f610286..30f827f 100644
--- a/config.yaml
+++ b/config.yaml
@@ -1,6 +1,6 @@
 server:
   host: 0.0.0.0
-  port: 8004
+  port: 8000
   use_ngrok: false
   use_auth: false
   auth_username: user
@@ -10,44 +10,60 @@ server:
   log_file_backup_count: 5
 model:
   repo_id: ResembleAI/chatterbox
+  use_multilingual: true
 tts_engine:
-  device: cuda
+  device: mps
   predefined_voices_path: voices
   reference_audio_path: reference_audio
-  default_voice_id: Emily.wav
+  default_voice_id: default_sample.wav
 paths:
   model_cache: model_cache
   output: outputs
 generation_defaults:
   temperature: 0.8
-  exaggeration: 1.3
+  exaggeration: 0.5
   cfg_weight: 0.5
   seed: 0
   speed_factor: 1.0
-  language: en
+  language: hi
 audio_output:
   format: wav
   sample_rate: 24000
   max_reference_duration_sec: 30
 ui_state:
-  last_text: 'Are you tired of slow, unreliable connections? Upgrade today to Quantum
-    Fiber, the fastest internet in the galaxy! Experience seamless streaming, lag-free
-    gaming, and instant downloads. Call now and get your first three months half price!
-    Don''t wait, this offer won''t last forever!
-
-    '
-  last_voice_mode: predefined
-  last_predefined_voice: Emily.wav
-  last_reference_file: Gianna.wav
-  last_seed: 3000
-  last_chunk_size: 240
+  last_text:
+    "\u092E\u0948\u0902 \u0938\u093E\u0915\u094D\u0937\u0940 \u0906\u0928\
+    \u0902\u0926 \u0939\u0942\u0901\u0964 \u092E\u0948\u0902 \u092C\u093F\u0939\u093E\
+    \u0930 \u0915\u0947 \u092A\u091F\u0928\u093E \u092E\u0947\u0902 \u092A\u0948\u0926\
+    \u093E \u0939\u0941\u0908 \u0925\u0940, \u0914\u0930 \u0905\u092C \u092A\u0941\
+    \u0923\u0947 \u092E\u0947\u0902 \u0930\u0939 \u0930\u0939\u0940 \u0939\u0942\u0901\
+    \u0964\n\n\u092E\u0948\u0902 \u092C\u0939\u0941\u0924 \u0906\u0932\u0938\u0940\
+    \ \u0932\u0921\u093C\u0915\u0940 \u0939\u0942\u0901\u0964 \u092E\u0948\u0902 \u0938\
+    \u0941\u092C\u0939 \u092C\u0939\u0941\u0924 \u0926\u0947\u0930 \u0938\u0947 \u0909\
+    \u0920\u0924\u0940 \u0939\u0942\u0901, \u092E\u0941\u091D\u0947 \u0917\u092A\u0936\
+    \u092A \u0915\u0930\u0928\u093E \u092C\u0939\u0941\u0924 \u092A\u0938\u0902\u0926\
+    \ \u0939\u0948\u0964 \u092E\u0948\u0902 \u090F\u0915 \u092E\u0930\u094D\u0926\
+    \ \u092C\u0928\u0928\u093E \u091A\u093E\u0939\u0924\u0940 \u0939\u0942\u0901\u0964\
+    \ \u092E\u0941\u091D\u0947 \u0913\u0936\u094B \u0915\u0947 \u0935\u093F\u091A\u093E\
+    \u0930 \u092C\u0939\u0941\u0924 \u092A\u0938\u0902\u0926 \u0939\u0948\u0902, \u091C\
+    \u093F\u0928\u094D\u0939\u094B\u0902\u0928\u0947 \u092E\u0941\u091D\u0947 \u0938\
+    \u093F\u0916\u093E\u092F\u093E \u0915\u093F \u092E\u0941\u0936\u094D\u0915\u093F\
+    \u0932 \u0938\u0935\u093E\u0932 \u092A\u0942\u091B\u0928\u093E \u0915\u093F\u0924\
+    \u0928\u093E \u091C\u093C\u0930\u0942\u0930\u0940 \u0939\u0948\u0964 \u0914\u0930\
+    \ \u092E\u0947\u0930\u093E \u0926\u093F\u092E\u093E\u0917 \u0918\u0941\u091F\u0928\
+    \u0947 \u092E\u0947\u0902 \u0939\u0948\u0964"
+  last_voice_mode: clone
+  last_predefined_voice: none
+  last_reference_file: recn.wav
+  last_seed: 2024
+  last_chunk_size: 250
   last_split_text_enabled: true
   hide_chunk_warning: false
-  hide_generation_warning: false
+  hide_generation_warning: true
   theme: light
 ui:
   title: Chatterbox TTS Server
   show_language_select: true
-  max_predefined_voices_in_dropdown: 50
+  max_predefined_voices_in_dropdown: 20
 debug:
   save_intermediate_audio: false
diff --git a/engine.py b/engine.py
index e42426e..b764a03 100644
--- a/engine.py
+++ b/engine.py
@@ -5,10 +5,20 @@
 import random
 import numpy as np
 import torch
-from typing import Optional, Tuple
+from typing import Optional, Tuple, Union
 from pathlib import Path
 
 from chatterbox.tts import ChatterboxTTS  # Main TTS engine class
+
+# Try to import multilingual model if available (newer versions)
+try:
+    from chatterbox.mtl_tts import ChatterboxMultilingualTTS
+    MULTILINGUAL_AVAILABLE = True
+except ImportError:
+    ChatterboxMultilingualTTS = None  # type: ignore
+    MULTILINGUAL_AVAILABLE = False
+    logging.warning("Multilingual TTS model not available. Please upgrade chatterbox-tts for multilingual support.")
+
 from chatterbox.models.s3gen.const import (
     S3GEN_SR,
 )  # Default sample rate from the engine
@@ -19,11 +29,12 @@
 logger = logging.getLogger(__name__)
 
 # --- Global Module Variables ---
-chatterbox_model: Optional[ChatterboxTTS] = None
+chatterbox_model: Optional[Union[ChatterboxTTS, 'ChatterboxMultilingualTTS']] = None
 MODEL_LOADED: bool = False
 model_device: Optional[str] = (
-    None  # Stores the resolved device string ('cuda' or 'cpu')
+    None  # Stores the resolved device string ('cuda', 'mps', or 'cpu')
 )
+use_multilingual_model: bool = True  # Default to multilingual for broader language support
 
 
 def set_seed(seed_value: int):
@@ -87,12 +98,13 @@ def load_model() -> bool:
     Loads the TTS model.
     This version directly attempts to load from the Hugging Face repository (or its cache)
     using `from_pretrained`, bypassing the local `paths.model_cache` directory.
+    Automatically uses the multilingual model for broader language support.
     Updates global variables `chatterbox_model`, `MODEL_LOADED`, and `model_device`.
 
     Returns:
         bool: True if the model was loaded successfully, False otherwise.
     """
-    global chatterbox_model, MODEL_LOADED, model_device
+    global chatterbox_model, MODEL_LOADED, model_device, use_multilingual_model
 
     if MODEL_LOADED:
         logger.info("TTS model is already loaded.")
@@ -157,27 +169,60 @@ def load_model() -> bool:
         model_device = resolved_device_str
         logger.info(f"Final device selection: {model_device}")
 
-        # Get configured model_repo_id for logging and context,
-        # though from_pretrained might use its own internal default if not overridden.
-        model_repo_id_config = config_manager.get_string(
-            "model.repo_id", "ResembleAI/chatterbox"
-        )
-
+        # Check if multilingual model should be used (default: True for broader language support)
+        use_multilingual_model = config_manager.get_bool("model.use_multilingual", True)
+        
+        # Check if multilingual model is actually available
+        if use_multilingual_model and not MULTILINGUAL_AVAILABLE:
+            logger.warning(
+                "Multilingual model requested but not available in current chatterbox-tts version. "
+                "Using English-only model. To enable multilingual support, upgrade chatterbox-tts: "
+                "pip install --upgrade chatterbox-tts"
+            )
+            use_multilingual_model = False
+        
         logger.info(
-            f"Attempting to load model directly using from_pretrained (expected from Hugging Face repository: {model_repo_id_config} or library default)."
+            f"Attempting to load {'multilingual' if use_multilingual_model else 'English-only'} model using from_pretrained."
         )
         try:
             # Directly use from_pretrained. This will utilize the standard Hugging Face cache.
-            # The ChatterboxTTS.from_pretrained method handles downloading if the model is not in the cache.
-            chatterbox_model = ChatterboxTTS.from_pretrained(device=model_device)
-            # The actual repo ID used by from_pretrained is often internal to the library,
-            # but logging the configured one provides user context.
-            logger.info(
-                f"Successfully loaded TTS model using from_pretrained on {model_device} (expected from '{model_repo_id_config}' or library default)."
-            )
+            # The model's from_pretrained method handles downloading if the model is not in the cache.
+            if use_multilingual_model and MULTILINGUAL_AVAILABLE:
+                # Workaround for MPS/CPU: Patch torch.load to use map_location for non-CUDA devices
+                original_torch_load = torch.load
+                if model_device != "cuda":
+                    device_obj = torch.device(model_device)
+                    def patched_torch_load(f, *args, **kwargs):
+                        if 'map_location' not in kwargs:
+                            kwargs['map_location'] = device_obj
+                        return original_torch_load(f, *args, **kwargs)
+                    torch.load = patched_torch_load
+                
+                try:
+                    chatterbox_model = ChatterboxMultilingualTTS.from_pretrained(device=model_device)
+                    
+                    # Fix for MPS: Set attention implementation to 'eager' to avoid SDPA issues
+                    if hasattr(chatterbox_model, 't3') and hasattr(chatterbox_model.t3, 'tfmr'):
+                        try:
+                            chatterbox_model.t3.tfmr.config._attn_implementation = 'eager'
+                            logger.info("Set attention implementation to 'eager' for MPS compatibility")
+                        except Exception as e:
+                            logger.warning(f"Could not set attention implementation: {e}")
+                    
+                    logger.info(
+                        f"Successfully loaded Multilingual TTS model on {model_device}. Supports 23 languages including Hindi."
+                    )
+                finally:
+                    # Restore original torch.load
+                    torch.load = original_torch_load
+            else:
+                chatterbox_model = ChatterboxTTS.from_pretrained(device=model_device)
+                logger.info(
+                    f"Successfully loaded English-only TTS model on {model_device}."
+                )
         except Exception as e_hf:
             logger.error(
-                f"Failed to load model using from_pretrained (expected from '{model_repo_id_config}' or library default): {e_hf}",
+                f"Failed to load {'multilingual' if use_multilingual_model else 'English-only'} model: {e_hf}",
                 exc_info=True,
             )
             chatterbox_model = None
@@ -214,6 +259,7 @@ def synthesize(
     exaggeration: float = 0.5,
     cfg_weight: float = 0.5,
     seed: int = 0,
+    language_id: Optional[str] = None,
 ) -> Tuple[Optional[torch.Tensor], Optional[int]]:
     """
     Synthesizes audio from text using the loaded TTS model.
@@ -226,12 +272,14 @@ def synthesize(
         cfg_weight: Classifier-Free Guidance weight.
         seed: Random seed for generation. If 0, default randomness is used.
               If non-zero, a global seed is set for reproducibility.
+        language_id: Language code for multilingual model (e.g., 'hi' for Hindi, 'en' for English).
+                     Only used with multilingual model. If None, defaults to config language.
 
     Returns:
         A tuple containing the audio waveform (torch.Tensor) and the sample rate (int),
         or (None, None) if synthesis fails.
     """
-    global chatterbox_model
+    global chatterbox_model, use_multilingual_model
 
     if not MODEL_LOADED or chatterbox_model is None:
         logger.error("TTS model is not loaded. Cannot synthesize audio.")
@@ -249,19 +297,40 @@ def synthesize(
 
         logger.debug(
             f"Synthesizing with params: audio_prompt='{audio_prompt_path}', temp={temperature}, "
-            f"exag={exaggeration}, cfg_weight={cfg_weight}, seed_applied_globally_if_nonzero={seed}"
+            f"exag={exaggeration}, cfg_weight={cfg_weight}, seed_applied_globally_if_nonzero={seed}, "
+            f"language_id={language_id}"
         )
 
         # Call the core model's generate method
-        wav_tensor = chatterbox_model.generate(
-            text=text,
-            audio_prompt_path=audio_prompt_path,
-            temperature=temperature,
-            exaggeration=exaggeration,
-            cfg_weight=cfg_weight,
-        )
+        # For multilingual model, include language_id parameter if available
+        if use_multilingual_model and MULTILINGUAL_AVAILABLE and isinstance(chatterbox_model, ChatterboxMultilingualTTS):
+            # Use provided language_id or default from config
+            effective_language = language_id or config_manager.get_string("generation_defaults.language", "en")
+            logger.info(f"Generating speech for language: {effective_language}")
+            wav_tensor = chatterbox_model.generate(
+                text=text,
+                audio_prompt_path=audio_prompt_path,
+                temperature=temperature,
+                exaggeration=exaggeration,
+                cfg_weight=cfg_weight,
+                language_id=effective_language,
+            )
+        else:
+            # English-only model doesn't use language_id parameter
+            if language_id and language_id != "en":
+                logger.warning(
+                    f"Language '{language_id}' requested but multilingual model not available. "
+                    "Generating in English. Upgrade chatterbox-tts for multilingual support."
+                )
+            wav_tensor = chatterbox_model.generate(
+                text=text,
+                audio_prompt_path=audio_prompt_path,
+                temperature=temperature,
+                exaggeration=exaggeration,
+                cfg_weight=cfg_weight,
+            )
 
-        # The ChatterboxTTS.generate method already returns a CPU tensor.
+        # The model's generate method already returns a CPU tensor.
         return wav_tensor, chatterbox_model.sr
 
     except Exception as e:
diff --git a/main.py b/main.py
new file mode 100644
index 0000000..21ed8de
--- /dev/null
+++ b/main.py
@@ -0,0 +1,6 @@
+def main():
+    print("Hello from chatterbox-tts-server!")
+
+
+if __name__ == "__main__":
+    main()
diff --git a/models.py b/models.py
index 0bef8d6..8e06888 100644
--- a/models.py
+++ b/models.py
@@ -16,9 +16,9 @@ class GenerationParams(BaseModel):
     )
     exaggeration: Optional[float] = Field(
         None,
-        ge=0.25,  # Based on Chatterbox Gradio app
-        le=2.0,  # Based on Chatterbox Gradio app
-        description="Controls expressiveness/exaggeration. (Range: 0.25-2.0)",
+        ge=0.5,  # Based on Chatterbox tutorial notebook
+        le=3.0,  # Based on Chatterbox tutorial notebook
+        description="Controls expressiveness/exaggeration. Lower values are more subdued/calm, higher values are more dramatic/energetic. (Range: 0.5-3.0)",
     )
     cfg_weight: Optional[float] = Field(
         None,
diff --git a/pyproject.toml b/pyproject.toml
new file mode 100644
index 0000000..7b92d81
--- /dev/null
+++ b/pyproject.toml
@@ -0,0 +1,7 @@
+[project]
+name = "chatterbox-tts-server"
+version = "0.1.0"
+description = "Add your description here"
+readme = "README.md"
+requires-python = ">=3.10"
+dependencies = []
diff --git a/requirements.txt b/requirements.txt
index 700d35c..16342e2 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -23,7 +23,7 @@ fastapi                         # Modern async web framework
 uvicorn[standard]               # ASGI server with performance extras
 
 # --- Scientific Computing & ML Libraries ---
-numpy==1.26.4                   # Fundamental numerical computing
+numpy                # Fundamental numerical computing
 librosa                         # Advanced audio/music analysis
 safetensors                     # Safe tensor serialization format
 descript-audio-codec            # Audio codec for ML applications
diff --git a/run.bash b/run.bash
new file mode 100755
index 0000000..9e5f3e0
--- /dev/null
+++ b/run.bash
@@ -0,0 +1 @@
+source .venv/bin/activate && python server.py
diff --git a/server.py b/server.py
index dbe3a04..69e6ae2 100644
--- a/server.py
+++ b/server.py
@@ -77,6 +77,7 @@ class OpenAISpeechRequest(BaseModel):
     response_format: Literal["wav", "opus", "mp3"] = "wav"  # Add "mp3"
     speed: float = 1.0
     seed: Optional[int] = None
+    language: Optional[str] = None  # Added language support for multilingual
 
 
 # --- Logging Configuration ---
@@ -758,6 +759,11 @@ async def custom_tts_endpoint(
                 seed=(
                     request.seed if request.seed is not None else get_gen_default_seed()
                 ),
+                language_id=(
+                    request.language
+                    if request.language is not None
+                    else get_gen_default_language()
+                ),
             )
             perf_monitor.record(f"Engine synthesized chunk {i+1}")
 
@@ -936,6 +942,11 @@ async def openai_speech_endpoint(request: OpenAISpeechRequest):
             exaggeration=get_gen_default_exaggeration(),
             cfg_weight=get_gen_default_cfg_weight(),
             seed=seed_to_use,
+            language_id=(
+                request.language
+                if request.language is not None
+                else get_gen_default_language()
+            ),
         )
 
         if audio_tensor is None or sr is None:
diff --git a/ui/index.html b/ui/index.html
index 07ecb87..615336c 100644
--- a/ui/index.html
+++ b/ui/index.html
@@ -1,9 +1,10 @@
 <!DOCTYPE html>
-<html lang="en" class="dark"> <!-- Default to dark mode, script will manage preference -->
+<html lang="en" class="dark">
+  <!-- Default to dark mode, script will manage preference -->
 
-<head>
-    <meta charset="UTF-8">
-    <meta name="viewport" content="width=device-width, initial-scale=1.0">
+  <head>
+    <meta charset="UTF-8" />
+    <meta name="viewport" content="width=device-width, initial-scale=1.0" />
     <title>Chatterbox TTS Server</title>
 
     <!--
@@ -11,482 +12,1019 @@
       - styles.css: Pre-compiled stylesheet containing all Tailwind and custom component styles.
       - wavesurfer.min.js: The WaveSurfer.js library for audio visualization, served locally.
     -->
-    <link rel="stylesheet" href="styles.css">
+    <link rel="stylesheet" href="styles.css" />
     <script src="vendor/wavesurfer.min.js"></script>
-</head>
+  </head>
 
-<body class="body-base">
+  <body class="body-base">
     <div class="min-h-full">
-        <!-- Navigation Bar -->
-        <nav class="nav-base">
-            <div class="mx-auto max-w-7xl px-4 sm:px-6 lg:px-8">
-                <div class="flex h-16 items-center justify-between">
-                    <div class="flex items-center">
-                        <div class="flex-shrink-0">
-                            <a href="/" id="app-title-link" class="title-link">Chatterbox TTS Server</a>
-                        </div>
-                    </div>
-                    <div class="flex items-center space-x-3 sm:space-x-4">
-                        <a href="/docs" target="_blank" class="nav-link">API Docs</a>
-                        <button id="theme-toggle-btn" type="button" class="theme-switch-button"
-                            title="Toggle light/dark mode">
-                            <span class="sr-only">Toggle theme</span>
-                            <span class="theme-switch-thumb">
-                                <svg class="theme-icon-sun" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"
-                                    fill="currentColor">
-                                    <path
-                                        d="M12 2.25a.75.75 0 01.75.75v2.25a.75.75 0 01-1.5 0V3a.75.75 0 01.75-.75zM7.5 12a4.5 4.5 0 119 0 4.5 4.5 0 01-9 0zM18.894 6.166a.75.75 0 00-1.06-1.06l-1.591 1.59a.75.75 0 101.06 1.061l1.591-1.59zM21.75 12a.75.75 0 01-.75.75h-2.25a.75.75 0 010-1.5H21a.75.75 0 01.75.75zM17.834 18.894a.75.75 0 001.06-1.06l-1.59-1.591a.75.75 0 10-1.061 1.06l1.59 1.591zM12 18a.75.75 0 01.75.75V21a.75.75 0 01-1.5 0v-2.25A.75.75 0 0112 18zM7.758 17.303a.75.75 0 00-1.061-1.06l-1.591 1.59a.75.75 0 001.06 1.061l1.591-1.59zM6 12a.75.75 0 01-.75.75H3a.75.75 0 010-1.5h2.25A.75.75 0 016 12zM6.697 7.757a.75.75 0 001.06-1.06l-1.59-1.591a.75.75 0 00-1.061 1.06l1.59 1.591z" />
-                                </svg>
-                                <svg class="theme-icon-moon" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"
-                                    fill="currentColor">
-                                    <path fill-rule="evenodd"
-                                        d="M9.528 1.718a.75.75 0 01.162.819A8.97 8.97 0 009 6a9 9 0 009 9 8.97 8.97 0 003.463-.69.75.75 0 01.981.98 10.503 10.503 0 01-9.694 6.46c-5.799 0-10.5-4.701-10.5-10.5 0-4.368 2.667-8.112 6.46-9.694a.75.75 0 01.818.162z"
-                                        clip-rule="evenodd" />
-                                </svg>
-                            </span>
-                        </button>
-                    </div>
-                </div>
+      <!-- Navigation Bar -->
+      <nav class="nav-base">
+        <div class="mx-auto max-w-7xl px-4 sm:px-6 lg:px-8">
+          <div class="flex h-16 items-center justify-between">
+            <div class="flex items-center">
+              <div class="flex-shrink-0">
+                <a href="/" id="app-title-link" class="title-link"
+                  >Chatterbox TTS Server</a
+                >
+              </div>
+              <div class="ml-4">
+                <span
+                  id="model-status-badge"
+                  class="inline-flex items-center px-2.5 py-0.5 rounded-full text-xs font-medium bg-blue-100 text-blue-800 dark:bg-blue-900 dark:text-blue-200"
+                  title="Currently loaded TTS model"
+                >
+                  <svg
+                    class="w-3 h-3 mr-1"
+                    fill="currentColor"
+                    viewBox="0 0 20 20"
+                  >
+                    <path
+                      fill-rule="evenodd"
+                      d="M18 10a8 8 0 11-16 0 8 8 0 0116 0zm-7-4a1 1 0 11-2 0 1 1 0 012 0zM9 9a.75.75 0 000 1.5h.253a.25.25 0 01.244.304l-.459 2.066A1.75 1.75 0 0010.747 15H11a.75.75 0 000-1.5h-.253a.25.25 0 01-.244-.304l.459-2.066A1.75 1.75 0 009.253 9H9z"
+                      clip-rule="evenodd"
+                    />
+                  </svg>
+                  Multilingual
+                </span>
+              </div>
             </div>
-        </nav>
-
-        <!-- Main Content Area -->
-        <main>
-            <div class="mx-auto max-w-7xl px-4 py-1 sm:px-6 lg:px-8">
-
-                <div id="notification-area" class="mb-6 space-y-3"></div>
-
-                <div class="card-base">
-                    <form id="tts-form" action="#" method="post" class="flex flex-col" onsubmit="return false;">
-                        <div class="p-6 sm:p-8">
-                            <h2 id="tts-form-header" class="card-header">Generate Speech</h2>
-
-                            <div class="mb-5">
-                                <label for="text" class="label-base">Text to synthesize</label>
-                                <p class="text-xs text-slate-500 dark:text-slate-400 mb-2">
-                                    Enter the text you want to convert to speech. For audiobooks, you can paste long
-                                    chapters.
-                                </p>
-                                <div class="relative">
-                                    <textarea name="text" id="text" rows="8" class="textarea-base"
-                                        placeholder="Enter text here..." required></textarea>
-                                    <div class="absolute bottom-2.5 right-3 text-xs text-slate-500 dark:text-slate-400">
-                                        <span id="char-count">0</span> Characters
-                                    </div>
-                                </div>
-                            </div>
+            <div class="flex items-center space-x-3 sm:space-x-4">
+              <a href="/docs" target="_blank" class="nav-link">API Docs</a>
+              <button
+                id="theme-toggle-btn"
+                type="button"
+                class="theme-switch-button"
+                title="Toggle light/dark mode"
+              >
+                <span class="sr-only">Toggle theme</span>
+                <span class="theme-switch-thumb">
+                  <svg
+                    class="theme-icon-sun"
+                    xmlns="http://www.w3.org/2000/svg"
+                    viewBox="0 0 24 24"
+                    fill="currentColor"
+                  >
+                    <path
+                      d="M12 2.25a.75.75 0 01.75.75v2.25a.75.75 0 01-1.5 0V3a.75.75 0 01.75-.75zM7.5 12a4.5 4.5 0 119 0 4.5 4.5 0 01-9 0zM18.894 6.166a.75.75 0 00-1.06-1.06l-1.591 1.59a.75.75 0 101.06 1.061l1.591-1.59zM21.75 12a.75.75 0 01-.75.75h-2.25a.75.75 0 010-1.5H21a.75.75 0 01.75.75zM17.834 18.894a.75.75 0 001.06-1.06l-1.59-1.591a.75.75 0 10-1.061 1.06l1.59 1.591zM12 18a.75.75 0 01.75.75V21a.75.75 0 01-1.5 0v-2.25A.75.75 0 0112 18zM7.758 17.303a.75.75 0 00-1.061-1.06l-1.591 1.59a.75.75 0 001.06 1.061l1.591-1.59zM6 12a.75.75 0 01-.75.75H3a.75.75 0 010-1.5h2.25A.75.75 0 016 12zM6.697 7.757a.75.75 0 001.06-1.06l-1.59-1.591a.75.75 0 00-1.061 1.06l1.59 1.591z"
+                    />
+                  </svg>
+                  <svg
+                    class="theme-icon-moon"
+                    xmlns="http://www.w3.org/2000/svg"
+                    viewBox="0 0 24 24"
+                    fill="currentColor"
+                  >
+                    <path
+                      fill-rule="evenodd"
+                      d="M9.528 1.718a.75.75 0 01.162.819A8.97 8.97 0 009 6a9 9 0 009 9 8.97 8.97 0 003.463-.69.75.75 0 01.981.98 10.503 10.503 0 01-9.694 6.46c-5.799 0-10.5-4.701-10.5-10.5 0-4.368 2.667-8.112 6.46-9.694a.75.75 0 01.818.162z"
+                      clip-rule="evenodd"
+                    />
+                  </svg>
+                </span>
+              </button>
+            </div>
+          </div>
+        </div>
+      </nav>
 
-                            <div class="flex flex-wrap items-center gap-x-6 gap-y-4 mb-6">
-                                <button type="button" id="generate-btn" class="btn-primary">
-                                    <svg xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"
-                                        stroke-width="1.5" stroke="currentColor" class="w-5 h-5 mr-1.5 inline-block">
-                                        <path stroke-linecap="round" stroke-linejoin="round"
-                                            d="M19.114 5.636a9 9 0 0 1 0 12.728M16.463 8.288a5.25 5.25 0 0 1 0 7.424M6.75 8.25l4.72-4.72a.75.75 0 0 1 1.28.53v15.88a.75.75 0 0 1-1.28.53l-4.72-4.72H4.51c-.88 0-1.704-.507-1.938-1.354A9.009 9.009 0 0 1 2.25 12c0-.83.112-1.633.322-2.396C2.806 8.756 3.63 8.25 4.51 8.25H6.75Z" />
-                                    </svg>
-                                    Generate Speech
-                                </button>
-                                <label for="split-text-toggle"
-                                    class="flex items-center cursor-pointer text-sm label-base select-none">
-                                    <input type="checkbox" id="split-text-toggle" name="split_text"
-                                        class="mr-2 h-4 w-4 rounded border-slate-300 text-indigo-600 focus:ring-indigo-500 dark:border-slate-600 dark:bg-slate-700 dark:focus:ring-offset-slate-800 dark:checked:bg-indigo-500 dark:checked:border-indigo-500">
-                                    Split text into chunks
-                                </label>
-                                <div id="chunk-size-controls" class="flex items-center space-x-2 hidden">
-                                    <label for="chunk-size-slider" class="text-sm label-base whitespace-nowrap">Chunk
-                                        Size:</label>
-                                    <input type="range" id="chunk-size-slider" name="chunk_size" min="50" max="1000"
-                                        step="10" value="120" class="slider-base w-32">
-                                    <span id="chunk-size-value"
-                                        class="text-sm font-medium text-slate-700 dark:text-slate-300 w-10 text-right tabular-nums">120</span>
-                                </div>
-                            </div>
-                            <p id="chunk-explanation" class="text-xs text-slate-500 dark:text-slate-400 mb-6 hidden">
-                                Splitting is essential for longer texts like articles or audiobook chapters.
-                                Recommended chunk size: ~150-400 characters.
-                            </p>
+      <!-- Main Content Area -->
+      <main>
+        <div class="mx-auto max-w-7xl px-4 py-1 sm:px-6 lg:px-8">
+          <div id="notification-area" class="mb-6 space-y-3"></div>
 
-                            <div class="mb-6">
-                                <label class="label-base mb-2">Voice Mode:</label>
-                                <div class="grid grid-cols-1 md:grid-cols-2 gap-4">
-                                    <label class="voice-mode-option" data-mode="predefined">
-                                        <input type="radio" name="voice_mode" value="predefined" class="hidden peer"
-                                            checked>
-                                        <span>Predefined Voices</span>
-                                    </label>
-                                    <label class="voice-mode-option" data-mode="clone">
-                                        <input type="radio" name="voice_mode" value="clone" class="hidden peer">
-                                        <span>Voice Cloning (Reference)</span>
-                                    </label>
-                                </div>
-                            </div>
+          <div class="card-base">
+            <form
+              id="tts-form"
+              action="#"
+              method="post"
+              class="flex flex-col"
+              onsubmit="return false;"
+            >
+              <div class="p-6 sm:p-8">
+                <h2 id="tts-form-header" class="card-header">
+                  Generate Speech
+                </h2>
 
-                            <div id="predefined-voice-options" class="mb-6">
-                                <label for="predefined-voice-select" class="label-base">Select Predefined Voice:</label>
-                                <div class="flex items-center gap-2">
-                                    <select id="predefined-voice-select" name="predefined_voice_select"
-                                        class="select-base flex-grow">
-                                        <option value="none">-- Select Voice --</option>
-                                    </select>
-                                    <input type="file" id="predefined-voice-file-input" class="hidden" multiple
-                                        accept=".wav,.mp3" aria-label="Upload predefined voice file">
-                                    <button type="button" id="predefined-voice-import-button" class="btn-import"
-                                        title="Import new predefined voice files">
-                                        <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor">
-                                            <path
-                                                d="M9.25 13.25a.75.75 0 0 0 1.5 0V4.636l2.955 3.129a.75.75 0 0 0 1.09-1.03l-4.25-4.5a.75.75 0 0 0-1.09 0l-4.25 4.5a.75.75 0 1 0 1.09 1.03L9.25 4.636v8.614Z" />
-                                            <path
-                                                d="M3.5 12.75a.75.75 0 0 0-1.5 0v2.5A2.75 2.75 0 0 0 4.75 18h10.5A2.75 2.75 0 0 0 18 15.25v-2.5a.75.75 0 0 0-1.5 0v2.5c0 .69-.56 1.25-1.25 1.25H4.75c-.69 0-1.25-.56-1.25-1.25v-2.5Z" />
-                                        </svg>
-                                        Import
-                                    </button>
-                                    <button type="button" id="predefined-voice-refresh-button" class="btn-refresh"
-                                        title="Refresh predefined voice list">
-                                        <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor">
-                                            <path fill-rule="evenodd"
-                                                d="M15.312 11.424a5.5 5.5 0 0 1-9.201-4.42 5.5 5.5 0 0 1 10.89 2.01a.75.75 0 0 1-1.49.174a4 4 0 0 0-7.486-2.641a4 4 0 0 0 6.688 2.947a.75.75 0 0 1 .999.057l-.004-.005ZM4.688 8.576a5.5 5.5 0 0 1 9.201 4.42a5.5 5.5 0 0 1-10.89-2.01a.75.75 0 0 1 1.49-.174a4 4 0 0 0 7.486 2.641a4 4 0 0 0-6.688-2.947a.75.75 0 0 1-.999-.057l.004.005Z"
-                                                clip-rule="evenodd" />
-                                        </svg>
-                                        Refresh
-                                    </button>
-                                </div>
-                            </div>
+                <div class="mb-5">
+                  <label for="text" class="label-base"
+                    >Text to synthesize</label
+                  >
+                  <p class="text-xs text-slate-500 dark:text-slate-400 mb-2">
+                    Enter the text you want to convert to speech. For
+                    audiobooks, you can paste long chapters.
+                  </p>
+                  <div class="relative">
+                    <textarea
+                      name="text"
+                      id="text"
+                      rows="8"
+                      class="textarea-base"
+                      placeholder="Enter text here..."
+                      required
+                    ></textarea>
+                    <div
+                      class="absolute bottom-2.5 right-3 text-xs text-slate-500 dark:text-slate-400"
+                    >
+                      <span id="char-count">0</span> Characters
+                    </div>
+                  </div>
+                </div>
 
-                            <div id="clone-options" class="mb-6 hidden">
-                                <label for="clone-reference-select" class="label-base">Reference Audio File</label>
-                                <p class="text-xs text-slate-500 dark:text-slate-400 mb-2">
-                                    Select an uploaded <code class="code-inline">.wav</code> or <code
-                                        class="code-inline">.mp3</code> file.
-                                    For best results, use clean audio recordings.
-                                </p>
-                                <div class="flex items-center gap-2">
-                                    <select id="clone-reference-select" name="clone_reference_select"
-                                        class="select-base flex-grow">
-                                        <option value="none">-- Select Reference File --</option>
-                                    </select>
-                                    <input type="file" id="clone-file-input" class="hidden" multiple accept=".wav,.mp3"
-                                        aria-label="Upload reference audio file">
-                                    <button type="button" id="clone-import-button" class="btn-import"
-                                        title="Import new reference files">
-                                        <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor">
-                                            <path
-                                                d="M9.25 13.25a.75.75 0 0 0 1.5 0V4.636l2.955 3.129a.75.75 0 0 0 1.09-1.03l-4.25-4.5a.75.75 0 0 0-1.09 0l-4.25 4.5a.75.75 0 1 0 1.09 1.03L9.25 4.636v8.614Z" />
-                                            <path
-                                                d="M3.5 12.75a.75.75 0 0 0-1.5 0v2.5A2.75 2.75 0 0 0 4.75 18h10.5A2.75 2.75 0 0 0 18 15.25v-2.5a.75.75 0 0 0-1.5 0v2.5c0 .69-.56 1.25-1.25 1.25H4.75c-.69 0-1.25-.56-1.25-1.25v-2.5Z" />
-                                        </svg>
-                                        Import
-                                    </button>
-                                    <button type="button" id="clone-refresh-button" class="btn-refresh"
-                                        title="Refresh reference file list">
-                                        <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor">
-                                            <path fill-rule="evenodd"
-                                                d="M15.312 11.424a5.5 5.5 0 0 1-9.201-4.42 5.5 5.5 0 0 1 10.89 2.01a.75.75 0 0 1-1.49.174a4 4 0 0 0-7.486-2.641a4 4 0 0 0 6.688 2.947a.75.75 0 0 1 .999.057l-.004-.005ZM4.688 8.576a5.5 5.5 0 0 1 9.201 4.42a5.5 5.5 0 0 1-10.89-2.01a.75.75 0 0 1 1.49-.174a4 4 0 0 0 7.486 2.641a4 4 0 0 0-6.688-2.947a.75.75 0 0 1-.999-.057l.004.005Z"
-                                                clip-rule="evenodd" />
-                                        </svg>
-                                        Refresh
-                                    </button>
-                                </div>
-                            </div>
+                <div class="flex flex-wrap items-center gap-x-6 gap-y-4 mb-6">
+                  <button type="button" id="generate-btn" class="btn-primary">
+                    <svg
+                      xmlns="http://www.w3.org/2000/svg"
+                      fill="none"
+                      viewBox="0 0 24 24"
+                      stroke-width="1.5"
+                      stroke="currentColor"
+                      class="w-5 h-5 mr-1.5 inline-block"
+                    >
+                      <path
+                        stroke-linecap="round"
+                        stroke-linejoin="round"
+                        d="M19.114 5.636a9 9 0 0 1 0 12.728M16.463 8.288a5.25 5.25 0 0 1 0 7.424M6.75 8.25l4.72-4.72a.75.75 0 0 1 1.28.53v15.88a.75.75 0 0 1-1.28.53l-4.72-4.72H4.51c-.88 0-1.704-.507-1.938-1.354A9.009 9.009 0 0 1 2.25 12c0-.83.112-1.633.322-2.396C2.806 8.756 3.63 8.25 4.51 8.25H6.75Z"
+                      />
+                    </svg>
+                    Generate Speech
+                  </button>
+                  <label
+                    for="split-text-toggle"
+                    class="flex items-center cursor-pointer text-sm label-base select-none"
+                  >
+                    <input
+                      type="checkbox"
+                      id="split-text-toggle"
+                      name="split_text"
+                      class="mr-2 h-4 w-4 rounded border-slate-300 text-indigo-600 focus:ring-indigo-500 dark:border-slate-600 dark:bg-slate-700 dark:focus:ring-offset-slate-800 dark:checked:bg-indigo-500 dark:checked:border-indigo-500"
+                    />
+                    Split text into chunks
+                  </label>
+                  <div
+                    id="chunk-size-controls"
+                    class="flex items-center space-x-2 hidden"
+                  >
+                    <label
+                      for="chunk-size-slider"
+                      class="text-sm label-base whitespace-nowrap"
+                      >Chunk Size:</label
+                    >
+                    <input
+                      type="range"
+                      id="chunk-size-slider"
+                      name="chunk_size"
+                      min="50"
+                      max="1000"
+                      step="10"
+                      value="120"
+                      class="slider-base w-32"
+                    />
+                    <span
+                      id="chunk-size-value"
+                      class="text-sm font-medium text-slate-700 dark:text-slate-300 w-10 text-right tabular-nums"
+                      >120</span
+                    >
+                  </div>
+                </div>
+                <p
+                  id="chunk-explanation"
+                  class="text-xs text-slate-500 dark:text-slate-400 mb-6 hidden"
+                >
+                  Splitting is essential for longer texts like articles or
+                  audiobook chapters. Recommended chunk size: ~150-400
+                  characters.
+                </p>
 
-                            <div class="mb-6">
-                                <label class="label-base mb-2">Load Example Preset:</label>
-                                <div id="presets-container" class="flex flex-wrap gap-2">
-                                    <p id="presets-placeholder" class="text-sm text-slate-500 dark:text-slate-400">
-                                        Loading presets...</p>
-                                </div>
-                            </div>
+                <div class="mb-6">
+                  <label class="label-base mb-2">Voice Mode:</label>
+                  <div class="grid grid-cols-1 md:grid-cols-2 gap-4">
+                    <label class="voice-mode-option" data-mode="predefined">
+                      <input
+                        type="radio"
+                        name="voice_mode"
+                        value="predefined"
+                        class="hidden peer"
+                        checked
+                      />
+                      <span>Predefined Voices</span>
+                    </label>
+                    <label class="voice-mode-option" data-mode="clone">
+                      <input
+                        type="radio"
+                        name="voice_mode"
+                        value="clone"
+                        class="hidden peer"
+                      />
+                      <span>Voice Cloning (Reference)</span>
+                    </label>
+                  </div>
+                </div>
 
-                            <div class="mb-6">
-                                <details class="group" open>
-                                    <summary class="list-none flex cursor-pointer items-center group">
-                                        <span
-                                            class="text-sm font-medium label-base group-hover:text-indigo-600 dark:group-hover:text-indigo-400 transition-colors">Generation
-                                            Parameters</span>
-                                        <span
-                                            class="ml-1.5 text-slate-400 group-hover:text-indigo-500 dark:group-hover:text-indigo-400 transition-colors">
-                                            <svg class="group-open:rotate-180 h-5 w-5 transition-transform duration-200 ease-in-out"
-                                                viewBox="0 0 20 20" fill="currentColor">
-                                                <path fill-rule="evenodd"
-                                                    d="M5.293 7.293a1 1 0 011.414 0L10 10.586l3.293-3.293a1 1 0 111.414 1.414l-4 4a1 1 0 01-1.414 0l-4-4a1 1 0 010-1.414z"
-                                                    clip-rule="evenodd" />
-                                            </svg>
-                                        </span>
-                                    </summary>
-                                    <div class="mt-4 grid grid-cols-1 md:grid-cols-2 gap-x-6 gap-y-5">
-                                        <div>
-                                            <label for="temperature" class="label-base">Temperature (<span
-                                                    id="temperature-value"
-                                                    class="font-semibold tabular-nums">0.8</span>)</label>
-                                            <input type="range" id="temperature" name="temperature" min="0.0" max="1.5"
-                                                step="0.01" value="0.8" class="slider-base">
-                                        </div>
-                                        <div>
-                                            <label for="exaggeration" class="label-base">Exaggeration (<span
-                                                    id="exaggeration-value"
-                                                    class="font-semibold tabular-nums">0.5</span>)</label>
-                                            <input type="range" id="exaggeration" name="exaggeration" min="0.0"
-                                                max="2.0" step="0.01" value="0.5" class="slider-base">
-                                        </div>
-                                        <div>
-                                            <label for="cfg-weight" class="label-base">CFG Weight (<span
-                                                    id="cfg-weight-value"
-                                                    class="font-semibold tabular-nums">0.5</span>)</label>
-                                            <input type="range" id="cfg-weight" name="cfg_weight" min="0.0" max="2.0"
-                                                step="0.01" value="0.5" class="slider-base">
-                                        </div>
-                                        <div>
-                                            <label for="speed-factor" class="label-base">
-                                                Speed Factor (<span id="speed-factor-value"
-                                                    class="font-semibold tabular-nums">1.0</span>)
-                                                <span id="speed-factor-warning"
-                                                    class="text-xs text-yellow-600 dark:text-yellow-400 ml-2 hidden"></span>
-                                            </label>
-                                            <input type="range" id="speed-factor" name="speed_factor" min="0.25"
-                                                max="4.0" step="0.05" value="1.0" class="slider-base">
-                                        </div>
-                                        <div>
-                                            <label for="seed" class="label-base">Generation Seed</label>
-                                            <div class="w-36">
-                                                <input type="number" id="seed" name="seed" value="0"
-                                                    placeholder="0 (or -1 for random)" class="input-base"
-                                                    title="Enter an integer seed for reproducible results, or -1 for random. 0 may also be random depending on engine.">
-                                            </div>
-                                            <p class="text-xs text-slate-500 dark:text-slate-400 mt-1.5">
-                                                Integer for reproducible results. Some engines use 0 or -1 for random.
-                                            </p>
-                                        </div>
-                                        <div id="language-select-container">
-                                            <label for="language" class="label-base">Language</label>
-                                            <select id="language" name="language" class="select-base w-36">
-                                                <option value="en">English</option>
-                                            </select>
-                                        </div>
-                                        <div>
-                                            <label for="output-format" class="label-base">Output Format</label>
-                                            <select id="output-format" name="output_format" class="select-base w-36">
-                                                <option value="wav">WAV</option>
-                                                <option value="mp3" selected>MP3</option>
-                                                <option value="opus">Opus</option>
-                                            </select>
-                                            <p class="text-xs text-slate-500 dark:text-slate-400 mt-1.5">
-                                                MP3 is recommended for smaller file sizes (e.g., audiobooks).
-                                            </p>
-                                        </div>
-                                        <div class="col-span-1 md:col-span-2 mt-4 flex items-center gap-4">
-                                            <button id="save-gen-defaults-btn" type="button" class="btn-settings">Save
-                                                Generation Parameters</button>
-                                            <span id="gen-defaults-status" class="text-xs hidden"></span>
-                                        </div>
-                                    </div>
-                                </details>
-                            </div>
+                <div id="predefined-voice-options" class="mb-6">
+                  <label for="predefined-voice-select" class="label-base"
+                    >Select Predefined Voice:</label
+                  >
+                  <div class="flex items-center gap-2">
+                    <select
+                      id="predefined-voice-select"
+                      name="predefined_voice_select"
+                      class="select-base flex-grow"
+                    >
+                      <option value="none">-- Select Voice --</option>
+                    </select>
+                    <input
+                      type="file"
+                      id="predefined-voice-file-input"
+                      class="hidden"
+                      multiple
+                      accept=".wav,.mp3"
+                      aria-label="Upload predefined voice file"
+                    />
+                    <button
+                      type="button"
+                      id="predefined-voice-import-button"
+                      class="btn-import"
+                      title="Import new predefined voice files"
+                    >
+                      <svg
+                        xmlns="http://www.w3.org/2000/svg"
+                        viewBox="0 0 20 20"
+                        fill="currentColor"
+                      >
+                        <path
+                          d="M9.25 13.25a.75.75 0 0 0 1.5 0V4.636l2.955 3.129a.75.75 0 0 0 1.09-1.03l-4.25-4.5a.75.75 0 0 0-1.09 0l-4.25 4.5a.75.75 0 1 0 1.09 1.03L9.25 4.636v8.614Z"
+                        />
+                        <path
+                          d="M3.5 12.75a.75.75 0 0 0-1.5 0v2.5A2.75 2.75 0 0 0 4.75 18h10.5A2.75 2.75 0 0 0 18 15.25v-2.5a.75.75 0 0 0-1.5 0v2.5c0 .69-.56 1.25-1.25 1.25H4.75c-.69 0-1.25-.56-1.25-1.25v-2.5Z"
+                        />
+                      </svg>
+                      Import
+                    </button>
+                    <button
+                      type="button"
+                      id="predefined-voice-refresh-button"
+                      class="btn-refresh"
+                      title="Refresh predefined voice list"
+                    >
+                      <svg
+                        xmlns="http://www.w3.org/2000/svg"
+                        viewBox="0 0 20 20"
+                        fill="currentColor"
+                      >
+                        <path
+                          fill-rule="evenodd"
+                          d="M15.312 11.424a5.5 5.5 0 0 1-9.201-4.42 5.5 5.5 0 0 1 10.89 2.01a.75.75 0 0 1-1.49.174a4 4 0 0 0-7.486-2.641a4 4 0 0 0 6.688 2.947a.75.75 0 0 1 .999.057l-.004-.005ZM4.688 8.576a5.5 5.5 0 0 1 9.201 4.42a5.5 5.5 0 0 1-10.89-2.01a.75.75 0 0 1 1.49-.174a4 4 0 0 0 7.486 2.641a4 4 0 0 0-6.688-2.947a.75.75 0 0 1-.999-.057l.004.005Z"
+                          clip-rule="evenodd"
+                        />
+                      </svg>
+                      Refresh
+                    </button>
+                  </div>
+                </div>
 
-                            <div class="mb-6">
-                                <details class="group" open> <!-- Server Config expanded by default -->
-                                    <summary class="list-none flex cursor-pointer items-center group">
-                                        <span
-                                            class="text-sm font-medium label-base group-hover:text-indigo-600 dark:group-hover:text-indigo-400 transition-colors">Server
-                                            Configuration</span>
-                                        <span
-                                            class="ml-1.5 text-slate-400 group-hover:text-indigo-500 dark:group-hover:text-indigo-400 transition-colors">
-                                            <svg class="group-open:rotate-180 h-5 w-5 transition-transform duration-200 ease-in-out"
-                                                viewBox="0 0 20 20" fill="currentColor">
-                                                <path fill-rule="evenodd"
-                                                    d="M5.293 7.293a1 1 0 011.414 0L10 10.586l3.293-3.293a1 1 0 111.414 1.414l-4 4a1 1 0 01-1.414 0l-4-4a1 1 0 010-1.414z"
-                                                    clip-rule="evenodd" />
-                                            </svg>
-                                        </span>
-                                    </summary>
-                                    <div id="server-config-form"
-                                        class="mt-4 border-t border-slate-200 dark:border-slate-700 pt-5">
-                                        <p class="text-xs text-slate-500 dark:text-slate-400 mb-4">
-                                            These settings are loaded from <code class="code-inline">config.yaml</code>
-                                            via an API call.
-                                            <strong class="font-medium">Restart the server</strong> to apply changes to
-                                            Host, Port, Model, or Path settings if modified here or directly in the
-                                            file.
-                                        </p>
-                                        <div class="grid grid-cols-1 md:grid-cols-2 gap-x-6 gap-y-4">
-                                            <div><label for="config-server-host" class="label-base text-xs">Server
-                                                    Host</label><input type="text" id="config-server-host"
-                                                    name="server.host" class="input-base text-sm"></div>
-                                            <div><label for="config-server-port" class="label-base text-xs">Server
-                                                    Port</label><input type="number" id="config-server-port"
-                                                    name="server.port" class="input-base text-sm"></div>
-                                            <div><label for="config-tts-engine-device" class="label-base text-xs">TTS
-                                                    Device</label><input type="text" id="config-tts-engine-device"
-                                                    name="tts_engine.device" class="input-base text-sm"></div>
-                                            <div><label for="config-tts-engine-default_voice_id"
-                                                    class="label-base text-xs">Default Voice ID</label><input
-                                                    type="text" id="config-tts-engine-default_voice_id"
-                                                    name="tts_engine.default_voice_id" class="input-base text-sm"></div>
-                                            <div><label for="config-paths-model_cache" class="label-base text-xs">Model
-                                                    Cache Path</label><input type="text" id="config-paths-model_cache"
-                                                    name="paths.model_cache" class="input-base text-sm"></div>
-                                            <div><label for="config-tts-engine-predefined_voices_path"
-                                                    class="label-base text-xs">Predefined Voices Path</label><input
-                                                    type="text" id="config-tts-engine-predefined_voices_path"
-                                                    name="tts_engine.predefined_voices_path" class="input-base text-sm">
-                                            </div>
-                                            <div><label for="config-tts-engine-reference_audio_path"
-                                                    class="label-base text-xs">Reference Audio Path</label><input
-                                                    type="text" id="config-tts-engine-reference_audio_path"
-                                                    name="tts_engine.reference_audio_path" class="input-base text-sm">
-                                            </div>
-                                            <div><label for="config-paths-output" class="label-base text-xs">Output
-                                                    Path</label><input type="text" id="config-paths-output"
-                                                    name="paths.output" class="input-base text-sm"></div>
-                                            <div><label for="config-audio_output-format"
-                                                    class="label-base text-xs">Audio Output Format</label><input
-                                                    type="text" id="config-audio_output-format"
-                                                    name="audio_output.format" class="input-base text-sm"></div>
-                                            <div><label for="config-audio_output-sample_rate"
-                                                    class="label-base text-xs">Audio Sample Rate</label><input
-                                                    type="number" id="config-audio_output-sample_rate"
-                                                    name="audio_output.sample_rate" class="input-base text-sm"></div>
+                <div id="clone-options" class="mb-6 hidden">
+                  <label for="clone-reference-select" class="label-base"
+                    >Reference Audio File</label
+                  >
+                  <p class="text-xs text-slate-500 dark:text-slate-400 mb-2">
+                    Select an uploaded <code class="code-inline">.wav</code> or
+                    <code class="code-inline">.mp3</code> file. For best
+                    results, use clean audio recordings.
+                  </p>
+                  <div class="flex items-center gap-2">
+                    <select
+                      id="clone-reference-select"
+                      name="clone_reference_select"
+                      class="select-base flex-grow"
+                    >
+                      <option value="none">-- Select Reference File --</option>
+                    </select>
+                    <input
+                      type="file"
+                      id="clone-file-input"
+                      class="hidden"
+                      multiple
+                      accept=".wav,.mp3"
+                      aria-label="Upload reference audio file"
+                    />
+                    <button
+                      type="button"
+                      id="clone-import-button"
+                      class="btn-import"
+                      title="Import new reference files"
+                    >
+                      <svg
+                        xmlns="http://www.w3.org/2000/svg"
+                        viewBox="0 0 20 20"
+                        fill="currentColor"
+                      >
+                        <path
+                          d="M9.25 13.25a.75.75 0 0 0 1.5 0V4.636l2.955 3.129a.75.75 0 0 0 1.09-1.03l-4.25-4.5a.75.75 0 0 0-1.09 0l-4.25 4.5a.75.75 0 1 0 1.09 1.03L9.25 4.636v8.614Z"
+                        />
+                        <path
+                          d="M3.5 12.75a.75.75 0 0 0-1.5 0v2.5A2.75 2.75 0 0 0 4.75 18h10.5A2.75 2.75 0 0 0 18 15.25v-2.5a.75.75 0 0 0-1.5 0v2.5c0 .69-.56 1.25-1.25 1.25H4.75c-.69 0-1.25-.56-1.25-1.25v-2.5Z"
+                        />
+                      </svg>
+                      Import
+                    </button>
+                    <button
+                      type="button"
+                      id="clone-refresh-button"
+                      class="btn-refresh"
+                      title="Refresh reference file list"
+                    >
+                      <svg
+                        xmlns="http://www.w3.org/2000/svg"
+                        viewBox="0 0 20 20"
+                        fill="currentColor"
+                      >
+                        <path
+                          fill-rule="evenodd"
+                          d="M15.312 11.424a5.5 5.5 0 0 1-9.201-4.42 5.5 5.5 0 0 1 10.89 2.01a.75.75 0 0 1-1.49.174a4 4 0 0 0-7.486-2.641a4 4 0 0 0 6.688 2.947a.75.75 0 0 1 .999.057l-.004-.005ZM4.688 8.576a5.5 5.5 0 0 1 9.201 4.42a5.5 5.5 0 0 1-10.89-2.01a.75.75 0 0 1 1.49-.174a4 4 0 0 0 7.486 2.641a4 4 0 0 0-6.688-2.947a.75.75 0 0 1-.999-.057l.004.005Z"
+                          clip-rule="evenodd"
+                        />
+                      </svg>
+                      Refresh
+                    </button>
+                  </div>
+                </div>
 
-                                            <div
-                                                class="col-span-1 md:col-span-2 mt-4 flex flex-col md:flex-row gap-4 items-center">
-                                                <button id="save-config-btn" type="button" class="btn-settings">Save
-                                                    Server Configuration</button>
-                                                <button id="restart-server-btn" type="button"
-                                                    class="btn-danger w-full md:w-auto hidden">
-                                                    <svg xmlns="http://www.w3.org/2000/svg" fill="none"
-                                                        viewBox="0 0 24 24" stroke-width="1.5" stroke="currentColor"
-                                                        class="w-5 h-5 mr-1.5 inline-block">
-                                                        <path stroke-linecap="round" stroke-linejoin="round"
-                                                            d="M16.023 9.348h4.992v-.001M2.985 19.644v-4.992m0 0h4.992m-4.993 0 3.181 3.183a8.25 8.25 0 0 0 13.803-3.7M4.031 9.865a8.25 8.25 0 0 1 13.803-3.7l3.181 3.182m0-4.991v4.99" />
-                                                    </svg>
-                                                    Restart Server
-                                                </button>
-                                                <span id="config-status" class="text-xs ml-2 hidden"></span>
-                                            </div>
-                                        </div>
-                                    </div>
-                                </details>
-                            </div>
-                        </div>
+                <div class="mb-6">
+                  <label class="label-base mb-2">Load Example Preset:</label>
+                  <div id="presets-container" class="flex flex-wrap gap-2">
+                    <p
+                      id="presets-placeholder"
+                      class="text-sm text-slate-500 dark:text-slate-400"
+                    >
+                      Loading presets...
+                    </p>
+                  </div>
+                </div>
 
-                        <div class="card-footer">
-                            <button id="reset-settings-btn" type="button" class="btn-settings">Reset All
-                                Settings</button>
-                            <div class="text-xs text-slate-500 dark:text-slate-400 ml-2">
-                                Resets settings to their defaults.
-                            </div>
+                <div class="mb-6">
+                  <details class="group" open>
+                    <summary
+                      class="list-none flex cursor-pointer items-center group"
+                    >
+                      <span
+                        class="text-sm font-medium label-base group-hover:text-indigo-600 dark:group-hover:text-indigo-400 transition-colors"
+                        >Generation Parameters</span
+                      >
+                      <span
+                        class="ml-1.5 text-slate-400 group-hover:text-indigo-500 dark:group-hover:text-indigo-400 transition-colors"
+                      >
+                        <svg
+                          class="group-open:rotate-180 h-5 w-5 transition-transform duration-200 ease-in-out"
+                          viewBox="0 0 20 20"
+                          fill="currentColor"
+                        >
+                          <path
+                            fill-rule="evenodd"
+                            d="M5.293 7.293a1 1 0 011.414 0L10 10.586l3.293-3.293a1 1 0 111.414 1.414l-4 4a1 1 0 01-1.414 0l-4-4a1 1 0 010-1.414z"
+                            clip-rule="evenodd"
+                          />
+                        </svg>
+                      </span>
+                    </summary>
+                    <div
+                      class="mt-4 grid grid-cols-1 md:grid-cols-2 gap-x-6 gap-y-5"
+                    >
+                      <div>
+                        <label for="temperature" class="label-base"
+                          >Temperature (<span
+                            id="temperature-value"
+                            class="font-semibold tabular-nums"
+                            >0.8</span
+                          >)</label
+                        >
+                        <input
+                          type="range"
+                          id="temperature"
+                          name="temperature"
+                          min="0.0"
+                          max="1.5"
+                          step="0.01"
+                          value="0.8"
+                          class="slider-base"
+                        />
+                      </div>
+                      <div>
+                        <label for="exaggeration" class="label-base"
+                          >Exaggeration (<span
+                            id="exaggeration-value"
+                            class="font-semibold tabular-nums"
+                            >0.5</span
+                          >)</label
+                        >
+                        <input
+                          type="range"
+                          id="exaggeration"
+                          name="exaggeration"
+                          min="0.5"
+                          max="3.0"
+                          step="0.1"
+                          value="0.5"
+                          class="slider-base"
+                        />
+                        <p
+                          class="text-xs text-slate-500 dark:text-slate-400 mt-1.5"
+                        >
+                          0.5-0.8: Calm/subdued • 1.0: Natural • 1.5-2.0:
+                          Energetic • 2.5-3.0: Very dramatic
+                        </p>
+                      </div>
+                      <div>
+                        <label for="cfg-weight" class="label-base"
+                          >CFG Weight (<span
+                            id="cfg-weight-value"
+                            class="font-semibold tabular-nums"
+                            >0.5</span
+                          >)</label
+                        >
+                        <input
+                          type="range"
+                          id="cfg-weight"
+                          name="cfg_weight"
+                          min="0.0"
+                          max="2.0"
+                          step="0.01"
+                          value="0.5"
+                          class="slider-base"
+                        />
+                      </div>
+                      <div>
+                        <label for="speed-factor" class="label-base">
+                          Speed Factor (<span
+                            id="speed-factor-value"
+                            class="font-semibold tabular-nums"
+                            >1.0</span
+                          >)
+                          <span
+                            id="speed-factor-warning"
+                            class="text-xs text-yellow-600 dark:text-yellow-400 ml-2 hidden"
+                          ></span>
+                        </label>
+                        <input
+                          type="range"
+                          id="speed-factor"
+                          name="speed_factor"
+                          min="0.25"
+                          max="4.0"
+                          step="0.05"
+                          value="1.0"
+                          class="slider-base"
+                        />
+                      </div>
+                      <div>
+                        <label for="seed" class="label-base"
+                          >Generation Seed</label
+                        >
+                        <div class="w-36">
+                          <input
+                            type="number"
+                            id="seed"
+                            name="seed"
+                            value="0"
+                            placeholder="0 (or -1 for random)"
+                            class="input-base"
+                            title="Enter an integer seed for reproducible results, or -1 for random. 0 may also be random depending on engine."
+                          />
                         </div>
-                    </form>
+                        <p
+                          class="text-xs text-slate-500 dark:text-slate-400 mt-1.5"
+                        >
+                          Integer for reproducible results. Some engines use 0
+                          or -1 for random.
+                        </p>
+                      </div>
+                      <div id="model-select-container">
+                        <label for="model-type" class="label-base"
+                          >TTS Model</label
+                        >
+                        <select
+                          id="model-type"
+                          name="model_type"
+                          class="select-base w-48"
+                        >
+                          <option value="multilingual" selected>
+                            Multilingual (23 Languages)
+                          </option>
+                          <option value="english-only">English Only</option>
+                        </select>
+                        <p
+                          class="text-xs text-slate-500 dark:text-slate-400 mt-1.5"
+                        >
+                          Multilingual model supports 23 languages. Requires
+                          server restart to apply.
+                        </p>
+                      </div>
+                      <div id="language-select-container">
+                        <label for="language" class="label-base"
+                          >Language</label
+                        >
+                        <select
+                          id="language"
+                          name="language"
+                          class="select-base w-36"
+                        >
+                          <option value="ar">Arabic (ar)</option>
+                          <option value="da">Danish (da)</option>
+                          <option value="de">German (de)</option>
+                          <option value="el">Greek (el)</option>
+                          <option value="en">English (en)</option>
+                          <option value="es">Spanish (es)</option>
+                          <option value="fi">Finnish (fi)</option>
+                          <option value="fr">French (fr)</option>
+                          <option value="he">Hebrew (he)</option>
+                          <option value="hi" selected>Hindi (hi)</option>
+                          <option value="it">Italian (it)</option>
+                          <option value="ja">Japanese (ja)</option>
+                          <option value="ko">Korean (ko)</option>
+                          <option value="ms">Malay (ms)</option>
+                          <option value="nl">Dutch (nl)</option>
+                          <option value="no">Norwegian (no)</option>
+                          <option value="pl">Polish (pl)</option>
+                          <option value="pt">Portuguese (pt)</option>
+                          <option value="ru">Russian (ru)</option>
+                          <option value="sv">Swedish (sv)</option>
+                          <option value="sw">Swahili (sw)</option>
+                          <option value="tr">Turkish (tr)</option>
+                          <option value="zh">Chinese (zh)</option>
+                        </select>
+                        <p
+                          class="text-xs text-slate-500 dark:text-slate-400 mt-1.5"
+                        >
+                          Select the language for speech synthesis. Hindi
+                          selected by default.
+                        </p>
+                      </div>
+                      <div>
+                        <label for="output-format" class="label-base"
+                          >Output Format</label
+                        >
+                        <select
+                          id="output-format"
+                          name="output_format"
+                          class="select-base w-36"
+                        >
+                          <option value="wav">WAV</option>
+                          <option value="mp3" selected>MP3</option>
+                          <option value="opus">Opus</option>
+                        </select>
+                        <p
+                          class="text-xs text-slate-500 dark:text-slate-400 mt-1.5"
+                        >
+                          MP3 is recommended for smaller file sizes (e.g.,
+                          audiobooks).
+                        </p>
+                      </div>
+                      <div
+                        class="col-span-1 md:col-span-2 mt-4 flex items-center gap-4"
+                      >
+                        <button
+                          id="save-gen-defaults-btn"
+                          type="button"
+                          class="btn-settings"
+                        >
+                          Save Generation Parameters
+                        </button>
+                        <span
+                          id="gen-defaults-status"
+                          class="text-xs hidden"
+                        ></span>
+                      </div>
+                    </div>
+                  </details>
                 </div>
 
-                <div id="audio-player-container" class="mt-8"></div>
+                <div class="mb-6">
+                  <details class="group" open>
+                    <!-- Server Config expanded by default -->
+                    <summary
+                      class="list-none flex cursor-pointer items-center group"
+                    >
+                      <span
+                        class="text-sm font-medium label-base group-hover:text-indigo-600 dark:group-hover:text-indigo-400 transition-colors"
+                        >Server Configuration</span
+                      >
+                      <span
+                        class="ml-1.5 text-slate-400 group-hover:text-indigo-500 dark:group-hover:text-indigo-400 transition-colors"
+                      >
+                        <svg
+                          class="group-open:rotate-180 h-5 w-5 transition-transform duration-200 ease-in-out"
+                          viewBox="0 0 20 20"
+                          fill="currentColor"
+                        >
+                          <path
+                            fill-rule="evenodd"
+                            d="M5.293 7.293a1 1 0 011.414 0L10 10.586l3.293-3.293a1 1 0 111.414 1.414l-4 4a1 1 0 01-1.414 0l-4-4a1 1 0 010-1.414z"
+                            clip-rule="evenodd"
+                          />
+                        </svg>
+                      </span>
+                    </summary>
+                    <div
+                      id="server-config-form"
+                      class="mt-4 border-t border-slate-200 dark:border-slate-700 pt-5"
+                    >
+                      <p
+                        class="text-xs text-slate-500 dark:text-slate-400 mb-4"
+                      >
+                        These settings are loaded from
+                        <code class="code-inline">config.yaml</code>
+                        via an API call.
+                        <strong class="font-medium">Restart the server</strong>
+                        to apply changes to Host, Port, Model, or Path settings
+                        if modified here or directly in the file.
+                      </p>
+                      <div
+                        class="grid grid-cols-1 md:grid-cols-2 gap-x-6 gap-y-4"
+                      >
+                        <div>
+                          <label
+                            for="config-server-host"
+                            class="label-base text-xs"
+                            >Server Host</label
+                          ><input
+                            type="text"
+                            id="config-server-host"
+                            name="server.host"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-server-port"
+                            class="label-base text-xs"
+                            >Server Port</label
+                          ><input
+                            type="number"
+                            id="config-server-port"
+                            name="server.port"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-tts-engine-device"
+                            class="label-base text-xs"
+                            >TTS Device</label
+                          ><input
+                            type="text"
+                            id="config-tts-engine-device"
+                            name="tts_engine.device"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-tts-engine-default_voice_id"
+                            class="label-base text-xs"
+                            >Default Voice ID</label
+                          ><input
+                            type="text"
+                            id="config-tts-engine-default_voice_id"
+                            name="tts_engine.default_voice_id"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-paths-model_cache"
+                            class="label-base text-xs"
+                            >Model Cache Path</label
+                          ><input
+                            type="text"
+                            id="config-paths-model_cache"
+                            name="paths.model_cache"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-tts-engine-predefined_voices_path"
+                            class="label-base text-xs"
+                            >Predefined Voices Path</label
+                          ><input
+                            type="text"
+                            id="config-tts-engine-predefined_voices_path"
+                            name="tts_engine.predefined_voices_path"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-tts-engine-reference_audio_path"
+                            class="label-base text-xs"
+                            >Reference Audio Path</label
+                          ><input
+                            type="text"
+                            id="config-tts-engine-reference_audio_path"
+                            name="tts_engine.reference_audio_path"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-paths-output"
+                            class="label-base text-xs"
+                            >Output Path</label
+                          ><input
+                            type="text"
+                            id="config-paths-output"
+                            name="paths.output"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-audio_output-format"
+                            class="label-base text-xs"
+                            >Audio Output Format</label
+                          ><input
+                            type="text"
+                            id="config-audio_output-format"
+                            name="audio_output.format"
+                            class="input-base text-sm"
+                          />
+                        </div>
+                        <div>
+                          <label
+                            for="config-audio_output-sample_rate"
+                            class="label-base text-xs"
+                            >Audio Sample Rate</label
+                          ><input
+                            type="number"
+                            id="config-audio_output-sample_rate"
+                            name="audio_output.sample_rate"
+                            class="input-base text-sm"
+                          />
+                        </div>
 
-                <div class="mt-10">
-                    <h2 class="card-header mb-4 text-base font-semibold">Tips & Tricks</h2>
-                    <div class="card-base">
-                        <div class="p-6 sm:p-8">
-                            <ul class="list-disc space-y-2.5 pl-5 text-sm text-slate-700 dark:text-slate-300">
-                                <li>For **Audiobooks**, use **MP3** format, enable **Split text**, and set a chunk size
-                                    of ~250-500.</li>
-                                <li>Use **Predefined Voices** for consistent, high-quality output. You can import new
-                                    ones.</li>
-                                <li>For **Voice Cloning**, upload clean reference audio (<code
-                                        class="code-inline">.wav</code>/<code class="code-inline">.mp3</code>). Quality
-                                    of reference is key.</li>
-                                <li>Experiment with **Temperature** and other generation parameters to fine-tune output.
-                                </li>
-                                <li>Adjusting **Speed Factor** away from 1.0 is experimental and may cause echo.</li>
-                                <li>Check the <code class="code-inline">/docs</code> endpoint for API details.</li>
-                            </ul>
+                        <div
+                          class="col-span-1 md:col-span-2 mt-4 flex flex-col md:flex-row gap-4 items-center"
+                        >
+                          <button
+                            id="save-config-btn"
+                            type="button"
+                            class="btn-settings"
+                          >
+                            Save Server Configuration
+                          </button>
+                          <button
+                            id="restart-server-btn"
+                            type="button"
+                            class="btn-danger w-full md:w-auto hidden"
+                          >
+                            <svg
+                              xmlns="http://www.w3.org/2000/svg"
+                              fill="none"
+                              viewBox="0 0 24 24"
+                              stroke-width="1.5"
+                              stroke="currentColor"
+                              class="w-5 h-5 mr-1.5 inline-block"
+                            >
+                              <path
+                                stroke-linecap="round"
+                                stroke-linejoin="round"
+                                d="M16.023 9.348h4.992v-.001M2.985 19.644v-4.992m0 0h4.992m-4.993 0 3.181 3.183a8.25 8.25 0 0 0 13.803-3.7M4.031 9.865a8.25 8.25 0 0 1 13.803-3.7l3.181 3.182m0-4.991v4.99"
+                              />
+                            </svg>
+                            Restart Server
+                          </button>
+                          <span
+                            id="config-status"
+                            class="text-xs ml-2 hidden"
+                          ></span>
                         </div>
+                      </div>
                     </div>
+                  </details>
                 </div>
-            </div>
-        </main>
+              </div>
 
-        <footer class="bg-slate-100 dark:bg-slate-800 py-8 mt-16 border-t border-slate-200 dark:border-slate-700/80">
-            <div class="mx-auto max-w-7xl px-4 sm:px-6 lg:px-8">
-                <div class="flex justify-center">
-                    <a href="https://github.com/devnen/Chatterbox-TTS-Server" target="_blank" rel="noopener noreferrer"
-                        class="flex items-center gap-2 text-slate-500 dark:text-slate-400 text-sm hover:text-indigo-600 dark:hover:text-indigo-400 transition-colors">
-                        <svg xmlns="http://www.w3.org/2000/svg" width="18" height="18" fill="currentColor"
-                            viewBox="0 0 16 16" class="flex-shrink-0">
-                            <path
-                                d="M8 0C3.58 0 0 3.58 0 8c0 3.54 2.29 6.53 5.47 7.59.4.07.55-.17.55-.38 0-.19-.01-.82-.01-1.49-2.01.37-2.53-.49-2.69-.94-.09-.23-.48-.94-.82-1.13-.28-.15-.68-.52-.01-.53.63-.01 1.08.58 1.23.82.72 1.21 1.87.87 2.33.66.07-.52.28-.87.51-1.07-1.78-.2-3.64-.89-3.64-3.95 0-.87.31-1.59.82-2.15-.08-.2-.36-1.02.08-2.12 0 0 .67-.21 2.2.82.64-.18 1.32-.27 2-.27.68 0 1.36.09 2 .27 1.53-1.04 2.2-.82 2.2-.82.44 1.1.16 1.92.08 2.12.51.56.82 1.27.82 2.15 0 3.07-1.87 3.75-3.65 3.95.29.25.54.73.54 1.48 0 1.07-.01 1.93-.01 2.2 0 .21.15.46.55.38A8.012 8.012 0 0 0 16 8c0-4.42-3.58-8-8-8z" />
-                        </svg>
-                        <span>View Project on GitHub | Powered by FastAPI</span>
-                    </a>
+              <div class="card-footer">
+                <button
+                  id="reset-settings-btn"
+                  type="button"
+                  class="btn-settings"
+                >
+                  Reset All Settings
+                </button>
+                <div class="text-xs text-slate-500 dark:text-slate-400 ml-2">
+                  Resets settings to their defaults.
                 </div>
+              </div>
+            </form>
+          </div>
+
+          <div id="audio-player-container" class="mt-8"></div>
+
+          <div class="mt-10">
+            <h2 class="card-header mb-4 text-base font-semibold">
+              Tips & Tricks
+            </h2>
+            <div class="card-base">
+              <div class="p-6 sm:p-8">
+                <ul
+                  class="list-disc space-y-2.5 pl-5 text-sm text-slate-700 dark:text-slate-300"
+                >
+                  <li>
+                    For **Audiobooks**, use **MP3** format, enable **Split
+                    text**, and set a chunk size of ~250-500.
+                  </li>
+                  <li>
+                    Use **Predefined Voices** for consistent, high-quality
+                    output. You can import new ones.
+                  </li>
+                  <li>
+                    For **Voice Cloning**, upload clean reference audio (<code
+                      class="code-inline"
+                      >.wav</code
+                    >/<code class="code-inline">.mp3</code>). Quality of
+                    reference is key.
+                  </li>
+                  <li>
+                    Experiment with **Temperature** and other generation
+                    parameters to fine-tune output.
+                  </li>
+                  <li>
+                    Adjusting **Speed Factor** away from 1.0 is experimental and
+                    may cause echo.
+                  </li>
+                  <li>
+                    Check the <code class="code-inline">/docs</code> endpoint
+                    for API details.
+                  </li>
+                </ul>
+              </div>
             </div>
-        </footer>
-    </div>
+          </div>
+        </div>
+      </main>
 
-    <div id="loading-overlay" class="loading-overlay-base hidden opacity-0" data-state="closed" style="display: none;">
-        <div class="loading-box-base">
-            <svg class="loading-spinner animate-spin" xmlns="http://www.w3.org/2000/svg" fill="none"
-                viewBox="0 0 24 24">
-                <circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle>
-                <path class="opacity-75" fill="currentColor"
-                    d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z">
-                </path>
-            </svg>
-            <p id="loading-message" class="loading-text">Generating audio...</p>
-            <p id="loading-status" class="loading-status">Please wait.</p>
-            <button id="loading-cancel-btn" type="button" class="btn-secondary mt-1"
-                title="Cancel UI request. Backend may continue briefly.">Cancel</button>
+      <footer
+        class="bg-slate-100 dark:bg-slate-800 py-8 mt-16 border-t border-slate-200 dark:border-slate-700/80"
+      >
+        <div class="mx-auto max-w-7xl px-4 sm:px-6 lg:px-8">
+          <div class="flex justify-center">
+            <a
+              href="https://github.com/devnen/Chatterbox-TTS-Server"
+              target="_blank"
+              rel="noopener noreferrer"
+              class="flex items-center gap-2 text-slate-500 dark:text-slate-400 text-sm hover:text-indigo-600 dark:hover:text-indigo-400 transition-colors"
+            >
+              <svg
+                xmlns="http://www.w3.org/2000/svg"
+                width="18"
+                height="18"
+                fill="currentColor"
+                viewBox="0 0 16 16"
+                class="flex-shrink-0"
+              >
+                <path
+                  d="M8 0C3.58 0 0 3.58 0 8c0 3.54 2.29 6.53 5.47 7.59.4.07.55-.17.55-.38 0-.19-.01-.82-.01-1.49-2.01.37-2.53-.49-2.69-.94-.09-.23-.48-.94-.82-1.13-.28-.15-.68-.52-.01-.53.63-.01 1.08.58 1.23.82.72 1.21 1.87.87 2.33.66.07-.52.28-.87.51-1.07-1.78-.2-3.64-.89-3.64-3.95 0-.87.31-1.59.82-2.15-.08-.2-.36-1.02.08-2.12 0 0 .67-.21 2.2.82.64-.18 1.32-.27 2-.27.68 0 1.36.09 2 .27 1.53-1.04 2.2-.82 2.2-.82.44 1.1.16 1.92.08 2.12.51.56.82 1.27.82 2.15 0 3.07-1.87 3.75-3.65 3.95.29.25.54.73.54 1.48 0 1.07-.01 1.93-.01 2.2 0 .21.15.46.55.38A8.012 8.012 0 0 0 16 8c0-4.42-3.58-8-8-8z"
+                />
+              </svg>
+              <span>View Project on GitHub | Powered by FastAPI</span>
+            </a>
+          </div>
         </div>
+      </footer>
     </div>
 
-    <div id="chunk-warning-modal" class="modal-overlay hidden opacity-0" data-state="closed" style="display: none;">
-        <div class="modal-box">
-            <h3 class="modal-title">Chunking Voice Consistency Warning</h3>
-            <div class="modal-content">
-                <p>You are generating with text splitting enabled and not using Predefined Voices or Voice Cloning with
-                    a fixed seed.</p>
-                <p class="mt-2">This combination may result in <strong class="font-semibold">different voices for
-                        different text chunks</strong>, leading to inconsistent audio.</p>
-                <p class="mt-2">For consistent voices across chunks, please use:</p>
-                <ul class="list-disc pl-5 mt-1 space-y-1">
-                    <li>"Predefined Voices" mode.</li>
-                    <li>"Voice Cloning" mode.</li>
-                    <li>A specific integer in "Generation Seed" (not random).</li>
-                </ul>
-            </div>
-            <div class="mt-4 mb-5 flex items-center">
-                <input type="checkbox" id="hide-chunk-warning-checkbox"
-                    class="h-4 w-4 rounded border-slate-300 text-indigo-600 focus:ring-indigo-500 dark:border-slate-600 dark:bg-slate-700 dark:focus:ring-offset-slate-800 dark:checked:bg-indigo-500 dark:checked:border-indigo-500">
-                <label for="hide-chunk-warning-checkbox"
-                    class="ml-2 block text-sm text-slate-700 dark:text-slate-300 cursor-pointer">Do not show this
-                    warning again</label>
-            </div>
-            <div class="modal-actions">
-                <button id="chunk-warning-cancel" type="button" class="btn-secondary">Cancel Generation</button>
-                <button id="chunk-warning-ok" type="button" class="btn-primary">Proceed Anyway</button>
-            </div>
+    <div
+      id="loading-overlay"
+      class="loading-overlay-base hidden opacity-0"
+      data-state="closed"
+      style="display: none"
+    >
+      <div class="loading-box-base">
+        <svg
+          class="loading-spinner animate-spin"
+          xmlns="http://www.w3.org/2000/svg"
+          fill="none"
+          viewBox="0 0 24 24"
+        >
+          <circle
+            class="opacity-25"
+            cx="12"
+            cy="12"
+            r="10"
+            stroke="currentColor"
+            stroke-width="4"
+          ></circle>
+          <path
+            class="opacity-75"
+            fill="currentColor"
+            d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"
+          ></path>
+        </svg>
+        <p id="loading-message" class="loading-text">Generating audio...</p>
+        <p id="loading-status" class="loading-status">Please wait.</p>
+        <button
+          id="loading-cancel-btn"
+          type="button"
+          class="btn-secondary mt-1"
+          title="Cancel UI request. Backend may continue briefly."
+        >
+          Cancel
+        </button>
+      </div>
+    </div>
+
+    <div
+      id="chunk-warning-modal"
+      class="modal-overlay hidden opacity-0"
+      data-state="closed"
+      style="display: none"
+    >
+      <div class="modal-box">
+        <h3 class="modal-title">Chunking Voice Consistency Warning</h3>
+        <div class="modal-content">
+          <p>
+            You are generating with text splitting enabled and not using
+            Predefined Voices or Voice Cloning with a fixed seed.
+          </p>
+          <p class="mt-2">
+            This combination may result in
+            <strong class="font-semibold"
+              >different voices for different text chunks</strong
+            >, leading to inconsistent audio.
+          </p>
+          <p class="mt-2">For consistent voices across chunks, please use:</p>
+          <ul class="list-disc pl-5 mt-1 space-y-1">
+            <li>"Predefined Voices" mode.</li>
+            <li>"Voice Cloning" mode.</li>
+            <li>A specific integer in "Generation Seed" (not random).</li>
+          </ul>
+        </div>
+        <div class="mt-4 mb-5 flex items-center">
+          <input
+            type="checkbox"
+            id="hide-chunk-warning-checkbox"
+            class="h-4 w-4 rounded border-slate-300 text-indigo-600 focus:ring-indigo-500 dark:border-slate-600 dark:bg-slate-700 dark:focus:ring-offset-slate-800 dark:checked:bg-indigo-500 dark:checked:border-indigo-500"
+          />
+          <label
+            for="hide-chunk-warning-checkbox"
+            class="ml-2 block text-sm text-slate-700 dark:text-slate-300 cursor-pointer"
+            >Do not show this warning again</label
+          >
+        </div>
+        <div class="modal-actions">
+          <button id="chunk-warning-cancel" type="button" class="btn-secondary">
+            Cancel Generation
+          </button>
+          <button id="chunk-warning-ok" type="button" class="btn-primary">
+            Proceed Anyway
+          </button>
         </div>
+      </div>
     </div>
 
-    <div id="generation-warning-modal" class="modal-overlay hidden opacity-0" data-state="closed"
-        style="display: none;">
-        <div class="modal-box">
-            <h3 class="modal-title">Generation Quality Notice</h3>
-            <div class="modal-content">
-                <p>Please be aware that text-to-speech models may sometimes produce unexpected results or artifacts.</p>
-                <p class="mt-2">This can include variations in voice consistency, delivery, or minor audio
-                    imperfections. Experiment with parameters for best results.</p>
-            </div>
-            <div class="mt-4 mb-5 flex items-center">
-                <input type="checkbox" id="hide-generation-warning-checkbox"
-                    class="h-4 w-4 rounded border-slate-300 text-indigo-600 focus:ring-indigo-500 dark:border-slate-600 dark:bg-slate-700 dark:focus:ring-offset-slate-800 dark:checked:bg-indigo-500 dark:checked:border-indigo-500">
-                <label for="hide-generation-warning-checkbox"
-                    class="ml-2 block text-sm text-slate-700 dark:text-slate-300 cursor-pointer">Do not show this
-                    warning again</label>
-            </div>
-            <div class="modal-actions">
-                <button id="generation-warning-acknowledge" type="button" class="btn-primary">Acknowledge &
-                    Generate</button>
-            </div>
+    <div
+      id="generation-warning-modal"
+      class="modal-overlay hidden opacity-0"
+      data-state="closed"
+      style="display: none"
+    >
+      <div class="modal-box">
+        <h3 class="modal-title">Generation Quality Notice</h3>
+        <div class="modal-content">
+          <p>
+            Please be aware that text-to-speech models may sometimes produce
+            unexpected results or artifacts.
+          </p>
+          <p class="mt-2">
+            This can include variations in voice consistency, delivery, or minor
+            audio imperfections. Experiment with parameters for best results.
+          </p>
         </div>
+        <div class="mt-4 mb-5 flex items-center">
+          <input
+            type="checkbox"
+            id="hide-generation-warning-checkbox"
+            class="h-4 w-4 rounded border-slate-300 text-indigo-600 focus:ring-indigo-500 dark:border-slate-600 dark:bg-slate-700 dark:focus:ring-offset-slate-800 dark:checked:bg-indigo-500 dark:checked:border-indigo-500"
+          />
+          <label
+            for="hide-generation-warning-checkbox"
+            class="ml-2 block text-sm text-slate-700 dark:text-slate-300 cursor-pointer"
+            >Do not show this warning again</label
+          >
+        </div>
+        <div class="modal-actions">
+          <button
+            id="generation-warning-acknowledge"
+            type="button"
+            class="btn-primary"
+          >
+            Acknowledge & Generate
+          </button>
+        </div>
+      </div>
     </div>
 
     <script src="script.js" defer></script>
-</body>
-
-</html>
\ No newline at end of file
+  </body>
+</html>
diff --git a/ui/script.js b/ui/script.js
index 726d6af..da38c76 100644
--- a/ui/script.js
+++ b/ui/script.js
@@ -2,514 +2,732 @@
 // Client-side JavaScript for the Chatterbox TTS Server web interface.
 // Handles UI interactions, API communication, audio playback, and settings management.
 
-document.addEventListener('DOMContentLoaded', async function () {
-    // --- Global Flags & State ---
-    let uiReady = false;
-    let listenersAttached = false;
-    let isGenerating = false;
-    let wavesurfer = null;
-    let currentAudioBlobUrl = null;
-    let saveStateTimeout = null;
-
-    let currentConfig = {};
-    let currentUiState = {};
-    let appPresets = [];
-    let initialReferenceFiles = [];
-    let initialPredefinedVoices = [];
-
-    let hideChunkWarning = false;
-    let hideGenerationWarning = false;
-    let currentVoiceMode = 'predefined';
-
-    const IS_LOCAL_FILE = window.location.protocol === 'file:';
-    // If you always access the server via localhost
-    const API_BASE_URL = IS_LOCAL_FILE ? 'http://localhost:8004' : '';
-
-    const DEBOUNCE_DELAY_MS = 750;
-
-
-    // --- DOM Element Selectors ---
-    const appTitleLink = document.getElementById('app-title-link');
-    const themeToggleButton = document.getElementById('theme-toggle-btn');
-    const themeSwitchThumb = themeToggleButton ? themeToggleButton.querySelector('.theme-switch-thumb') : null;
-    const notificationArea = document.getElementById('notification-area');
-    const ttsForm = document.getElementById('tts-form');
-    const ttsFormHeader = document.getElementById('tts-form-header');
-    const textArea = document.getElementById('text');
-    const charCount = document.getElementById('char-count');
-    const generateBtn = document.getElementById('generate-btn');
-    const splitTextToggle = document.getElementById('split-text-toggle');
-    const chunkSizeControls = document.getElementById('chunk-size-controls');
-    const chunkSizeSlider = document.getElementById('chunk-size-slider');
-    const chunkSizeValue = document.getElementById('chunk-size-value');
-    const chunkExplanation = document.getElementById('chunk-explanation');
-    const voiceModeRadios = document.querySelectorAll('input[name="voice_mode"]');
-    const predefinedVoiceOptionsDiv = document.getElementById('predefined-voice-options');
-    const predefinedVoiceSelect = document.getElementById('predefined-voice-select');
-    const predefinedVoiceImportButton = document.getElementById('predefined-voice-import-button');
-    const predefinedVoiceRefreshButton = document.getElementById('predefined-voice-refresh-button');
-    const predefinedVoiceFileInput = document.getElementById('predefined-voice-file-input');
-    const cloneOptionsDiv = document.getElementById('clone-options');
-    const cloneReferenceSelect = document.getElementById('clone-reference-select');
-    const cloneImportButton = document.getElementById('clone-import-button');
-    const cloneRefreshButton = document.getElementById('clone-refresh-button');
-    const cloneFileInput = document.getElementById('clone-file-input');
-    const presetsContainer = document.getElementById('presets-container');
-    const presetsPlaceholder = document.getElementById('presets-placeholder');
-    const temperatureSlider = document.getElementById('temperature');
-    const temperatureValueDisplay = document.getElementById('temperature-value');
-    const exaggerationSlider = document.getElementById('exaggeration');
-    const exaggerationValueDisplay = document.getElementById('exaggeration-value');
-    const cfgWeightSlider = document.getElementById('cfg-weight');
-    const cfgWeightValueDisplay = document.getElementById('cfg-weight-value');
-    const speedFactorSlider = document.getElementById('speed-factor');
-    const speedFactorValueDisplay = document.getElementById('speed-factor-value');
-    const speedFactorWarningSpan = document.getElementById('speed-factor-warning');
-    const seedInput = document.getElementById('seed');
-    const languageSelectContainer = document.getElementById('language-select-container');
-    const languageSelect = document.getElementById('language');
-    const outputFormatSelect = document.getElementById('output-format');
-    const saveGenDefaultsBtn = document.getElementById('save-gen-defaults-btn');
-    const genDefaultsStatus = document.getElementById('gen-defaults-status');
-    const serverConfigForm = document.getElementById('server-config-form');
-    const saveConfigBtn = document.getElementById('save-config-btn');
-    const restartServerBtn = document.getElementById('restart-server-btn');
-    const configStatus = document.getElementById('config-status');
-    const resetSettingsBtn = document.getElementById('reset-settings-btn');
-    const audioPlayerContainer = document.getElementById('audio-player-container');
-    const loadingOverlay = document.getElementById('loading-overlay');
-    const loadingMessage = document.getElementById('loading-message');
-    const loadingStatusText = document.getElementById('loading-status');
-    const loadingCancelBtn = document.getElementById('loading-cancel-btn');
-    const chunkWarningModal = document.getElementById('chunk-warning-modal');
-    const chunkWarningOkBtn = document.getElementById('chunk-warning-ok');
-    const chunkWarningCancelBtn = document.getElementById('chunk-warning-cancel');
-    const hideChunkWarningCheckbox = document.getElementById('hide-chunk-warning-checkbox');
-    const generationWarningModal = document.getElementById('generation-warning-modal');
-    const generationWarningAcknowledgeBtn = document.getElementById('generation-warning-acknowledge');
-    const hideGenerationWarningCheckbox = document.getElementById('hide-generation-warning-checkbox');
-
-
-    // Handle voice mode selection visual feedback
-    const voiceModeOptions = document.querySelectorAll('.voice-mode-option');
-
-    voiceModeRadios.forEach(radio => {
-        radio.addEventListener('change', function () {
-            // Remove selected class from all options
-            voiceModeOptions.forEach(option => {
-                option.classList.remove('selected');
-            });
-
-            // Add selected class to the parent of the checked radio
-            const selectedOption = this.closest('.voice-mode-option');
-            if (selectedOption) {
-                selectedOption.classList.add('selected');
-            }
-        });
+document.addEventListener("DOMContentLoaded", async function () {
+  // --- Global Flags & State ---
+  let uiReady = false;
+  let listenersAttached = false;
+  let isGenerating = false;
+  let wavesurfer = null;
+  let currentAudioBlobUrl = null;
+  let saveStateTimeout = null;
+
+  let currentConfig = {};
+  let currentUiState = {};
+  let appPresets = [];
+  let initialReferenceFiles = [];
+  let initialPredefinedVoices = [];
+
+  let hideChunkWarning = false;
+  let hideGenerationWarning = false;
+  let currentVoiceMode = "predefined";
+
+  const IS_LOCAL_FILE = window.location.protocol === "file:";
+  // If you always access the server via localhost
+  const API_BASE_URL = IS_LOCAL_FILE ? "http://localhost:8004" : "";
+
+  const DEBOUNCE_DELAY_MS = 750;
+
+  // --- DOM Element Selectors ---
+  const appTitleLink = document.getElementById("app-title-link");
+  const themeToggleButton = document.getElementById("theme-toggle-btn");
+  const themeSwitchThumb = themeToggleButton
+    ? themeToggleButton.querySelector(".theme-switch-thumb")
+    : null;
+  const notificationArea = document.getElementById("notification-area");
+  const ttsForm = document.getElementById("tts-form");
+  const ttsFormHeader = document.getElementById("tts-form-header");
+  const textArea = document.getElementById("text");
+  const charCount = document.getElementById("char-count");
+  const generateBtn = document.getElementById("generate-btn");
+  const splitTextToggle = document.getElementById("split-text-toggle");
+  const chunkSizeControls = document.getElementById("chunk-size-controls");
+  const chunkSizeSlider = document.getElementById("chunk-size-slider");
+  const chunkSizeValue = document.getElementById("chunk-size-value");
+  const chunkExplanation = document.getElementById("chunk-explanation");
+  const voiceModeRadios = document.querySelectorAll('input[name="voice_mode"]');
+  const predefinedVoiceOptionsDiv = document.getElementById(
+    "predefined-voice-options"
+  );
+  const predefinedVoiceSelect = document.getElementById(
+    "predefined-voice-select"
+  );
+  const predefinedVoiceImportButton = document.getElementById(
+    "predefined-voice-import-button"
+  );
+  const predefinedVoiceRefreshButton = document.getElementById(
+    "predefined-voice-refresh-button"
+  );
+  const predefinedVoiceFileInput = document.getElementById(
+    "predefined-voice-file-input"
+  );
+  const cloneOptionsDiv = document.getElementById("clone-options");
+  const cloneReferenceSelect = document.getElementById(
+    "clone-reference-select"
+  );
+  const cloneImportButton = document.getElementById("clone-import-button");
+  const cloneRefreshButton = document.getElementById("clone-refresh-button");
+  const cloneFileInput = document.getElementById("clone-file-input");
+  const presetsContainer = document.getElementById("presets-container");
+  const presetsPlaceholder = document.getElementById("presets-placeholder");
+  const temperatureSlider = document.getElementById("temperature");
+  const temperatureValueDisplay = document.getElementById("temperature-value");
+  const exaggerationSlider = document.getElementById("exaggeration");
+  const exaggerationValueDisplay =
+    document.getElementById("exaggeration-value");
+  const cfgWeightSlider = document.getElementById("cfg-weight");
+  const cfgWeightValueDisplay = document.getElementById("cfg-weight-value");
+  const speedFactorSlider = document.getElementById("speed-factor");
+  const speedFactorValueDisplay = document.getElementById("speed-factor-value");
+  const speedFactorWarningSpan = document.getElementById(
+    "speed-factor-warning"
+  );
+  const seedInput = document.getElementById("seed");
+  const modelSelectContainer = document.getElementById(
+    "model-select-container"
+  );
+  const modelTypeSelect = document.getElementById("model-type");
+  const languageSelectContainer = document.getElementById(
+    "language-select-container"
+  );
+  const languageSelect = document.getElementById("language");
+  const outputFormatSelect = document.getElementById("output-format");
+  const saveGenDefaultsBtn = document.getElementById("save-gen-defaults-btn");
+  const genDefaultsStatus = document.getElementById("gen-defaults-status");
+  const serverConfigForm = document.getElementById("server-config-form");
+  const saveConfigBtn = document.getElementById("save-config-btn");
+  const restartServerBtn = document.getElementById("restart-server-btn");
+  const configStatus = document.getElementById("config-status");
+  const resetSettingsBtn = document.getElementById("reset-settings-btn");
+  const audioPlayerContainer = document.getElementById(
+    "audio-player-container"
+  );
+  const loadingOverlay = document.getElementById("loading-overlay");
+  const loadingMessage = document.getElementById("loading-message");
+  const loadingStatusText = document.getElementById("loading-status");
+  const loadingCancelBtn = document.getElementById("loading-cancel-btn");
+  const chunkWarningModal = document.getElementById("chunk-warning-modal");
+  const chunkWarningOkBtn = document.getElementById("chunk-warning-ok");
+  const chunkWarningCancelBtn = document.getElementById("chunk-warning-cancel");
+  const hideChunkWarningCheckbox = document.getElementById(
+    "hide-chunk-warning-checkbox"
+  );
+  const generationWarningModal = document.getElementById(
+    "generation-warning-modal"
+  );
+  const generationWarningAcknowledgeBtn = document.getElementById(
+    "generation-warning-acknowledge"
+  );
+  const hideGenerationWarningCheckbox = document.getElementById(
+    "hide-generation-warning-checkbox"
+  );
+
+  // Handle voice mode selection visual feedback
+  const voiceModeOptions = document.querySelectorAll(".voice-mode-option");
+
+  voiceModeRadios.forEach((radio) => {
+    radio.addEventListener("change", function () {
+      // Remove selected class from all options
+      voiceModeOptions.forEach((option) => {
+        option.classList.remove("selected");
+      });
+
+      // Add selected class to the parent of the checked radio
+      const selectedOption = this.closest(".voice-mode-option");
+      if (selectedOption) {
+        selectedOption.classList.add("selected");
+      }
     });
-
-    // Set initial state
-    const checkedRadio = document.querySelector('input[name="voice_mode"]:checked');
-    if (checkedRadio) {
-        const selectedOption = checkedRadio.closest('.voice-mode-option');
-        if (selectedOption) {
-            selectedOption.classList.add('selected');
-        }
+  });
+
+  // Set initial state
+  const checkedRadio = document.querySelector(
+    'input[name="voice_mode"]:checked'
+  );
+  if (checkedRadio) {
+    const selectedOption = checkedRadio.closest(".voice-mode-option");
+    if (selectedOption) {
+      selectedOption.classList.add("selected");
     }
-
-    // --- Utility Functions ---
-    function showNotification(message, type = 'info', duration = 5000) {
-        if (!notificationArea) return null;
-        const icons = {
-            success: '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M10 18a8 8 0 100-16 8 8 0 000 16zm3.707-9.293a1 1 0 00-1.414-1.414L9 10.586 7.707 9.293a1 1 0 00-1.414 1.414l2 2a1 1 0 001.414 0l4-4z" clip-rule="evenodd" /></svg>',
-            error: '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M10 18a8 8 0 100-16 8 8 0 000 16zM8.707 7.293a1 1 0 00-1.414 1.414L8.586 10l-1.293 1.293a1 1 0 101.414 1.414L10 11.414l1.293 1.293a1 1 0 001.414-1.414L11.414 10l1.293-1.293a1 1 0 00-1.414-1.414L10 8.586 8.707 7.293z" clip-rule="evenodd" /></svg>',
-            warning: '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M8.485 2.495c.673-1.167 2.357-1.167 3.03 0l6.28 10.875c.673 1.167-.17 2.625-1.516 2.625H3.72c-1.347 0-2.189-1.458-1.515-2.625L8.485 2.495zM10 5a.75.75 0 01.75.75v3.5a.75.75 0 01-1.5 0v-3.5A.75.75 0 0110 5zm0 9a1 1 0 100-2 1 1 0 000 2z" clip-rule="evenodd" /></svg>',
-            info: '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M18 10a8 8 0 11-16 0 8 8 0 0116 0zm-7-4a1 1 0 11-2 0 1 1 0 012 0zM9 9a.75.75 0 000 1.5h.253a.25.25 0 01.244.304l-.459 2.066A1.75 1.75 0 0010.747 15H11a.75.75 0 000-1.5h-.253a.25.25 0 01-.244-.304l.459-2.066A1.75 1.75 0 009.253 9H9z" clip-rule="evenodd" /></svg>'
-        };
-        const typeClassMap = { success: 'notification-success', error: 'notification-error', warning: 'notification-warning', info: 'notification-info' };
-        const notificationDiv = document.createElement('div');
-        notificationDiv.className = `notification-base ${typeClassMap[type] || 'notification-info'}`;
-        notificationDiv.setAttribute('role', 'alert');
-        // Create content wrapper
-        const contentWrapper = document.createElement('div');
-        contentWrapper.className = 'flex items-start flex-grow';
-        contentWrapper.innerHTML = `${icons[type] || icons['info']} <span class="block sm:inline">${message}</span>`;
-
-        // Create close button
-        const closeButton = document.createElement('button');
-        closeButton.type = 'button';
-        closeButton.className = 'ml-auto -mx-1.5 -my-1.5 bg-transparent rounded-lg p-1.5 inline-flex h-8 w-8 items-center justify-center text-current hover:bg-slate-200 dark:hover:bg-slate-700 focus:outline-none focus:ring-2 focus:ring-slate-400 flex-shrink-0';
-        closeButton.innerHTML = '<span class="sr-only">Close</span><svg class="w-5 h-5" fill="currentColor" viewBox="0 0 20 20"><path fill-rule="evenodd" d="M4.293 4.293a1 1 0 011.414 0L10 8.586l4.293-4.293a1 1 0 111.414 1.414L11.414 10l4.293 4.293a1 1 0 01-1.414 1.414L10 11.414l-4.293 4.293a1 1 0 01-1.414-1.414L8.586 10 4.293 5.707a1 1 0 010-1.414z" clip-rule="evenodd"></path></svg>';
-        closeButton.onclick = () => {
-            notificationDiv.style.transition = 'opacity 0.3s ease, transform 0.3s ease';
-            notificationDiv.style.opacity = '0';
-            notificationDiv.style.transform = 'translateY(-20px)';
-            setTimeout(() => notificationDiv.remove(), 300);
-        };
-
-        // Add both to notification
-        notificationDiv.appendChild(contentWrapper);
-        notificationDiv.appendChild(closeButton);
-        notificationArea.appendChild(notificationDiv);
-        if (duration > 0) setTimeout(() => closeButton.click(), duration);
-        return notificationDiv;
+  }
+
+  // --- Utility Functions ---
+  function showNotification(message, type = "info", duration = 5000) {
+    if (!notificationArea) return null;
+    const icons = {
+      success:
+        '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M10 18a8 8 0 100-16 8 8 0 000 16zm3.707-9.293a1 1 0 00-1.414-1.414L9 10.586 7.707 9.293a1 1 0 00-1.414 1.414l2 2a1 1 0 001.414 0l4-4z" clip-rule="evenodd" /></svg>',
+      error:
+        '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M10 18a8 8 0 100-16 8 8 0 000 16zM8.707 7.293a1 1 0 00-1.414 1.414L8.586 10l-1.293 1.293a1 1 0 101.414 1.414L10 11.414l1.293 1.293a1 1 0 001.414-1.414L11.414 10l1.293-1.293a1 1 0 00-1.414-1.414L10 8.586 8.707 7.293z" clip-rule="evenodd" /></svg>',
+      warning:
+        '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M8.485 2.495c.673-1.167 2.357-1.167 3.03 0l6.28 10.875c.673 1.167-.17 2.625-1.516 2.625H3.72c-1.347 0-2.189-1.458-1.515-2.625L8.485 2.495zM10 5a.75.75 0 01.75.75v3.5a.75.75 0 01-1.5 0v-3.5A.75.75 0 0110 5zm0 9a1 1 0 100-2 1 1 0 000 2z" clip-rule="evenodd" /></svg>',
+      info: '<svg class="notification-icon" viewBox="0 0 20 20" fill="currentColor"><path fill-rule="evenodd" d="M18 10a8 8 0 11-16 0 8 8 0 0116 0zm-7-4a1 1 0 11-2 0 1 1 0 012 0zM9 9a.75.75 0 000 1.5h.253a.25.25 0 01.244.304l-.459 2.066A1.75 1.75 0 0010.747 15H11a.75.75 0 000-1.5h-.253a.25.25 0 01-.244-.304l.459-2.066A1.75 1.75 0 009.253 9H9z" clip-rule="evenodd" /></svg>',
+    };
+    const typeClassMap = {
+      success: "notification-success",
+      error: "notification-error",
+      warning: "notification-warning",
+      info: "notification-info",
+    };
+    const notificationDiv = document.createElement("div");
+    notificationDiv.className = `notification-base ${
+      typeClassMap[type] || "notification-info"
+    }`;
+    notificationDiv.setAttribute("role", "alert");
+    // Create content wrapper
+    const contentWrapper = document.createElement("div");
+    contentWrapper.className = "flex items-start flex-grow";
+    contentWrapper.innerHTML = `${
+      icons[type] || icons["info"]
+    } <span class="block sm:inline">${message}</span>`;
+
+    // Create close button
+    const closeButton = document.createElement("button");
+    closeButton.type = "button";
+    closeButton.className =
+      "ml-auto -mx-1.5 -my-1.5 bg-transparent rounded-lg p-1.5 inline-flex h-8 w-8 items-center justify-center text-current hover:bg-slate-200 dark:hover:bg-slate-700 focus:outline-none focus:ring-2 focus:ring-slate-400 flex-shrink-0";
+    closeButton.innerHTML =
+      '<span class="sr-only">Close</span><svg class="w-5 h-5" fill="currentColor" viewBox="0 0 20 20"><path fill-rule="evenodd" d="M4.293 4.293a1 1 0 011.414 0L10 8.586l4.293-4.293a1 1 0 111.414 1.414L11.414 10l4.293 4.293a1 1 0 01-1.414 1.414L10 11.414l-4.293 4.293a1 1 0 01-1.414-1.414L8.586 10 4.293 5.707a1 1 0 010-1.414z" clip-rule="evenodd"></path></svg>';
+    closeButton.onclick = () => {
+      notificationDiv.style.transition =
+        "opacity 0.3s ease, transform 0.3s ease";
+      notificationDiv.style.opacity = "0";
+      notificationDiv.style.transform = "translateY(-20px)";
+      setTimeout(() => notificationDiv.remove(), 300);
+    };
+
+    // Add both to notification
+    notificationDiv.appendChild(contentWrapper);
+    notificationDiv.appendChild(closeButton);
+    notificationArea.appendChild(notificationDiv);
+    if (duration > 0) setTimeout(() => closeButton.click(), duration);
+    return notificationDiv;
+  }
+
+  function formatTime(seconds) {
+    const minutes = Math.floor(seconds / 60);
+    const secs = Math.floor(seconds % 60)
+      .toString()
+      .padStart(2, "0");
+    return `${minutes}:${secs}`;
+  }
+
+  // --- Theme Management ---
+  function applyTheme(theme) {
+    const isDark = theme === "dark";
+    document.documentElement.classList.toggle("dark", isDark);
+    if (themeSwitchThumb) {
+      themeSwitchThumb.classList.toggle("translate-x-6", isDark);
+      themeSwitchThumb.classList.toggle("bg-indigo-500", isDark);
+      themeSwitchThumb.classList.toggle("bg-white", !isDark);
     }
-
-    function formatTime(seconds) {
-        const minutes = Math.floor(seconds / 60);
-        const secs = Math.floor(seconds % 60).toString().padStart(2, '0');
-        return `${minutes}:${secs}`;
+    if (wavesurfer) {
+      wavesurfer.setOptions({
+        waveColor: isDark ? "#6366f1" : "#a5b4fc",
+        progressColor: isDark ? "#4f46e5" : "#6366f1",
+        cursorColor: isDark ? "#cbd5e1" : "#475569",
+      });
     }
-
-    // --- Theme Management ---
-    function applyTheme(theme) {
-        const isDark = theme === 'dark';
-        document.documentElement.classList.toggle('dark', isDark);
-        if (themeSwitchThumb) {
-            themeSwitchThumb.classList.toggle('translate-x-6', isDark);
-            themeSwitchThumb.classList.toggle('bg-indigo-500', isDark);
-            themeSwitchThumb.classList.toggle('bg-white', !isDark);
-        }
-        if (wavesurfer) {
-            wavesurfer.setOptions({
-                waveColor: isDark ? '#6366f1' : '#a5b4fc',
-                progressColor: isDark ? '#4f46e5' : '#6366f1',
-                cursorColor: isDark ? '#cbd5e1' : '#475569',
-            });
-        }
-        localStorage.setItem('uiTheme', theme);
+    localStorage.setItem("uiTheme", theme);
+  }
+
+  if (themeToggleButton) {
+    themeToggleButton.addEventListener("click", () => {
+      const newTheme = document.documentElement.classList.contains("dark")
+        ? "light"
+        : "dark";
+      applyTheme(newTheme);
+      debouncedSaveState();
+    });
+  }
+
+  // --- UI State Persistence ---
+  async function saveCurrentUiState() {
+    const stateToSave = {
+      last_text: textArea ? textArea.value : "",
+      last_voice_mode: currentVoiceMode,
+      last_predefined_voice: predefinedVoiceSelect
+        ? predefinedVoiceSelect.value
+        : null,
+      last_reference_file: cloneReferenceSelect
+        ? cloneReferenceSelect.value
+        : null,
+      last_seed: seedInput ? parseInt(seedInput.value, 10) || 0 : 0,
+      last_chunk_size: chunkSizeSlider
+        ? parseInt(chunkSizeSlider.value, 10)
+        : 120,
+      last_split_text_enabled: splitTextToggle ? splitTextToggle.checked : true,
+      hide_chunk_warning: hideChunkWarning,
+      hide_generation_warning: hideGenerationWarning,
+      theme: localStorage.getItem("uiTheme") || "dark",
+    };
+    try {
+      const response = await fetch(`${API_BASE_URL}/save_settings`, {
+        method: "POST",
+        headers: { "Content-Type": "application/json" },
+        body: JSON.stringify({ ui_state: stateToSave }),
+      });
+      if (!response.ok) {
+        const errorResult = await response.json();
+        throw new Error(
+          errorResult.detail ||
+            `Failed to save UI state (status ${response.status})`
+        );
+      }
+    } catch (error) {
+      console.error("Error saving UI state via API:", error);
+      showNotification(
+        `Error saving settings: ${error.message}. Some changes may not persist.`,
+        "error",
+        0
+      );
     }
+  }
 
-    if (themeToggleButton) {
-        themeToggleButton.addEventListener('click', () => {
-            const newTheme = document.documentElement.classList.contains('dark') ? 'light' : 'dark';
-            applyTheme(newTheme);
-            debouncedSaveState();
-        });
+  function debouncedSaveState() {
+    // Do not save anything until the entire UI has finished its initial setup.
+    // MODIFICATION: Add a check for listenersAttached.
+    if (!uiReady || !listenersAttached) {
+      return;
+    }
+    clearTimeout(saveStateTimeout);
+    saveStateTimeout = setTimeout(saveCurrentUiState, DEBOUNCE_DELAY_MS);
+  }
+
+  // --- Speed Factor Warning ---
+  function updateSpeedFactorWarning() {
+    if (speedFactorSlider && speedFactorWarningSpan) {
+      const value = parseFloat(speedFactorSlider.value);
+      if (value !== 1.0) {
+        speedFactorWarningSpan.textContent = "* Experimental, may cause echo.";
+        speedFactorWarningSpan.classList.remove("hidden");
+      } else {
+        speedFactorWarningSpan.classList.add("hidden");
+      }
+    }
+  }
+
+  // --- Initial Application Setup ---
+  function initializeApplication() {
+    const preferredTheme =
+      localStorage.getItem("uiTheme") || currentUiState.theme || "dark";
+    applyTheme(preferredTheme);
+    const pageTitle = currentConfig?.ui?.title || "Chatterbox TTS Server";
+    document.title = pageTitle;
+    if (appTitleLink) appTitleLink.textContent = pageTitle;
+    if (ttsFormHeader) ttsFormHeader.textContent = `Generate Speech`;
+
+    // Update model status badge
+    const modelStatusBadge = document.getElementById("model-status-badge");
+    if (modelStatusBadge) {
+      const useMultilingual = currentConfig?.model?.use_multilingual !== false;
+      modelStatusBadge.textContent = useMultilingual
+        ? "🌐 Multilingual (23 Languages)"
+        : "🇬🇧 English Only";
+      modelStatusBadge.title = useMultilingual
+        ? "Currently loaded: Multilingual model supporting 23 languages"
+        : "Currently loaded: English-only model";
     }
 
-    // --- UI State Persistence ---
-    async function saveCurrentUiState() {
-        const stateToSave = {
-            last_text: textArea ? textArea.value : '',
-            last_voice_mode: currentVoiceMode,
-            last_predefined_voice: predefinedVoiceSelect ? predefinedVoiceSelect.value : null,
-            last_reference_file: cloneReferenceSelect ? cloneReferenceSelect.value : null,
-            last_seed: seedInput ? parseInt(seedInput.value, 10) || 0 : 0,
-            last_chunk_size: chunkSizeSlider ? parseInt(chunkSizeSlider.value, 10) : 120,
-            last_split_text_enabled: splitTextToggle ? splitTextToggle.checked : true,
-            hide_chunk_warning: hideChunkWarning,
-            hide_generation_warning: hideGenerationWarning,
-            theme: localStorage.getItem('uiTheme') || 'dark'
+    loadInitialUiState();
+    populatePredefinedVoices();
+    populateReferenceFiles();
+    populatePresets();
+    displayServerConfiguration();
+    if (
+      languageSelectContainer &&
+      currentConfig?.ui?.show_language_select === false
+    ) {
+      languageSelectContainer.classList.add("hidden");
+    }
+    updateSpeedFactorWarning(); // Initial check for speed factor warning
+    // attachStateSavingListeners();
+    const initialGenResult = currentConfig.initial_gen_result;
+    if (initialGenResult && initialGenResult.outputUrl) {
+      initializeWaveSurfer(initialGenResult.outputUrl, initialGenResult);
+    }
+  }
+
+  async function fetchInitialData() {
+    try {
+      const response = await fetch(`${API_BASE_URL}/api/ui/initial-data`);
+      if (!response.ok) {
+        const errorText = await response.text();
+        throw new Error(
+          `Failed to fetch initial UI data: ${response.status} ${response.statusText}. Server response: ${errorText}`
+        );
+      }
+      const data = await response.json();
+      currentConfig = data.config || {};
+      currentUiState = currentConfig.ui_state || {};
+      appPresets = data.presets || [];
+      initialReferenceFiles = data.reference_files || [];
+      initialPredefinedVoices = data.predefined_voices || [];
+      hideChunkWarning = currentUiState.hide_chunk_warning || false;
+      hideGenerationWarning = currentUiState.hide_generation_warning || false;
+      currentVoiceMode = currentUiState.last_voice_mode || "predefined";
+
+      // This now ONLY sets values. It does NOT attach state-saving listeners.
+      initializeApplication();
+    } catch (error) {
+      console.error("Error fetching initial data:", error);
+      showNotification(
+        `Could not load essential application data: ${error.message}. Please try refreshing.`,
+        "error",
+        0
+      );
+      if (Object.keys(currentConfig).length === 0) {
+        currentConfig = {
+          ui: { title: "Chatterbox TTS Server (Error Mode)" },
+          generation_defaults: {},
+          ui_state: {},
         };
-        try {
-            const response = await fetch(`${API_BASE_URL}/save_settings`, {
-                method: 'POST',
-                headers: { 'Content-Type': 'application/json' },
-                body: JSON.stringify({ ui_state: stateToSave })
-            });
-            if (!response.ok) {
-                const errorResult = await response.json();
-                throw new Error(errorResult.detail || `Failed to save UI state (status ${response.status})`);
-            }
-        } catch (error) {
-            console.error("Error saving UI state via API:", error);
-            showNotification(`Error saving settings: ${error.message}. Some changes may not persist.`, 'error', 0);
-        }
+        currentUiState = currentConfig.ui_state;
+      }
+      initializeApplication(); // Attempt to init in a degraded state
+    } finally {
+      // --- PHASE 2: Attach listeners and enable UI readiness ---
+      // This pushes the listener attachment to the end of the event queue,
+      // ensuring all initialization events have fired harmlessly before we start listening.
+      setTimeout(() => {
+        attachStateSavingListeners();
+        listenersAttached = true;
+        uiReady = true;
+      }, 50); // A 50ms delay is more robust than 0ms for complex UIs.
     }
+  }
 
-    function debouncedSaveState() {
-        // Do not save anything until the entire UI has finished its initial setup.
-        // MODIFICATION: Add a check for listenersAttached.
-        if (!uiReady || !listenersAttached) { return; }
-        clearTimeout(saveStateTimeout);
-        saveStateTimeout = setTimeout(saveCurrentUiState, DEBOUNCE_DELAY_MS);
+  function loadInitialUiState() {
+    if (textArea && currentUiState.last_text) {
+      textArea.value = currentUiState.last_text;
+      if (charCount) charCount.textContent = textArea.value.length;
     }
-
-    // --- Speed Factor Warning ---
-    function updateSpeedFactorWarning() {
-        if (speedFactorSlider && speedFactorWarningSpan) {
-            const value = parseFloat(speedFactorSlider.value);
-            if (value !== 1.0) {
-                speedFactorWarningSpan.textContent = "* Experimental, may cause echo.";
-                speedFactorWarningSpan.classList.remove('hidden');
-            } else {
-                speedFactorWarningSpan.classList.add('hidden');
-            }
-        }
+    const modeRadioToSelect = document.querySelector(
+      `input[name="voice_mode"][value="${currentVoiceMode}"]`
+    );
+    if (modeRadioToSelect) modeRadioToSelect.checked = true;
+    else {
+      document.querySelector(
+        'input[name="voice_mode"][value="predefined"]'
+      ).checked = true;
+      currentVoiceMode = "predefined";
     }
-
-
-    // --- Initial Application Setup ---
-    function initializeApplication() {
-        const preferredTheme = localStorage.getItem('uiTheme') || currentUiState.theme || 'dark';
-        applyTheme(preferredTheme);
-        const pageTitle = currentConfig?.ui?.title || "Chatterbox TTS Server";
-        document.title = pageTitle;
-        if (appTitleLink) appTitleLink.textContent = pageTitle;
-        if (ttsFormHeader) ttsFormHeader.textContent = `Generate Speech`;
-        loadInitialUiState();
-        populatePredefinedVoices();
-        populateReferenceFiles();
-        populatePresets();
-        displayServerConfiguration();
-        if (languageSelectContainer && currentConfig?.ui?.show_language_select === false) {
-            languageSelectContainer.classList.add('hidden');
-        }
-        updateSpeedFactorWarning(); // Initial check for speed factor warning
-        // attachStateSavingListeners();
-        const initialGenResult = currentConfig.initial_gen_result;
-        if (initialGenResult && initialGenResult.outputUrl) {
-            initializeWaveSurfer(initialGenResult.outputUrl, initialGenResult);
-        }
+    toggleVoiceOptionsDisplay();
+    if (seedInput && currentUiState.last_seed !== undefined)
+      seedInput.value = currentUiState.last_seed;
+    else if (
+      seedInput &&
+      currentConfig?.generation_defaults?.seed !== undefined
+    )
+      seedInput.value = currentConfig.generation_defaults.seed;
+    if (splitTextToggle)
+      splitTextToggle.checked =
+        currentUiState.last_split_text_enabled !== undefined
+          ? currentUiState.last_split_text_enabled
+          : true;
+    if (chunkSizeSlider && currentUiState.last_chunk_size !== undefined)
+      chunkSizeSlider.value = currentUiState.last_chunk_size;
+    if (chunkSizeValue)
+      chunkSizeValue.textContent = chunkSizeSlider
+        ? chunkSizeSlider.value
+        : "120";
+    toggleChunkControlsVisibility();
+    const genDefaults = currentConfig.generation_defaults || {};
+    if (temperatureSlider)
+      temperatureSlider.value =
+        genDefaults.temperature !== undefined ? genDefaults.temperature : 0.8;
+    if (temperatureValueDisplay)
+      temperatureValueDisplay.textContent = temperatureSlider.value;
+    if (exaggerationSlider)
+      exaggerationSlider.value =
+        genDefaults.exaggeration !== undefined ? genDefaults.exaggeration : 0.5;
+    if (exaggerationValueDisplay)
+      exaggerationValueDisplay.textContent = exaggerationSlider.value;
+    if (cfgWeightSlider)
+      cfgWeightSlider.value =
+        genDefaults.cfg_weight !== undefined ? genDefaults.cfg_weight : 0.5;
+    if (cfgWeightValueDisplay)
+      cfgWeightValueDisplay.textContent = cfgWeightSlider.value;
+    if (speedFactorSlider)
+      speedFactorSlider.value =
+        genDefaults.speed_factor !== undefined ? genDefaults.speed_factor : 1.0;
+    if (speedFactorValueDisplay)
+      speedFactorValueDisplay.textContent = speedFactorSlider.value;
+
+    // Set model type from config
+    if (modelTypeSelect) {
+      const useMultilingual = currentConfig?.model?.use_multilingual !== false; // Default to true
+      modelTypeSelect.value = useMultilingual ? "multilingual" : "english-only";
     }
 
-    async function fetchInitialData() {
-        try {
-            const response = await fetch(`${API_BASE_URL}/api/ui/initial-data`);
-            if (!response.ok) {
-                const errorText = await response.text();
-                throw new Error(`Failed to fetch initial UI data: ${response.status} ${response.statusText}. Server response: ${errorText}`);
-            }
-            const data = await response.json();
-            currentConfig = data.config || {};
-            currentUiState = currentConfig.ui_state || {};
-            appPresets = data.presets || [];
-            initialReferenceFiles = data.reference_files || [];
-            initialPredefinedVoices = data.predefined_voices || [];
-            hideChunkWarning = currentUiState.hide_chunk_warning || false;
-            hideGenerationWarning = currentUiState.hide_generation_warning || false;
-            currentVoiceMode = currentUiState.last_voice_mode || 'predefined';
-
-            // This now ONLY sets values. It does NOT attach state-saving listeners.
-            initializeApplication();
-
-        } catch (error) {
-            console.error("Error fetching initial data:", error);
-            showNotification(`Could not load essential application data: ${error.message}. Please try refreshing.`, 'error', 0);
-            if (Object.keys(currentConfig).length === 0) {
-                currentConfig = { ui: { title: "Chatterbox TTS Server (Error Mode)" }, generation_defaults: {}, ui_state: {} };
-                currentUiState = currentConfig.ui_state;
-            }
-            initializeApplication(); // Attempt to init in a degraded state
-        } finally {
-            // --- PHASE 2: Attach listeners and enable UI readiness ---
-            // This pushes the listener attachment to the end of the event queue,
-            // ensuring all initialization events have fired harmlessly before we start listening.
-            setTimeout(() => {
-                attachStateSavingListeners();
-                listenersAttached = true;
-                uiReady = true;
-            }, 50); // A 50ms delay is more robust than 0ms for complex UIs.
-        }
+    if (languageSelect) languageSelect.value = genDefaults.language || "en";
+    if (outputFormatSelect)
+      outputFormatSelect.value = currentConfig?.audio_output?.format || "mp3";
+    if (hideChunkWarningCheckbox)
+      hideChunkWarningCheckbox.checked = hideChunkWarning;
+    if (hideGenerationWarningCheckbox)
+      hideGenerationWarningCheckbox.checked = hideGenerationWarning;
+    if (textArea && !textArea.value && appPresets && appPresets.length > 0) {
+      const defaultPreset =
+        appPresets.find((p) => p.name === "Standard Narration") ||
+        appPresets[0];
+      if (defaultPreset) applyPreset(defaultPreset, false, false);
     }
+  }
 
-    function loadInitialUiState() {
-        if (textArea && currentUiState.last_text) {
-            textArea.value = currentUiState.last_text;
-            if (charCount) charCount.textContent = textArea.value.length;
-        }
-        const modeRadioToSelect = document.querySelector(`input[name="voice_mode"][value="${currentVoiceMode}"]`);
-        if (modeRadioToSelect) modeRadioToSelect.checked = true;
-        else {
-            document.querySelector('input[name="voice_mode"][value="predefined"]').checked = true;
-            currentVoiceMode = 'predefined';
-        }
-        toggleVoiceOptionsDisplay();
-        if (seedInput && currentUiState.last_seed !== undefined) seedInput.value = currentUiState.last_seed;
-        else if (seedInput && currentConfig?.generation_defaults?.seed !== undefined) seedInput.value = currentConfig.generation_defaults.seed;
-        if (splitTextToggle) splitTextToggle.checked = currentUiState.last_split_text_enabled !== undefined ? currentUiState.last_split_text_enabled : true;
-        if (chunkSizeSlider && currentUiState.last_chunk_size !== undefined) chunkSizeSlider.value = currentUiState.last_chunk_size;
-        if (chunkSizeValue) chunkSizeValue.textContent = chunkSizeSlider ? chunkSizeSlider.value : '120';
+  function attachStateSavingListeners() {
+    voiceModeRadios.forEach((radio) => {
+      radio.addEventListener("change", debouncedSaveState);
+    });
+
+    if (textArea)
+      textArea.addEventListener("input", () => {
+        if (charCount) charCount.textContent = textArea.value.length;
+        debouncedSaveState();
+      });
+    if (predefinedVoiceSelect)
+      predefinedVoiceSelect.addEventListener("change", debouncedSaveState);
+    if (cloneReferenceSelect)
+      cloneReferenceSelect.addEventListener("change", debouncedSaveState);
+    if (seedInput) seedInput.addEventListener("change", debouncedSaveState);
+    if (splitTextToggle)
+      splitTextToggle.addEventListener("change", () => {
         toggleChunkControlsVisibility();
-        const genDefaults = currentConfig.generation_defaults || {};
-        if (temperatureSlider) temperatureSlider.value = genDefaults.temperature !== undefined ? genDefaults.temperature : 0.8;
-        if (temperatureValueDisplay) temperatureValueDisplay.textContent = temperatureSlider.value;
-        if (exaggerationSlider) exaggerationSlider.value = genDefaults.exaggeration !== undefined ? genDefaults.exaggeration : 0.5;
-        if (exaggerationValueDisplay) exaggerationValueDisplay.textContent = exaggerationSlider.value;
-        if (cfgWeightSlider) cfgWeightSlider.value = genDefaults.cfg_weight !== undefined ? genDefaults.cfg_weight : 0.5;
-        if (cfgWeightValueDisplay) cfgWeightValueDisplay.textContent = cfgWeightSlider.value;
-        if (speedFactorSlider) speedFactorSlider.value = genDefaults.speed_factor !== undefined ? genDefaults.speed_factor : 1.0;
-        if (speedFactorValueDisplay) speedFactorValueDisplay.textContent = speedFactorSlider.value;
-        if (languageSelect) languageSelect.value = genDefaults.language || 'en';
-        if (outputFormatSelect) outputFormatSelect.value = currentConfig?.audio_output?.format || 'mp3';
-        if (hideChunkWarningCheckbox) hideChunkWarningCheckbox.checked = hideChunkWarning;
-        if (hideGenerationWarningCheckbox) hideGenerationWarningCheckbox.checked = hideGenerationWarning;
-        if (textArea && !textArea.value && appPresets && appPresets.length > 0) {
-            const defaultPreset = appPresets.find(p => p.name === "Standard Narration") || appPresets[0];
-            if (defaultPreset) applyPreset(defaultPreset, false, false);
-        }
+        debouncedSaveState();
+      });
+    if (chunkSizeSlider) {
+      chunkSizeSlider.addEventListener("input", () => {
+        if (chunkSizeValue) chunkSizeValue.textContent = chunkSizeSlider.value;
+      });
+      chunkSizeSlider.addEventListener("change", debouncedSaveState);
     }
-
-    function attachStateSavingListeners() {
-        voiceModeRadios.forEach(radio => {
-            radio.addEventListener('change', debouncedSaveState);
+    const genParamSliders = [
+      temperatureSlider,
+      exaggerationSlider,
+      cfgWeightSlider,
+      speedFactorSlider,
+    ];
+    genParamSliders.forEach((slider) => {
+      if (slider) {
+        const valueDisplayId = slider.id + "-value";
+        const valueDisplay = document.getElementById(valueDisplayId);
+        slider.addEventListener("input", () => {
+          if (valueDisplay) valueDisplay.textContent = slider.value;
+          if (slider.id === "speed-factor") updateSpeedFactorWarning(); // Update warning on input
         });
-
-        if (textArea) textArea.addEventListener('input', () => { if (charCount) charCount.textContent = textArea.value.length; debouncedSaveState(); });
-        if (predefinedVoiceSelect) predefinedVoiceSelect.addEventListener('change', debouncedSaveState);
-        if (cloneReferenceSelect) cloneReferenceSelect.addEventListener('change', debouncedSaveState);
-        if (seedInput) seedInput.addEventListener('change', debouncedSaveState);
-        if (splitTextToggle) splitTextToggle.addEventListener('change', () => { toggleChunkControlsVisibility(); debouncedSaveState(); });
-        if (chunkSizeSlider) {
-            chunkSizeSlider.addEventListener('input', () => { if (chunkSizeValue) chunkSizeValue.textContent = chunkSizeSlider.value; });
-            chunkSizeSlider.addEventListener('change', debouncedSaveState);
+        slider.addEventListener("change", debouncedSaveState);
+      }
+    });
+    if (languageSelect)
+      languageSelect.addEventListener("change", debouncedSaveState);
+
+    // Add listener for model type selector
+    if (modelTypeSelect) {
+      modelTypeSelect.addEventListener("change", () => {
+        const originalValue =
+          currentConfig?.model?.use_multilingual !== false
+            ? "multilingual"
+            : "english-only";
+        const newValue = modelTypeSelect.value;
+        if (newValue !== originalValue) {
+          showNotification(
+            'Model type changed. Remember to click "Save Generation Parameters" and restart the server for changes to take effect.',
+            "info",
+            8000
+          );
         }
-        const genParamSliders = [temperatureSlider, exaggerationSlider, cfgWeightSlider, speedFactorSlider];
-        genParamSliders.forEach(slider => {
-            if (slider) {
-                const valueDisplayId = slider.id + '-value';
-                const valueDisplay = document.getElementById(valueDisplayId);
-                slider.addEventListener('input', () => {
-                    if (valueDisplay) valueDisplay.textContent = slider.value;
-                    if (slider.id === 'speed-factor') updateSpeedFactorWarning(); // Update warning on input
-                });
-                slider.addEventListener('change', debouncedSaveState);
-            }
-        });
-        if (languageSelect) languageSelect.addEventListener('change', debouncedSaveState);
-        if (outputFormatSelect) outputFormatSelect.addEventListener('change', debouncedSaveState);
+      });
     }
 
-    // --- Dynamic UI Population ---
-    function populatePredefinedVoices(voicesData = initialPredefinedVoices) {
-        if (!predefinedVoiceSelect) return;
-        const currentSelectedValue = predefinedVoiceSelect.value;
-        predefinedVoiceSelect.innerHTML = '<option value="none">-- Select Voice --</option>';
-        voicesData.forEach(voice => {
-            const option = document.createElement('option');
-            option.value = voice.filename;
-            option.textContent = voice.display_name || voice.filename;
-            predefinedVoiceSelect.appendChild(option);
-        });
-        const lastSelected = currentUiState.last_predefined_voice;
-        const defaultFromConfig = currentConfig?.tts_engine?.default_voice_id;
-        if (currentSelectedValue !== 'none' && voicesData.some(v => v.filename === currentSelectedValue)) {
-            predefinedVoiceSelect.value = currentSelectedValue;
-        } else if (lastSelected && voicesData.some(v => v.filename === lastSelected)) {
-            predefinedVoiceSelect.value = lastSelected;
-        } else if (defaultFromConfig && voicesData.some(v => v.filename === defaultFromConfig)) {
-            predefinedVoiceSelect.value = defaultFromConfig;
-        } else {
-            predefinedVoiceSelect.value = 'none';
-        }
+    if (outputFormatSelect)
+      outputFormatSelect.addEventListener("change", debouncedSaveState);
+  }
+
+  // --- Dynamic UI Population ---
+  function populatePredefinedVoices(voicesData = initialPredefinedVoices) {
+    if (!predefinedVoiceSelect) return;
+    const currentSelectedValue = predefinedVoiceSelect.value;
+    predefinedVoiceSelect.innerHTML =
+      '<option value="none">-- Select Voice --</option>';
+    voicesData.forEach((voice) => {
+      const option = document.createElement("option");
+      option.value = voice.filename;
+      option.textContent = voice.display_name || voice.filename;
+      predefinedVoiceSelect.appendChild(option);
+    });
+    const lastSelected = currentUiState.last_predefined_voice;
+    const defaultFromConfig = currentConfig?.tts_engine?.default_voice_id;
+    if (
+      currentSelectedValue !== "none" &&
+      voicesData.some((v) => v.filename === currentSelectedValue)
+    ) {
+      predefinedVoiceSelect.value = currentSelectedValue;
+    } else if (
+      lastSelected &&
+      voicesData.some((v) => v.filename === lastSelected)
+    ) {
+      predefinedVoiceSelect.value = lastSelected;
+    } else if (
+      defaultFromConfig &&
+      voicesData.some((v) => v.filename === defaultFromConfig)
+    ) {
+      predefinedVoiceSelect.value = defaultFromConfig;
+    } else {
+      predefinedVoiceSelect.value = "none";
     }
-
-    function populateReferenceFiles(filesData = initialReferenceFiles) {
-        if (!cloneReferenceSelect) return;
-        const currentSelectedValue = cloneReferenceSelect.value;
-        cloneReferenceSelect.innerHTML = '<option value="none">-- Select Reference File --</option>';
-        filesData.forEach(filename => {
-            const option = document.createElement('option');
-            option.value = filename;
-            option.textContent = filename;
-            cloneReferenceSelect.appendChild(option);
-        });
-        const lastSelected = currentUiState.last_reference_file;
-        if (currentSelectedValue !== 'none' && filesData.includes(currentSelectedValue)) {
-            cloneReferenceSelect.value = currentSelectedValue;
-        } else if (lastSelected && filesData.includes(lastSelected)) {
-            cloneReferenceSelect.value = lastSelected;
-        } else {
-            cloneReferenceSelect.value = 'none';
-        }
+  }
+
+  function populateReferenceFiles(filesData = initialReferenceFiles) {
+    if (!cloneReferenceSelect) return;
+    const currentSelectedValue = cloneReferenceSelect.value;
+    cloneReferenceSelect.innerHTML =
+      '<option value="none">-- Select Reference File --</option>';
+    filesData.forEach((filename) => {
+      const option = document.createElement("option");
+      option.value = filename;
+      option.textContent = filename;
+      cloneReferenceSelect.appendChild(option);
+    });
+    const lastSelected = currentUiState.last_reference_file;
+    if (
+      currentSelectedValue !== "none" &&
+      filesData.includes(currentSelectedValue)
+    ) {
+      cloneReferenceSelect.value = currentSelectedValue;
+    } else if (lastSelected && filesData.includes(lastSelected)) {
+      cloneReferenceSelect.value = lastSelected;
+    } else {
+      cloneReferenceSelect.value = "none";
     }
-
-    function populatePresets() {
-        if (!presetsContainer || !appPresets) return;
-        if (appPresets.length === 0) {
-            if (presetsPlaceholder) presetsPlaceholder.textContent = 'No presets available.';
-            return;
-        }
-        if (presetsPlaceholder) presetsPlaceholder.remove();
-        presetsContainer.innerHTML = '';
-        appPresets.forEach((preset, index) => {
-            const button = document.createElement('button');
-            button.type = 'button';
-            button.id = `preset-btn-${index}`;
-            button.className = 'preset-button';
-            button.title = `Load '${preset.name}' text and settings`;
-            button.textContent = preset.name;
-            button.addEventListener('click', () => applyPreset(preset));
-            presetsContainer.appendChild(button);
-        });
+  }
+
+  function populatePresets() {
+    if (!presetsContainer || !appPresets) return;
+    if (appPresets.length === 0) {
+      if (presetsPlaceholder)
+        presetsPlaceholder.textContent = "No presets available.";
+      return;
     }
+    if (presetsPlaceholder) presetsPlaceholder.remove();
+    presetsContainer.innerHTML = "";
+    appPresets.forEach((preset, index) => {
+      const button = document.createElement("button");
+      button.type = "button";
+      button.id = `preset-btn-${index}`;
+      button.className = "preset-button";
+      button.title = `Load '${preset.name}' text and settings`;
+      button.textContent = preset.name;
+      button.addEventListener("click", () => applyPreset(preset));
+      presetsContainer.appendChild(button);
+    });
+  }
 
-    function applyPreset(presetData, showNotif = true) {
-        if (!presetData) return;
-        if (textArea && presetData.text !== undefined) {
-            textArea.value = presetData.text;
-            if (charCount) charCount.textContent = textArea.value.length;
-        }
-        const genParams = presetData.params || presetData;
-        if (temperatureSlider && genParams.temperature !== undefined) temperatureSlider.value = genParams.temperature;
-        if (exaggerationSlider && genParams.exaggeration !== undefined) exaggerationSlider.value = genParams.exaggeration;
-        if (cfgWeightSlider && genParams.cfg_weight !== undefined) cfgWeightSlider.value = genParams.cfg_weight;
-        if (speedFactorSlider && genParams.speed_factor !== undefined) speedFactorSlider.value = genParams.speed_factor;
-        if (seedInput && genParams.seed !== undefined) seedInput.value = genParams.seed;
-        if (languageSelect && genParams.language !== undefined) languageSelect.value = genParams.language;
-        if (temperatureValueDisplay && temperatureSlider) temperatureValueDisplay.textContent = temperatureSlider.value;
-        if (exaggerationValueDisplay && exaggerationSlider) exaggerationValueDisplay.textContent = exaggerationSlider.value;
-        if (cfgWeightValueDisplay && cfgWeightSlider) cfgWeightValueDisplay.textContent = cfgWeightSlider.value;
-        if (speedFactorValueDisplay && speedFactorSlider) speedFactorValueDisplay.textContent = speedFactorSlider.value;
-        updateSpeedFactorWarning(); // Update warning after applying preset
-        if (genParams.voice_id && predefinedVoiceSelect) {
-            const voiceExists = Array.from(predefinedVoiceSelect.options).some(opt => opt.value === genParams.voice_id);
-            if (voiceExists) {
-                predefinedVoiceSelect.value = genParams.voice_id;
-                // MODIFICATION: Set checked property directly and call the UI update function.
-                document.querySelector('input[name="voice_mode"][value="predefined"]').checked = true;
-                toggleVoiceOptionsDisplay();
-            }
-        } else if (genParams.reference_audio_filename && cloneReferenceSelect) {
-            const refExists = Array.from(cloneReferenceSelect.options).some(opt => opt.value === genParams.reference_audio_filename);
-            if (refExists) {
-                cloneReferenceSelect.value = genParams.reference_audio_filename;
-                // MODIFICATION: Set checked property directly and call the UI update function.
-                document.querySelector('input[name="voice_mode"][value="clone"]').checked = true;
-                toggleVoiceOptionsDisplay();
-            }
-        }
-        if (showNotif) showNotification(`Preset "${presetData.name}" loaded.`, 'info', 3000);
-        // Only save the state if this was a direct user click, not an init call.
-        if (isUserInteraction) {
-            debouncedSaveState();
-        }
+  function applyPreset(presetData, showNotif = true) {
+    if (!presetData) return;
+    if (textArea && presetData.text !== undefined) {
+      textArea.value = presetData.text;
+      if (charCount) charCount.textContent = textArea.value.length;
     }
-
-    // --- Voice Mode and Options Visibility ---
-    function toggleVoiceOptionsDisplay() {
-        const selectedMode = document.querySelector('input[name="voice_mode"]:checked')?.value;
-        currentVoiceMode = selectedMode;
-        if (predefinedVoiceOptionsDiv) predefinedVoiceOptionsDiv.classList.toggle('hidden', selectedMode !== 'predefined');
-        if (cloneOptionsDiv) cloneOptionsDiv.classList.toggle('hidden', selectedMode !== 'clone');
-        if (predefinedVoiceSelect) predefinedVoiceSelect.required = (selectedMode === 'predefined');
-        if (cloneReferenceSelect) cloneReferenceSelect.required = (selectedMode === 'clone');
+    const genParams = presetData.params || presetData;
+    if (temperatureSlider && genParams.temperature !== undefined)
+      temperatureSlider.value = genParams.temperature;
+    if (exaggerationSlider && genParams.exaggeration !== undefined)
+      exaggerationSlider.value = genParams.exaggeration;
+    if (cfgWeightSlider && genParams.cfg_weight !== undefined)
+      cfgWeightSlider.value = genParams.cfg_weight;
+    if (speedFactorSlider && genParams.speed_factor !== undefined)
+      speedFactorSlider.value = genParams.speed_factor;
+    if (seedInput && genParams.seed !== undefined)
+      seedInput.value = genParams.seed;
+    if (languageSelect && genParams.language !== undefined)
+      languageSelect.value = genParams.language;
+    if (temperatureValueDisplay && temperatureSlider)
+      temperatureValueDisplay.textContent = temperatureSlider.value;
+    if (exaggerationValueDisplay && exaggerationSlider)
+      exaggerationValueDisplay.textContent = exaggerationSlider.value;
+    if (cfgWeightValueDisplay && cfgWeightSlider)
+      cfgWeightValueDisplay.textContent = cfgWeightSlider.value;
+    if (speedFactorValueDisplay && speedFactorSlider)
+      speedFactorValueDisplay.textContent = speedFactorSlider.value;
+    updateSpeedFactorWarning(); // Update warning after applying preset
+    if (genParams.voice_id && predefinedVoiceSelect) {
+      const voiceExists = Array.from(predefinedVoiceSelect.options).some(
+        (opt) => opt.value === genParams.voice_id
+      );
+      if (voiceExists) {
+        predefinedVoiceSelect.value = genParams.voice_id;
+        // MODIFICATION: Set checked property directly and call the UI update function.
+        document.querySelector(
+          'input[name="voice_mode"][value="predefined"]'
+        ).checked = true;
+        toggleVoiceOptionsDisplay();
+      }
+    } else if (genParams.reference_audio_filename && cloneReferenceSelect) {
+      const refExists = Array.from(cloneReferenceSelect.options).some(
+        (opt) => opt.value === genParams.reference_audio_filename
+      );
+      if (refExists) {
+        cloneReferenceSelect.value = genParams.reference_audio_filename;
+        // MODIFICATION: Set checked property directly and call the UI update function.
+        document.querySelector(
+          'input[name="voice_mode"][value="clone"]'
+        ).checked = true;
+        toggleVoiceOptionsDisplay();
+      }
     }
-    voiceModeRadios.forEach(radio => radio.addEventListener('change', toggleVoiceOptionsDisplay));
-
-    function toggleChunkControlsVisibility() {
-        const isChecked = splitTextToggle ? splitTextToggle.checked : false;
-        if (chunkSizeControls) chunkSizeControls.classList.toggle('hidden', !isChecked);
-        if (chunkExplanation) chunkExplanation.classList.toggle('hidden', !isChecked);
+    if (showNotif)
+      showNotification(`Preset "${presetData.name}" loaded.`, "info", 3000);
+    // Only save the state if this was a direct user click, not an init call.
+    if (isUserInteraction) {
+      debouncedSaveState();
     }
-    if (splitTextToggle) toggleChunkControlsVisibility();
-
-    // --- Audio Player (WaveSurfer) ---
-    function initializeWaveSurfer(audioUrl, resultDetails = {}) {
-        if (wavesurfer) {
-            wavesurfer.unAll(); // Remove all event listeners before destroying
-            wavesurfer.destroy();
-            wavesurfer = null;
-        }
-        if (currentAudioBlobUrl) {
-            URL.revokeObjectURL(currentAudioBlobUrl);
-            currentAudioBlobUrl = null;
-        }
-        currentAudioBlobUrl = audioUrl;
+  }
+
+  // --- Voice Mode and Options Visibility ---
+  function toggleVoiceOptionsDisplay() {
+    const selectedMode = document.querySelector(
+      'input[name="voice_mode"]:checked'
+    )?.value;
+    currentVoiceMode = selectedMode;
+    if (predefinedVoiceOptionsDiv)
+      predefinedVoiceOptionsDiv.classList.toggle(
+        "hidden",
+        selectedMode !== "predefined"
+      );
+    if (cloneOptionsDiv)
+      cloneOptionsDiv.classList.toggle("hidden", selectedMode !== "clone");
+    if (predefinedVoiceSelect)
+      predefinedVoiceSelect.required = selectedMode === "predefined";
+    if (cloneReferenceSelect)
+      cloneReferenceSelect.required = selectedMode === "clone";
+  }
+  voiceModeRadios.forEach((radio) =>
+    radio.addEventListener("change", toggleVoiceOptionsDisplay)
+  );
+
+  function toggleChunkControlsVisibility() {
+    const isChecked = splitTextToggle ? splitTextToggle.checked : false;
+    if (chunkSizeControls)
+      chunkSizeControls.classList.toggle("hidden", !isChecked);
+    if (chunkExplanation)
+      chunkExplanation.classList.toggle("hidden", !isChecked);
+  }
+  if (splitTextToggle) toggleChunkControlsVisibility();
+
+  // --- Audio Player (WaveSurfer) ---
+  function initializeWaveSurfer(audioUrl, resultDetails = {}) {
+    if (wavesurfer) {
+      wavesurfer.unAll(); // Remove all event listeners before destroying
+      wavesurfer.destroy();
+      wavesurfer = null;
+    }
+    if (currentAudioBlobUrl) {
+      URL.revokeObjectURL(currentAudioBlobUrl);
+      currentAudioBlobUrl = null;
+    }
+    currentAudioBlobUrl = audioUrl;
 
-        // Ensure the container is clean or re-created
-        audioPlayerContainer.innerHTML = `
+    // Ensure the container is clean or re-created
+    audioPlayerContainer.innerHTML = `
             <div class="audio-player-card">
                 <div class="p-6 sm:p-8">
                     <h2 class="card-header">Generated Audio</h2>
@@ -537,498 +755,852 @@ document.addEventListener('DOMContentLoaded', async function () {
                 </div>
             </div>`;
 
-        // Re-select elements after recreating them
-        const waveformDiv = audioPlayerContainer.querySelector('#waveform');
-        const playBtn = audioPlayerContainer.querySelector('#play-btn');
-        const downloadLink = audioPlayerContainer.querySelector('#download-link');
-        const playerModeSpan = audioPlayerContainer.querySelector('#player-voice-mode');
-        const playerFileSpan = audioPlayerContainer.querySelector('#player-voice-file-details');
-        const playerGenTimeSpan = audioPlayerContainer.querySelector('#player-gen-time');
-        const audioDurationSpan = audioPlayerContainer.querySelector('#audio-duration');
-
-        const audioFilename = resultDetails.filename || (typeof audioUrl === 'string' ? audioUrl.split('/').pop() : 'tts_output.wav');
-        if (downloadLink) {
-            downloadLink.href = audioUrl;
-            downloadLink.download = audioFilename;
-            const downloadTextSpan = downloadLink.querySelector('span'); // Target the span for text update
-            if (downloadTextSpan) {
-                downloadTextSpan.textContent = `Download ${audioFilename.split('.').pop().toUpperCase()}`;
-            }
-        }
-        if (playerModeSpan) playerModeSpan.textContent = resultDetails.submittedVoiceMode || currentVoiceMode || '--';
-        if (playerFileSpan) {
-            let fileDetail = '';
-            if ((resultDetails.submittedVoiceMode || currentVoiceMode) === 'clone' && resultDetails.submittedCloneFile) {
-                fileDetail = `(<span class="font-medium text-slate-700 dark:text-slate-300">${resultDetails.submittedCloneFile}</span>)`;
-            } else if ((resultDetails.submittedVoiceMode || currentVoiceMode) === 'predefined' && resultDetails.submittedPredefinedVoice) {
-                fileDetail = `(<span class="font-medium text-slate-700 dark:text-slate-300">${resultDetails.submittedPredefinedVoice}</span>)`;
-            }
-            playerFileSpan.innerHTML = fileDetail;
-        }
-        if (playerGenTimeSpan) playerGenTimeSpan.textContent = resultDetails.genTime ? `${resultDetails.genTime}s` : '--s';
-
-        const playIconSVG = `<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor" class="w-5 h-5 mr-1.5"><path fill-rule="evenodd" d="M2 10a8 8 0 1 1 16 0 8 8 0 0 1-16 0Zm6.39-2.908a.75.75 0 0 1 .766.027l3.5 2.25a.75.75 0 0 1 0 1.262l-3.5 2.25A.75.75 0 0 1 8 12.25v-4.5a.75.75 0 0 1 .39-.658Z" clip-rule="evenodd" /></svg><span>Play</span>`;
-        const pauseIconSVG = `<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor" class="w-5 h-5 mr-1.5"><path fill-rule="evenodd" d="M2 10a8 8 0 1 1 16 0 8 8 0 0 1-16 0Zm5-2.25A.75.75 0 0 1 7.75 7h4.5a.75.75 0 0 1 .75.75v4.5a.75.75 0 0 1-.75.75h-4.5a.75.75 0 0 1-.75-.75v-4.5Z" clip-rule="evenodd" /></svg><span>Pause</span>`;
-        const isDark = document.documentElement.classList.contains('dark');
-
-        wavesurfer = WaveSurfer.create({
-            container: waveformDiv, waveColor: isDark ? '#6366f1' : '#a5b4fc', progressColor: isDark ? '#4f46e5' : '#6366f1',
-            cursorColor: isDark ? '#cbd5e1' : '#475569', barWidth: 3, barRadius: 3, cursorWidth: 1, height: 80, barGap: 2,
-            responsive: true, url: audioUrl, mediaControls: false, normalize: true,
-        });
-
-        wavesurfer.on('ready', () => {
-            const duration = wavesurfer.getDuration();
-            if (audioDurationSpan) audioDurationSpan.textContent = formatTime(duration);
-            if (playBtn) { playBtn.disabled = false; playBtn.innerHTML = playIconSVG; }
-            if (downloadLink) { downloadLink.classList.remove('opacity-50', 'pointer-events-none'); downloadLink.setAttribute('aria-disabled', 'false'); }
-        });
-        wavesurfer.on('play', () => { if (playBtn) playBtn.innerHTML = pauseIconSVG; });
-        wavesurfer.on('pause', () => { if (playBtn) playBtn.innerHTML = playIconSVG; });
-        wavesurfer.on('finish', () => { if (playBtn) playBtn.innerHTML = playIconSVG; wavesurfer.seekTo(0); });
-        wavesurfer.on('error', (err) => {
-            console.error("WaveSurfer error:", err);
-            showNotification(`Error loading audio waveform: ${err.message || err}`, 'error');
-            if (waveformDiv) waveformDiv.innerHTML = `<p class="p-4 text-sm text-red-600 dark:text-red-400">Could not load waveform.</p>`;
-            if (playBtn) playBtn.disabled = true;
-        });
-
-        if (playBtn) {
-            playBtn.onclick = () => {
-                if (wavesurfer) {
-                    wavesurfer.playPause();
-                }
-            };
-        }
-        setTimeout(() => audioPlayerContainer.scrollIntoView({ behavior: 'smooth', block: 'nearest' }), 150);
+    // Re-select elements after recreating them
+    const waveformDiv = audioPlayerContainer.querySelector("#waveform");
+    const playBtn = audioPlayerContainer.querySelector("#play-btn");
+    const downloadLink = audioPlayerContainer.querySelector("#download-link");
+    const playerModeSpan =
+      audioPlayerContainer.querySelector("#player-voice-mode");
+    const playerFileSpan = audioPlayerContainer.querySelector(
+      "#player-voice-file-details"
+    );
+    const playerGenTimeSpan =
+      audioPlayerContainer.querySelector("#player-gen-time");
+    const audioDurationSpan =
+      audioPlayerContainer.querySelector("#audio-duration");
+
+    const audioFilename =
+      resultDetails.filename ||
+      (typeof audioUrl === "string"
+        ? audioUrl.split("/").pop()
+        : "tts_output.wav");
+    if (downloadLink) {
+      downloadLink.href = audioUrl;
+      downloadLink.download = audioFilename;
+      const downloadTextSpan = downloadLink.querySelector("span"); // Target the span for text update
+      if (downloadTextSpan) {
+        downloadTextSpan.textContent = `Download ${audioFilename
+          .split(".")
+          .pop()
+          .toUpperCase()}`;
+      }
     }
-
-    // --- TTS Generation Logic ---
-    function getTTSFormData() {
-        const jsonData = {
-            text: textArea.value,
-            temperature: parseFloat(temperatureSlider.value),
-            exaggeration: parseFloat(exaggerationSlider.value),
-            cfg_weight: parseFloat(cfgWeightSlider.value),
-            speed_factor: parseFloat(speedFactorSlider.value),
-            seed: parseInt(seedInput.value, 10),
-            language: languageSelect.value,
-            voice_mode: currentVoiceMode,
-            split_text: splitTextToggle.checked,
-            chunk_size: parseInt(chunkSizeSlider.value, 10),
-            output_format: outputFormatSelect.value || 'mp3'
-        };
-        if (currentVoiceMode === 'predefined' && predefinedVoiceSelect.value !== 'none') {
-            jsonData.predefined_voice_id = predefinedVoiceSelect.value;
-        } else if (currentVoiceMode === 'clone' && cloneReferenceSelect.value !== 'none') {
-            jsonData.reference_audio_filename = cloneReferenceSelect.value;
-        }
-        return jsonData;
+    if (playerModeSpan)
+      playerModeSpan.textContent =
+        resultDetails.submittedVoiceMode || currentVoiceMode || "--";
+    if (playerFileSpan) {
+      let fileDetail = "";
+      if (
+        (resultDetails.submittedVoiceMode || currentVoiceMode) === "clone" &&
+        resultDetails.submittedCloneFile
+      ) {
+        fileDetail = `(<span class="font-medium text-slate-700 dark:text-slate-300">${resultDetails.submittedCloneFile}</span>)`;
+      } else if (
+        (resultDetails.submittedVoiceMode || currentVoiceMode) ===
+          "predefined" &&
+        resultDetails.submittedPredefinedVoice
+      ) {
+        fileDetail = `(<span class="font-medium text-slate-700 dark:text-slate-300">${resultDetails.submittedPredefinedVoice}</span>)`;
+      }
+      playerFileSpan.innerHTML = fileDetail;
     }
+    if (playerGenTimeSpan)
+      playerGenTimeSpan.textContent = resultDetails.genTime
+        ? `${resultDetails.genTime}s`
+        : "--s";
+
+    const playIconSVG = `<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor" class="w-5 h-5 mr-1.5"><path fill-rule="evenodd" d="M2 10a8 8 0 1 1 16 0 8 8 0 0 1-16 0Zm6.39-2.908a.75.75 0 0 1 .766.027l3.5 2.25a.75.75 0 0 1 0 1.262l-3.5 2.25A.75.75 0 0 1 8 12.25v-4.5a.75.75 0 0 1 .39-.658Z" clip-rule="evenodd" /></svg><span>Play</span>`;
+    const pauseIconSVG = `<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 20 20" fill="currentColor" class="w-5 h-5 mr-1.5"><path fill-rule="evenodd" d="M2 10a8 8 0 1 1 16 0 8 8 0 0 1-16 0Zm5-2.25A.75.75 0 0 1 7.75 7h4.5a.75.75 0 0 1 .75.75v4.5a.75.75 0 0 1-.75.75h-4.5a.75.75 0 0 1-.75-.75v-4.5Z" clip-rule="evenodd" /></svg><span>Pause</span>`;
+    const isDark = document.documentElement.classList.contains("dark");
+
+    wavesurfer = WaveSurfer.create({
+      container: waveformDiv,
+      waveColor: isDark ? "#6366f1" : "#a5b4fc",
+      progressColor: isDark ? "#4f46e5" : "#6366f1",
+      cursorColor: isDark ? "#cbd5e1" : "#475569",
+      barWidth: 3,
+      barRadius: 3,
+      cursorWidth: 1,
+      height: 80,
+      barGap: 2,
+      responsive: true,
+      url: audioUrl,
+      mediaControls: false,
+      normalize: true,
+    });
+
+    wavesurfer.on("ready", () => {
+      const duration = wavesurfer.getDuration();
+      if (audioDurationSpan)
+        audioDurationSpan.textContent = formatTime(duration);
+      if (playBtn) {
+        playBtn.disabled = false;
+        playBtn.innerHTML = playIconSVG;
+      }
+      if (downloadLink) {
+        downloadLink.classList.remove("opacity-50", "pointer-events-none");
+        downloadLink.setAttribute("aria-disabled", "false");
+      }
+    });
+    wavesurfer.on("play", () => {
+      if (playBtn) playBtn.innerHTML = pauseIconSVG;
+    });
+    wavesurfer.on("pause", () => {
+      if (playBtn) playBtn.innerHTML = playIconSVG;
+    });
+    wavesurfer.on("finish", () => {
+      if (playBtn) playBtn.innerHTML = playIconSVG;
+      wavesurfer.seekTo(0);
+    });
+    wavesurfer.on("error", (err) => {
+      console.error("WaveSurfer error:", err);
+      showNotification(
+        `Error loading audio waveform: ${err.message || err}`,
+        "error"
+      );
+      if (waveformDiv)
+        waveformDiv.innerHTML = `<p class="p-4 text-sm text-red-600 dark:text-red-400">Could not load waveform.</p>`;
+      if (playBtn) playBtn.disabled = true;
+    });
 
-    async function submitTTSRequest() {
-        isGenerating = true;
-        showLoadingOverlay();
-        const startTime = performance.now();
-        const jsonData = getTTSFormData();
-        try {
-            const response = await fetch(`${API_BASE_URL}/tts`, {
-                method: 'POST',
-                headers: { 'Content-Type': 'application/json' },
-                body: JSON.stringify(jsonData)
-            });
-            if (!response.ok) {
-                const errorResult = await response.json().catch(() => ({ detail: `HTTP error ${response.status}` }));
-                throw new Error(errorResult.detail || 'TTS generation failed.');
-            }
-            const audioBlob = await response.blob();
-            const endTime = performance.now();
-            const genTime = ((endTime - startTime) / 1000).toFixed(2);
-            const filenameFromServer = response.headers.get('Content-Disposition')?.split('filename=')[1]?.replace(/"/g, '') || 'generated_audio.wav';
-            const resultDetails = {
-                outputUrl: URL.createObjectURL(audioBlob), filename: filenameFromServer, genTime: genTime,
-                submittedVoiceMode: jsonData.voice_mode, submittedPredefinedVoice: jsonData.predefined_voice_id,
-                submittedCloneFile: jsonData.reference_audio_filename
-            };
-            initializeWaveSurfer(resultDetails.outputUrl, resultDetails);
-            showNotification('Audio generated successfully!', 'success');
-        } catch (error) {
-            console.error('TTS Generation Error:', error);
-            showNotification(error.message || 'An unknown error occurred during TTS generation.', 'error');
-        } finally {
-            isGenerating = false;
-            hideLoadingOverlay();
+    if (playBtn) {
+      playBtn.onclick = () => {
+        if (wavesurfer) {
+          wavesurfer.playPause();
         }
+      };
     }
-
-    function proceedWithSubmissionChecks() {
-        const textContent = textArea.value.trim();
-        const isSplittingEnabled = splitTextToggle.checked;
-        const currentChunkSz = parseInt(chunkSizeSlider.value, 10);
-        const needsChunkWarn = isSplittingEnabled && textContent.length >= currentChunkSz * 1.5 &&
-            currentVoiceMode !== 'predefined' && currentVoiceMode !== 'clone' &&
-            (!seedInput || parseInt(seedInput.value, 10) === 0 || seedInput.value === '') && !hideChunkWarning;
-        if (needsChunkWarn) { showChunkWarningModal(); return; }
-        submitTTSRequest();
+    setTimeout(
+      () =>
+        audioPlayerContainer.scrollIntoView({
+          behavior: "smooth",
+          block: "nearest",
+        }),
+      150
+    );
+  }
+
+  // --- TTS Generation Logic ---
+  function getTTSFormData() {
+    const jsonData = {
+      text: textArea.value,
+      temperature: parseFloat(temperatureSlider.value),
+      exaggeration: parseFloat(exaggerationSlider.value),
+      cfg_weight: parseFloat(cfgWeightSlider.value),
+      speed_factor: parseFloat(speedFactorSlider.value),
+      seed: parseInt(seedInput.value, 10),
+      language: languageSelect.value,
+      voice_mode: currentVoiceMode,
+      split_text: splitTextToggle.checked,
+      chunk_size: parseInt(chunkSizeSlider.value, 10),
+      output_format: outputFormatSelect.value || "mp3",
+    };
+    if (
+      currentVoiceMode === "predefined" &&
+      predefinedVoiceSelect.value !== "none"
+    ) {
+      jsonData.predefined_voice_id = predefinedVoiceSelect.value;
+    } else if (
+      currentVoiceMode === "clone" &&
+      cloneReferenceSelect.value !== "none"
+    ) {
+      jsonData.reference_audio_filename = cloneReferenceSelect.value;
     }
-
-    // --- Attach main generation event to the button's CLICK, not the form's SUBMIT ---
-    // This is a more robust method that prevents accidental submissions during page load.
-    if (generateBtn) {
-        generateBtn.addEventListener('click', function (event) {
-
-            console.log('Generate button clicked!');
-            console.log('Current voice mode:', currentVoiceMode);
-            console.log('Is generating:', isGenerating);
-            console.log('Text content:', textArea ? textArea.value.trim() : 'NO TEXTAREA');
-
-            // We still prevent default in case the button has any default browser actions.
-            event.preventDefault();
-
-            if (isGenerating) {
-                showNotification("Generation is already in progress.", "warning");
-                return;
-            }
-            const textContent = textArea.value.trim();
-            if (!textContent) {
-                showNotification("Please enter some text to generate speech.", 'error');
-                return;
-            }
-            if (currentVoiceMode === 'predefined' && (!predefinedVoiceSelect || predefinedVoiceSelect.value === 'none')) {
-                showNotification("Please select a predefined voice.", 'error');
-                return;
-            }
-            if (currentVoiceMode === 'clone' && (!cloneReferenceSelect || cloneReferenceSelect.value === 'none')) {
-                showNotification("Please select a reference audio file for Voice Cloning.", 'error');
-                return;
-            }
-
-            // Check for the generation quality warning.
-            if (!hideGenerationWarning) {
-                showGenerationWarningModal();
-                return; // Stop here and let the modal handler take over.
-            }
-
-            // If the warning is hidden, proceed to the final checks.
-            proceedWithSubmissionChecks();
-        });
-    } else {
-        console.log('Generate button not found!');
+    return jsonData;
+  }
+
+  async function submitTTSRequest() {
+    isGenerating = true;
+    showLoadingOverlay();
+    const startTime = performance.now();
+    const jsonData = getTTSFormData();
+    try {
+      const response = await fetch(`${API_BASE_URL}/tts`, {
+        method: "POST",
+        headers: { "Content-Type": "application/json" },
+        body: JSON.stringify(jsonData),
+      });
+      if (!response.ok) {
+        const errorResult = await response
+          .json()
+          .catch(() => ({ detail: `HTTP error ${response.status}` }));
+        throw new Error(errorResult.detail || "TTS generation failed.");
+      }
+      const audioBlob = await response.blob();
+      const endTime = performance.now();
+      const genTime = ((endTime - startTime) / 1000).toFixed(2);
+      const filenameFromServer =
+        response.headers
+          .get("Content-Disposition")
+          ?.split("filename=")[1]
+          ?.replace(/"/g, "") || "generated_audio.wav";
+      const resultDetails = {
+        outputUrl: URL.createObjectURL(audioBlob),
+        filename: filenameFromServer,
+        genTime: genTime,
+        submittedVoiceMode: jsonData.voice_mode,
+        submittedPredefinedVoice: jsonData.predefined_voice_id,
+        submittedCloneFile: jsonData.reference_audio_filename,
+      };
+      initializeWaveSurfer(resultDetails.outputUrl, resultDetails);
+      showNotification("Audio generated successfully!", "success");
+    } catch (error) {
+      console.error("TTS Generation Error:", error);
+      showNotification(
+        error.message || "An unknown error occurred during TTS generation.",
+        "error"
+      );
+    } finally {
+      isGenerating = false;
+      hideLoadingOverlay();
     }
-
-    // --- Modal Handling ---
-    function showChunkWarningModal() {
-        if (chunkWarningModal) {
-            chunkWarningModal.style.display = 'flex';
-            chunkWarningModal.classList.remove('hidden', 'opacity-0');
-            chunkWarningModal.dataset.state = 'open';
-        }
+  }
+
+  function proceedWithSubmissionChecks() {
+    const textContent = textArea.value.trim();
+    const isSplittingEnabled = splitTextToggle.checked;
+    const currentChunkSz = parseInt(chunkSizeSlider.value, 10);
+    const needsChunkWarn =
+      isSplittingEnabled &&
+      textContent.length >= currentChunkSz * 1.5 &&
+      currentVoiceMode !== "predefined" &&
+      currentVoiceMode !== "clone" &&
+      (!seedInput ||
+        parseInt(seedInput.value, 10) === 0 ||
+        seedInput.value === "") &&
+      !hideChunkWarning;
+    if (needsChunkWarn) {
+      showChunkWarningModal();
+      return;
     }
-    function hideChunkWarningModal() {
-        if (chunkWarningModal) {
-            chunkWarningModal.classList.add('opacity-0');
-            setTimeout(() => {
-                chunkWarningModal.style.display = 'none';
-                chunkWarningModal.dataset.state = 'closed';
-            }, 300);
-        }
+    submitTTSRequest();
+  }
+
+  // --- Attach main generation event to the button's CLICK, not the form's SUBMIT ---
+  // This is a more robust method that prevents accidental submissions during page load.
+  if (generateBtn) {
+    generateBtn.addEventListener("click", function (event) {
+      console.log("Generate button clicked!");
+      console.log("Current voice mode:", currentVoiceMode);
+      console.log("Is generating:", isGenerating);
+      console.log(
+        "Text content:",
+        textArea ? textArea.value.trim() : "NO TEXTAREA"
+      );
+
+      // We still prevent default in case the button has any default browser actions.
+      event.preventDefault();
+
+      if (isGenerating) {
+        showNotification("Generation is already in progress.", "warning");
+        return;
+      }
+      const textContent = textArea.value.trim();
+      if (!textContent) {
+        showNotification("Please enter some text to generate speech.", "error");
+        return;
+      }
+      if (
+        currentVoiceMode === "predefined" &&
+        (!predefinedVoiceSelect || predefinedVoiceSelect.value === "none")
+      ) {
+        showNotification("Please select a predefined voice.", "error");
+        return;
+      }
+      if (
+        currentVoiceMode === "clone" &&
+        (!cloneReferenceSelect || cloneReferenceSelect.value === "none")
+      ) {
+        showNotification(
+          "Please select a reference audio file for Voice Cloning.",
+          "error"
+        );
+        return;
+      }
+
+      // Check for the generation quality warning.
+      if (!hideGenerationWarning) {
+        showGenerationWarningModal();
+        return; // Stop here and let the modal handler take over.
+      }
+
+      // If the warning is hidden, proceed to the final checks.
+      proceedWithSubmissionChecks();
+    });
+  } else {
+    console.log("Generate button not found!");
+  }
+
+  // --- Modal Handling ---
+  function showChunkWarningModal() {
+    if (chunkWarningModal) {
+      chunkWarningModal.style.display = "flex";
+      chunkWarningModal.classList.remove("hidden", "opacity-0");
+      chunkWarningModal.dataset.state = "open";
     }
-    function showGenerationWarningModal() {
-        if (generationWarningModal) {
-            generationWarningModal.style.display = 'flex';
-            generationWarningModal.classList.remove('hidden', 'opacity-0');
-            generationWarningModal.dataset.state = 'open';
-        }
+  }
+  function hideChunkWarningModal() {
+    if (chunkWarningModal) {
+      chunkWarningModal.classList.add("opacity-0");
+      setTimeout(() => {
+        chunkWarningModal.style.display = "none";
+        chunkWarningModal.dataset.state = "closed";
+      }, 300);
     }
-    function hideGenerationWarningModal() {
-        if (generationWarningModal) {
-            generationWarningModal.classList.add('opacity-0');
-            setTimeout(() => {
-                generationWarningModal.style.display = 'none';
-                generationWarningModal.dataset.state = 'closed';
-            }, 300);
-        }
+  }
+  function showGenerationWarningModal() {
+    if (generationWarningModal) {
+      generationWarningModal.style.display = "flex";
+      generationWarningModal.classList.remove("hidden", "opacity-0");
+      generationWarningModal.dataset.state = "open";
+    }
+  }
+  function hideGenerationWarningModal() {
+    if (generationWarningModal) {
+      generationWarningModal.classList.add("opacity-0");
+      setTimeout(() => {
+        generationWarningModal.style.display = "none";
+        generationWarningModal.dataset.state = "closed";
+      }, 300);
     }
-    if (chunkWarningOkBtn) chunkWarningOkBtn.addEventListener('click', () => {
-        if (hideChunkWarningCheckbox && hideChunkWarningCheckbox.checked) hideChunkWarning = true;
-        hideChunkWarningModal(); debouncedSaveState(); submitTTSRequest();
+  }
+  if (chunkWarningOkBtn)
+    chunkWarningOkBtn.addEventListener("click", () => {
+      if (hideChunkWarningCheckbox && hideChunkWarningCheckbox.checked)
+        hideChunkWarning = true;
+      hideChunkWarningModal();
+      debouncedSaveState();
+      submitTTSRequest();
     });
-    if (chunkWarningCancelBtn) chunkWarningCancelBtn.addEventListener('click', hideChunkWarningModal);
-    if (generationWarningAcknowledgeBtn) generationWarningAcknowledgeBtn.addEventListener('click', () => {
-        if (hideGenerationWarningCheckbox && hideGenerationWarningCheckbox.checked) hideGenerationWarning = true;
-        hideGenerationWarningModal(); debouncedSaveState(); proceedWithSubmissionChecks();
+  if (chunkWarningCancelBtn)
+    chunkWarningCancelBtn.addEventListener("click", hideChunkWarningModal);
+  if (generationWarningAcknowledgeBtn)
+    generationWarningAcknowledgeBtn.addEventListener("click", () => {
+      if (
+        hideGenerationWarningCheckbox &&
+        hideGenerationWarningCheckbox.checked
+      )
+        hideGenerationWarning = true;
+      hideGenerationWarningModal();
+      debouncedSaveState();
+      proceedWithSubmissionChecks();
     });
-    if (loadingCancelBtn) loadingCancelBtn.addEventListener('click', () => {
-        if (isGenerating) { isGenerating = false; hideLoadingOverlay(); showNotification("Generation UI cancelled by user.", "info"); }
+  if (loadingCancelBtn)
+    loadingCancelBtn.addEventListener("click", () => {
+      if (isGenerating) {
+        isGenerating = false;
+        hideLoadingOverlay();
+        showNotification("Generation UI cancelled by user.", "info");
+      }
     });
-    function showLoadingOverlay() {
-        if (loadingOverlay && generateBtn && loadingCancelBtn) {
-            loadingMessage.textContent = 'Generating audio...';
-            loadingStatusText.textContent = 'Please wait. This may take some time.';
-            loadingOverlay.style.display = 'flex';
-            loadingOverlay.classList.remove('hidden', 'opacity-0'); loadingOverlay.dataset.state = 'open';
-            generateBtn.disabled = true; loadingCancelBtn.disabled = false;
-        }
+  function showLoadingOverlay() {
+    if (loadingOverlay && generateBtn && loadingCancelBtn) {
+      loadingMessage.textContent = "Generating audio...";
+      loadingStatusText.textContent = "Please wait. This may take some time.";
+      loadingOverlay.style.display = "flex";
+      loadingOverlay.classList.remove("hidden", "opacity-0");
+      loadingOverlay.dataset.state = "open";
+      generateBtn.disabled = true;
+      loadingCancelBtn.disabled = false;
     }
-    function hideLoadingOverlay() {
-        if (loadingOverlay && generateBtn) {
-            loadingOverlay.classList.add('opacity-0');
-            setTimeout(() => {
-                loadingOverlay.style.display = 'none';
-                loadingOverlay.dataset.state = 'closed';
-            }, 300);
-            generateBtn.disabled = false;
-        }
+  }
+  function hideLoadingOverlay() {
+    if (loadingOverlay && generateBtn) {
+      loadingOverlay.classList.add("opacity-0");
+      setTimeout(() => {
+        loadingOverlay.style.display = "none";
+        loadingOverlay.dataset.state = "closed";
+      }, 300);
+      generateBtn.disabled = false;
     }
-
-    // --- Configuration Management ---
-    function displayServerConfiguration() {
-        if (!serverConfigForm || !currentConfig || Object.keys(currentConfig).length === 0) return;
-        const fieldsToDisplay = {
-            "server.host": currentConfig.server?.host, "server.port": currentConfig.server?.port,
-            "tts_engine.device": currentConfig.tts_engine?.device, "tts_engine.default_voice_id": currentConfig.tts_engine?.default_voice_id,
-            "paths.model_cache": currentConfig.paths?.model_cache, "tts_engine.predefined_voices_path": currentConfig.tts_engine?.predefined_voices_path,
-            "tts_engine.reference_audio_path": currentConfig.tts_engine?.reference_audio_path, "paths.output": currentConfig.paths?.output,
-            "audio_output.format": currentConfig.audio_output?.format, "audio_output.sample_rate": currentConfig.audio_output?.sample_rate
-        };
-        for (const name in fieldsToDisplay) {
-            const input = serverConfigForm.querySelector(`input[name="${name}"]`);
-            if (input) {
-                input.value = fieldsToDisplay[name] !== undefined ? fieldsToDisplay[name] : '';
-                if (name.includes('.host') || name.includes('.port') || name.includes('.device') || name.includes('paths.')) input.readOnly = true;
-                else input.readOnly = false;
-            }
-        }
+  }
+
+  // --- Configuration Management ---
+  function displayServerConfiguration() {
+    if (
+      !serverConfigForm ||
+      !currentConfig ||
+      Object.keys(currentConfig).length === 0
+    )
+      return;
+    const fieldsToDisplay = {
+      "server.host": currentConfig.server?.host,
+      "server.port": currentConfig.server?.port,
+      "tts_engine.device": currentConfig.tts_engine?.device,
+      "tts_engine.default_voice_id": currentConfig.tts_engine?.default_voice_id,
+      "paths.model_cache": currentConfig.paths?.model_cache,
+      "tts_engine.predefined_voices_path":
+        currentConfig.tts_engine?.predefined_voices_path,
+      "tts_engine.reference_audio_path":
+        currentConfig.tts_engine?.reference_audio_path,
+      "paths.output": currentConfig.paths?.output,
+      "audio_output.format": currentConfig.audio_output?.format,
+      "audio_output.sample_rate": currentConfig.audio_output?.sample_rate,
+    };
+    for (const name in fieldsToDisplay) {
+      const input = serverConfigForm.querySelector(`input[name="${name}"]`);
+      if (input) {
+        input.value =
+          fieldsToDisplay[name] !== undefined ? fieldsToDisplay[name] : "";
+        if (
+          name.includes(".host") ||
+          name.includes(".port") ||
+          name.includes(".device") ||
+          name.includes("paths.")
+        )
+          input.readOnly = true;
+        else input.readOnly = false;
+      }
     }
-    async function updateConfigStatus(button, statusElem, message, type = 'info', duration = 5000, enableButtonAfter = true) {
-        const statusClasses = { success: 'text-green-600 dark:text-green-400', error: 'text-red-600 dark:text-red-400', warning: 'text-yellow-600 dark:text-yellow-400', info: 'text-indigo-600 dark:text-indigo-400', processing: 'text-yellow-600 dark:text-yellow-400 animate-pulse' };
-        const isProcessing = message.toLowerCase().includes('saving') || message.toLowerCase().includes('restarting') || message.toLowerCase().includes('resetting');
-        const messageType = isProcessing ? 'processing' : type;
-        if (statusElem) {
-            statusElem.textContent = message;
-            statusElem.className = `text-xs ml-2 ${statusClasses[messageType] || statusClasses['info']}`;
-            statusElem.classList.remove('hidden');
-        }
-        if (button) button.disabled = isProcessing || (type === 'error' && !enableButtonAfter) || (type === 'success' && !enableButtonAfter);
-        if (duration > 0) setTimeout(() => { if (statusElem) statusElem.classList.add('hidden'); if (button && enableButtonAfter) button.disabled = false; }, duration);
-        else if (button && enableButtonAfter && !isProcessing) button.disabled = false;
+  }
+  async function updateConfigStatus(
+    button,
+    statusElem,
+    message,
+    type = "info",
+    duration = 5000,
+    enableButtonAfter = true
+  ) {
+    const statusClasses = {
+      success: "text-green-600 dark:text-green-400",
+      error: "text-red-600 dark:text-red-400",
+      warning: "text-yellow-600 dark:text-yellow-400",
+      info: "text-indigo-600 dark:text-indigo-400",
+      processing: "text-yellow-600 dark:text-yellow-400 animate-pulse",
+    };
+    const isProcessing =
+      message.toLowerCase().includes("saving") ||
+      message.toLowerCase().includes("restarting") ||
+      message.toLowerCase().includes("resetting");
+    const messageType = isProcessing ? "processing" : type;
+    if (statusElem) {
+      statusElem.textContent = message;
+      statusElem.className = `text-xs ml-2 ${
+        statusClasses[messageType] || statusClasses["info"]
+      }`;
+      statusElem.classList.remove("hidden");
     }
-
-    if (saveConfigBtn && configStatus) {
-        saveConfigBtn.addEventListener('click', async () => {
-            const configDataToSave = {};
-            const inputs = serverConfigForm.querySelectorAll('input[name]:not([readonly]), select[name]:not([readonly])');
-            inputs.forEach(input => {
-                const keys = input.name.split('.'); let currentLevel = configDataToSave;
-                keys.forEach((key, index) => {
-                    if (index === keys.length - 1) {
-                        let value = input.value;
-                        if (input.type === 'number') value = parseFloat(value) || 0;
-                        else if (input.type === 'checkbox') value = input.checked;
-                        currentLevel[key] = value;
-                    } else { currentLevel[key] = currentLevel[key] || {}; currentLevel = currentLevel[key]; }
-                });
-            });
-            if (Object.keys(configDataToSave).length === 0) { showNotification("No editable configuration values to save.", "info"); return; }
-            updateConfigStatus(saveConfigBtn, configStatus, 'Saving configuration...', 'info', 0, false);
-            try {
-                const response = await fetch(`${API_BASE_URL}/save_settings`, {
-                    method: 'POST',
-                    headers: { 'Content-Type': 'application/json' },
-                    body: JSON.stringify(configDataToSave)
-                });
-                const result = await response.json();
-                if (!response.ok) throw new Error(result.detail || 'Failed to save configuration');
-                updateConfigStatus(saveConfigBtn, configStatus, result.message || 'Configuration saved.', 'success', 5000);
-                if (result.restart_needed && restartServerBtn) restartServerBtn.classList.remove('hidden');
-                await fetchInitialData();
-                showNotification("Configuration saved. Some changes may require a server restart if prompted.", "success");
-            } catch (error) {
-                console.error('Error saving server config:', error);
-                updateConfigStatus(saveConfigBtn, configStatus, `Error: ${error.message}`, 'error', 0);
-            }
+    if (button)
+      button.disabled =
+        isProcessing ||
+        (type === "error" && !enableButtonAfter) ||
+        (type === "success" && !enableButtonAfter);
+    if (duration > 0)
+      setTimeout(() => {
+        if (statusElem) statusElem.classList.add("hidden");
+        if (button && enableButtonAfter) button.disabled = false;
+      }, duration);
+    else if (button && enableButtonAfter && !isProcessing)
+      button.disabled = false;
+  }
+
+  if (saveConfigBtn && configStatus) {
+    saveConfigBtn.addEventListener("click", async () => {
+      const configDataToSave = {};
+      const inputs = serverConfigForm.querySelectorAll(
+        "input[name]:not([readonly]), select[name]:not([readonly])"
+      );
+      inputs.forEach((input) => {
+        const keys = input.name.split(".");
+        let currentLevel = configDataToSave;
+        keys.forEach((key, index) => {
+          if (index === keys.length - 1) {
+            let value = input.value;
+            if (input.type === "number") value = parseFloat(value) || 0;
+            else if (input.type === "checkbox") value = input.checked;
+            currentLevel[key] = value;
+          } else {
+            currentLevel[key] = currentLevel[key] || {};
+            currentLevel = currentLevel[key];
+          }
         });
-    }
-
-    if (saveGenDefaultsBtn && genDefaultsStatus) {
-        saveGenDefaultsBtn.addEventListener('click', async () => {
-            const genParams = {
-                temperature: parseFloat(temperatureSlider.value), exaggeration: parseFloat(exaggerationSlider.value),
-                cfg_weight: parseFloat(cfgWeightSlider.value), speed_factor: parseFloat(speedFactorSlider.value),
-                seed: parseInt(seedInput.value, 10) || 0, language: languageSelect.value
-            };
-            updateConfigStatus(saveGenDefaultsBtn, genDefaultsStatus, 'Saving generation defaults...', 'info', 0, false);
-            try {
-                const response = await fetch(`${API_BASE_URL}/save_settings`, {
-                    method: 'POST',
-                    headers: { 'Content-Type': 'application/json' },
-                    body: JSON.stringify({ generation_defaults: genParams })
-                });
-                const result = await response.json();
-                if (!response.ok) throw new Error(result.detail || 'Failed to save generation defaults');
-                updateConfigStatus(saveGenDefaultsBtn, genDefaultsStatus, result.message || 'Generation defaults saved.', 'success', 5000);
-                if (currentConfig.generation_defaults) Object.assign(currentConfig.generation_defaults, genParams);
-            } catch (error) {
-                console.error('Error saving generation defaults:', error);
-                updateConfigStatus(saveGenDefaultsBtn, genDefaultsStatus, `Error: ${error.message}`, 'error', 0);
-            }
+      });
+      if (Object.keys(configDataToSave).length === 0) {
+        showNotification("No editable configuration values to save.", "info");
+        return;
+      }
+      updateConfigStatus(
+        saveConfigBtn,
+        configStatus,
+        "Saving configuration...",
+        "info",
+        0,
+        false
+      );
+      try {
+        const response = await fetch(`${API_BASE_URL}/save_settings`, {
+          method: "POST",
+          headers: { "Content-Type": "application/json" },
+          body: JSON.stringify(configDataToSave),
         });
-    }
-
-    if (resetSettingsBtn) {
-        resetSettingsBtn.addEventListener('click', async () => {
-            if (!confirm("Are you sure you want to reset ALL settings to their initial defaults? This will affect config.yaml and UI preferences. This action cannot be undone.")) return;
-            updateConfigStatus(resetSettingsBtn, configStatus, 'Resetting settings...', 'info', 0, false);
-            try {
-                const response = await fetch(`${API_BASE_URL}/reset_settings`, {
-                    method: 'POST'
-                });
-                if (!response.ok) {
-                    const errorResult = await response.json().catch(() => ({ detail: 'Failed to reset settings on server.' }));
-                    throw new Error(errorResult.detail);
-                }
-                const result = await response.json();
-                updateConfigStatus(resetSettingsBtn, configStatus, result.message + " Reloading page...", 'success', 0, false);
-                setTimeout(() => window.location.reload(true), 2000);
-            } catch (error) {
-                console.error('Error resetting settings:', error);
-                updateConfigStatus(resetSettingsBtn, configStatus, `Reset Error: ${error.message}`, 'error', 0);
-                showNotification(`Error resetting settings: ${error.message}`, 'error');
-            }
+        const result = await response.json();
+        if (!response.ok)
+          throw new Error(result.detail || "Failed to save configuration");
+        updateConfigStatus(
+          saveConfigBtn,
+          configStatus,
+          result.message || "Configuration saved.",
+          "success",
+          5000
+        );
+        if (result.restart_needed && restartServerBtn)
+          restartServerBtn.classList.remove("hidden");
+        await fetchInitialData();
+        showNotification(
+          "Configuration saved. Some changes may require a server restart if prompted.",
+          "success"
+        );
+      } catch (error) {
+        console.error("Error saving server config:", error);
+        updateConfigStatus(
+          saveConfigBtn,
+          configStatus,
+          `Error: ${error.message}`,
+          "error",
+          0
+        );
+      }
+    });
+  }
+
+  if (saveGenDefaultsBtn && genDefaultsStatus) {
+    saveGenDefaultsBtn.addEventListener("click", async () => {
+      const genParams = {
+        temperature: parseFloat(temperatureSlider.value),
+        exaggeration: parseFloat(exaggerationSlider.value),
+        cfg_weight: parseFloat(cfgWeightSlider.value),
+        speed_factor: parseFloat(speedFactorSlider.value),
+        seed: parseInt(seedInput.value, 10) || 0,
+        language: languageSelect.value,
+      };
+
+      // Also save model selection
+      const modelConfig = {
+        use_multilingual: modelTypeSelect.value === "multilingual",
+      };
+
+      updateConfigStatus(
+        saveGenDefaultsBtn,
+        genDefaultsStatus,
+        "Saving generation defaults...",
+        "info",
+        0,
+        false
+      );
+      try {
+        const response = await fetch(`${API_BASE_URL}/save_settings`, {
+          method: "POST",
+          headers: { "Content-Type": "application/json" },
+          body: JSON.stringify({
+            generation_defaults: genParams,
+            model: modelConfig,
+          }),
         });
-    }
+        const result = await response.json();
+        if (!response.ok)
+          throw new Error(
+            result.detail || "Failed to save generation defaults"
+          );
+
+        let message = result.message || "Generation defaults saved.";
+        if (
+          result.restart_needed ||
+          modelConfig.use_multilingual !==
+            currentConfig?.model?.use_multilingual
+        ) {
+          message += " Server restart required to apply model change.";
+          showNotification(
+            "Model type changed. Please restart the server for changes to take effect.",
+            "warning",
+            8000
+          );
+        }
 
-    if (restartServerBtn) {
-        restartServerBtn.addEventListener('click', async () => {
-            if (!confirm("Are you sure you want to restart the server?")) return;
-            updateConfigStatus(restartServerBtn, configStatus, 'Attempting server restart...', 'processing', 0, false);
-            try {
-                const response = await fetch(`${API_BASE_URL}/restart_server`, {
-                    method: 'POST'
-                });
-                const result = await response.json();
-                if (!response.ok) throw new Error(result.detail || 'Server responded with error on restart command');
-                showNotification("Server restart initiated. Please wait a moment for the server to come back online, then refresh the page.", "info", 10000);
-            } catch (error) {
-                showNotification(`Server restart command failed: ${error.message}`, "error");
-                updateConfigStatus(restartServerBtn, configStatus, `Restart failed.`, 'error', 5000, true);
-            }
+        updateConfigStatus(
+          saveGenDefaultsBtn,
+          genDefaultsStatus,
+          message,
+          "success",
+          5000
+        );
+        if (currentConfig.generation_defaults)
+          Object.assign(currentConfig.generation_defaults, genParams);
+        if (currentConfig.model)
+          currentConfig.model.use_multilingual = modelConfig.use_multilingual;
+      } catch (error) {
+        console.error("Error saving generation defaults:", error);
+        updateConfigStatus(
+          saveGenDefaultsBtn,
+          genDefaultsStatus,
+          `Error: ${error.message}`,
+          "error",
+          0
+        );
+      }
+    });
+  }
+
+  if (resetSettingsBtn) {
+    resetSettingsBtn.addEventListener("click", async () => {
+      if (
+        !confirm(
+          "Are you sure you want to reset ALL settings to their initial defaults? This will affect config.yaml and UI preferences. This action cannot be undone."
+        )
+      )
+        return;
+      updateConfigStatus(
+        resetSettingsBtn,
+        configStatus,
+        "Resetting settings...",
+        "info",
+        0,
+        false
+      );
+      try {
+        const response = await fetch(`${API_BASE_URL}/reset_settings`, {
+          method: "POST",
         });
-    }
-
-    // --- File Upload & Refresh ---
-    async function handleFileUpload(fileInput, endpoint, successCallback, buttonToAnimate) {
-        const files = fileInput.files;
-        if (!files || files.length === 0) return;
-        const originalButtonHTML = buttonToAnimate ? buttonToAnimate.innerHTML : '';
-        if (buttonToAnimate) {
-            buttonToAnimate.innerHTML = `<svg class="animate-spin h-5 w-5 mr-1.5 inline-block" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"><circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle><path class="opacity-75" fill="currentColor" d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"></path></svg>Uploading...`;
-            buttonToAnimate.disabled = true;
-        }
-        const uploadNotification = showNotification(`Uploading ${files.length} file(s)...`, 'info', 0);
-        const formData = new FormData();
-        for (const file of files) formData.append('files', file);
-        try {
-            const response = await fetch(`${API_BASE_URL}${endpoint}`, {
-                method: 'POST',
-                body: formData
-            });
-            const result = await response.json();
-            if (uploadNotification) uploadNotification.remove();
-            if (!response.ok) throw new Error(result.message || result.detail || `Upload failed with status ${response.status}`);
-            if (result.errors && result.errors.length > 0) {
-                result.errors.forEach(err => showNotification(`Upload Warning: ${err.filename || 'File'} - ${err.error}`, 'warning', 10000));
-            }
-            const successfulUploads = result.uploaded_files || [];
-            if (successfulUploads.length > 0) {
-                showNotification(`Successfully uploaded: ${successfulUploads.join(', ')}`, 'success');
-            } else if (!result.errors || result.errors.length === 0) {
-                showNotification("Files processed. No new valid files were added or an issue occurred.", 'info');
-            }
-            successCallback(result);
-            debouncedSaveState();
-        } catch (error) {
-            console.error(`Error uploading to ${endpoint}:`, error);
-            if (uploadNotification) uploadNotification.remove();
-            showNotification(`Upload Error: ${error.message}`, 'error');
-        } finally {
-            if (buttonToAnimate) {
-                buttonToAnimate.disabled = false;
-                buttonToAnimate.innerHTML = originalButtonHTML;
-            }
-            fileInput.value = '';
+        if (!response.ok) {
+          const errorResult = await response
+            .json()
+            .catch(() => ({ detail: "Failed to reset settings on server." }));
+          throw new Error(errorResult.detail);
         }
-    }
-
-    if (cloneImportButton && cloneFileInput) {
-        cloneImportButton.addEventListener('click', () => cloneFileInput.click());
-        cloneFileInput.addEventListener('change', () => handleFileUpload(cloneFileInput, '/upload_reference', (result) => {
-            initialReferenceFiles = result.all_reference_files || [];
-            populateReferenceFiles();
-            const firstUploaded = result.uploaded_files?.[0];
-            if (firstUploaded && cloneReferenceSelect && Array.from(cloneReferenceSelect.options).some(opt => opt.value === firstUploaded)) {
-                cloneReferenceSelect.value = firstUploaded;
-            }
-        }, cloneImportButton));
-    }
-
-    if (predefinedVoiceImportButton && predefinedVoiceFileInput) {
-        predefinedVoiceImportButton.addEventListener('click', () => predefinedVoiceFileInput.click());
-        predefinedVoiceFileInput.addEventListener('change', () => handleFileUpload(predefinedVoiceFileInput, '/upload_predefined_voice', (result) => {
-            initialPredefinedVoices = result.all_predefined_voices || [];
-            populatePredefinedVoices();
-            const firstUploadedFilename = result.uploaded_files?.[0];
-            if (firstUploadedFilename && predefinedVoiceSelect && initialPredefinedVoices.some(v => v.filename === firstUploadedFilename)) {
-                predefinedVoiceSelect.value = firstUploadedFilename;
-            }
-        }, predefinedVoiceImportButton));
-    }
-
-    if (cloneRefreshButton && cloneReferenceSelect) {
-        cloneRefreshButton.addEventListener('click', async () => {
-            const originalButtonIcon = cloneRefreshButton.innerHTML;
-            cloneRefreshButton.innerHTML = `<svg class="animate-spin h-5 w-5" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"><circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle><path class="opacity-75" fill="currentColor" d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"></path></svg>`;
-            cloneRefreshButton.disabled = true;
-            try {
-                const response = await fetch(`${API_BASE_URL}/get_reference_files`);
-                if (!response.ok) throw new Error('Failed to fetch reference files list');
-                const files = await response.json();
-                initialReferenceFiles = files;
-                populateReferenceFiles();
-                showNotification("Reference file list refreshed.", 'info', 2000);
-                debouncedSaveState();
-            } catch (error) {
-                console.error("Error refreshing reference files:", error);
-                showNotification(`Error refreshing list: ${error.message}`, 'error');
-            } finally {
-                cloneRefreshButton.disabled = false;
-                cloneRefreshButton.innerHTML = originalButtonIcon;
-            }
+        const result = await response.json();
+        updateConfigStatus(
+          resetSettingsBtn,
+          configStatus,
+          result.message + " Reloading page...",
+          "success",
+          0,
+          false
+        );
+        setTimeout(() => window.location.reload(true), 2000);
+      } catch (error) {
+        console.error("Error resetting settings:", error);
+        updateConfigStatus(
+          resetSettingsBtn,
+          configStatus,
+          `Reset Error: ${error.message}`,
+          "error",
+          0
+        );
+        showNotification(`Error resetting settings: ${error.message}`, "error");
+      }
+    });
+  }
+
+  if (restartServerBtn) {
+    restartServerBtn.addEventListener("click", async () => {
+      if (!confirm("Are you sure you want to restart the server?")) return;
+      updateConfigStatus(
+        restartServerBtn,
+        configStatus,
+        "Attempting server restart...",
+        "processing",
+        0,
+        false
+      );
+      try {
+        const response = await fetch(`${API_BASE_URL}/restart_server`, {
+          method: "POST",
         });
+        const result = await response.json();
+        if (!response.ok)
+          throw new Error(
+            result.detail || "Server responded with error on restart command"
+          );
+        showNotification(
+          "Server restart initiated. Please wait a moment for the server to come back online, then refresh the page.",
+          "info",
+          10000
+        );
+      } catch (error) {
+        showNotification(
+          `Server restart command failed: ${error.message}`,
+          "error"
+        );
+        updateConfigStatus(
+          restartServerBtn,
+          configStatus,
+          `Restart failed.`,
+          "error",
+          5000,
+          true
+        );
+      }
+    });
+  }
+
+  // --- File Upload & Refresh ---
+  async function handleFileUpload(
+    fileInput,
+    endpoint,
+    successCallback,
+    buttonToAnimate
+  ) {
+    const files = fileInput.files;
+    if (!files || files.length === 0) return;
+    const originalButtonHTML = buttonToAnimate ? buttonToAnimate.innerHTML : "";
+    if (buttonToAnimate) {
+      buttonToAnimate.innerHTML = `<svg class="animate-spin h-5 w-5 mr-1.5 inline-block" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"><circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle><path class="opacity-75" fill="currentColor" d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"></path></svg>Uploading...`;
+      buttonToAnimate.disabled = true;
     }
-
-    if (predefinedVoiceRefreshButton && predefinedVoiceSelect) {
-        predefinedVoiceRefreshButton.addEventListener('click', async () => {
-            const originalButtonIcon = predefinedVoiceRefreshButton.innerHTML;
-            predefinedVoiceRefreshButton.innerHTML = `<svg class="animate-spin h-5 w-5" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"><circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle><path class="opacity-75" fill="currentColor" d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"></path></svg>`;
-            predefinedVoiceRefreshButton.disabled = true;
-            try {
-                const response = await fetch(`${API_BASE_URL}/get_predefined_voices`);
-                if (!response.ok) throw new Error('Failed to fetch predefined voices list');
-                const voices = await response.json();
-                initialPredefinedVoices = voices;
-                populatePredefinedVoices();
-                showNotification("Predefined voices list refreshed.", 'info', 2000);
-                debouncedSaveState();
-            } catch (error) {
-                console.error("Error refreshing predefined voices:", error);
-                showNotification(`Error refreshing list: ${error.message}`, 'error');
-            } finally {
-                predefinedVoiceRefreshButton.disabled = false;
-                predefinedVoiceRefreshButton.innerHTML = originalButtonIcon;
-            }
-        });
+    const uploadNotification = showNotification(
+      `Uploading ${files.length} file(s)...`,
+      "info",
+      0
+    );
+    const formData = new FormData();
+    for (const file of files) formData.append("files", file);
+    try {
+      const response = await fetch(`${API_BASE_URL}${endpoint}`, {
+        method: "POST",
+        body: formData,
+      });
+      const result = await response.json();
+      if (uploadNotification) uploadNotification.remove();
+      if (!response.ok)
+        throw new Error(
+          result.message ||
+            result.detail ||
+            `Upload failed with status ${response.status}`
+        );
+      if (result.errors && result.errors.length > 0) {
+        result.errors.forEach((err) =>
+          showNotification(
+            `Upload Warning: ${err.filename || "File"} - ${err.error}`,
+            "warning",
+            10000
+          )
+        );
+      }
+      const successfulUploads = result.uploaded_files || [];
+      if (successfulUploads.length > 0) {
+        showNotification(
+          `Successfully uploaded: ${successfulUploads.join(", ")}`,
+          "success"
+        );
+      } else if (!result.errors || result.errors.length === 0) {
+        showNotification(
+          "Files processed. No new valid files were added or an issue occurred.",
+          "info"
+        );
+      }
+      successCallback(result);
+      debouncedSaveState();
+    } catch (error) {
+      console.error(`Error uploading to ${endpoint}:`, error);
+      if (uploadNotification) uploadNotification.remove();
+      showNotification(`Upload Error: ${error.message}`, "error");
+    } finally {
+      if (buttonToAnimate) {
+        buttonToAnimate.disabled = false;
+        buttonToAnimate.innerHTML = originalButtonHTML;
+      }
+      fileInput.value = "";
     }
+  }
+
+  if (cloneImportButton && cloneFileInput) {
+    cloneImportButton.addEventListener("click", () => cloneFileInput.click());
+    cloneFileInput.addEventListener("change", () =>
+      handleFileUpload(
+        cloneFileInput,
+        "/upload_reference",
+        (result) => {
+          initialReferenceFiles = result.all_reference_files || [];
+          populateReferenceFiles();
+          const firstUploaded = result.uploaded_files?.[0];
+          if (
+            firstUploaded &&
+            cloneReferenceSelect &&
+            Array.from(cloneReferenceSelect.options).some(
+              (opt) => opt.value === firstUploaded
+            )
+          ) {
+            cloneReferenceSelect.value = firstUploaded;
+          }
+        },
+        cloneImportButton
+      )
+    );
+  }
+
+  if (predefinedVoiceImportButton && predefinedVoiceFileInput) {
+    predefinedVoiceImportButton.addEventListener("click", () =>
+      predefinedVoiceFileInput.click()
+    );
+    predefinedVoiceFileInput.addEventListener("change", () =>
+      handleFileUpload(
+        predefinedVoiceFileInput,
+        "/upload_predefined_voice",
+        (result) => {
+          initialPredefinedVoices = result.all_predefined_voices || [];
+          populatePredefinedVoices();
+          const firstUploadedFilename = result.uploaded_files?.[0];
+          if (
+            firstUploadedFilename &&
+            predefinedVoiceSelect &&
+            initialPredefinedVoices.some(
+              (v) => v.filename === firstUploadedFilename
+            )
+          ) {
+            predefinedVoiceSelect.value = firstUploadedFilename;
+          }
+        },
+        predefinedVoiceImportButton
+      )
+    );
+  }
+
+  if (cloneRefreshButton && cloneReferenceSelect) {
+    cloneRefreshButton.addEventListener("click", async () => {
+      const originalButtonIcon = cloneRefreshButton.innerHTML;
+      cloneRefreshButton.innerHTML = `<svg class="animate-spin h-5 w-5" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"><circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle><path class="opacity-75" fill="currentColor" d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"></path></svg>`;
+      cloneRefreshButton.disabled = true;
+      try {
+        const response = await fetch(`${API_BASE_URL}/get_reference_files`);
+        if (!response.ok)
+          throw new Error("Failed to fetch reference files list");
+        const files = await response.json();
+        initialReferenceFiles = files;
+        populateReferenceFiles();
+        showNotification("Reference file list refreshed.", "info", 2000);
+        debouncedSaveState();
+      } catch (error) {
+        console.error("Error refreshing reference files:", error);
+        showNotification(`Error refreshing list: ${error.message}`, "error");
+      } finally {
+        cloneRefreshButton.disabled = false;
+        cloneRefreshButton.innerHTML = originalButtonIcon;
+      }
+    });
+  }
+
+  if (predefinedVoiceRefreshButton && predefinedVoiceSelect) {
+    predefinedVoiceRefreshButton.addEventListener("click", async () => {
+      const originalButtonIcon = predefinedVoiceRefreshButton.innerHTML;
+      predefinedVoiceRefreshButton.innerHTML = `<svg class="animate-spin h-5 w-5" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24"><circle class="opacity-25" cx="12" cy="12" r="10" stroke="currentColor" stroke-width="4"></circle><path class="opacity-75" fill="currentColor" d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4zm2 5.291A7.962 7.962 0 014 12H0c0 3.042 1.135 5.824 3 7.938l3-2.647z"></path></svg>`;
+      predefinedVoiceRefreshButton.disabled = true;
+      try {
+        const response = await fetch(`${API_BASE_URL}/get_predefined_voices`);
+        if (!response.ok)
+          throw new Error("Failed to fetch predefined voices list");
+        const voices = await response.json();
+        initialPredefinedVoices = voices;
+        populatePredefinedVoices();
+        showNotification("Predefined voices list refreshed.", "info", 2000);
+        debouncedSaveState();
+      } catch (error) {
+        console.error("Error refreshing predefined voices:", error);
+        showNotification(`Error refreshing list: ${error.message}`, "error");
+      } finally {
+        predefinedVoiceRefreshButton.disabled = false;
+        predefinedVoiceRefreshButton.innerHTML = originalButtonIcon;
+      }
+    });
+  }
 
-    await fetchInitialData();
+  await fetchInitialData();
 });
diff --git a/uv.lock b/uv.lock
new file mode 100644
index 0000000..470a4e4
--- /dev/null
+++ b/uv.lock
@@ -0,0 +1,8 @@
+version = 1
+revision = 3
+requires-python = ">=3.10"
+
+[[package]]
+name = "chatterbox-tts-server"
+version = "0.1.0"
+source = { virtual = "." }

From 564d4e13e5c3ae7fa9a9bba4351bab28859cd392 Mon Sep 17 00:00:00 2001
From: Alok Kumar <kumaralok344@gmail.com>
Date: Tue, 11 Nov 2025 12:39:38 +0530
Subject: [PATCH 2/2] Update default language in config.yaml to English

---
 UI_CHANGES_SUMMARY.md | 151 ----------------------------
 UPGRADE_FOR_HINDI.md  | 224 ------------------------------------------
 config.yaml           |   2 +-
 3 files changed, 1 insertion(+), 376 deletions(-)
 delete mode 100644 UI_CHANGES_SUMMARY.md
 delete mode 100644 UPGRADE_FOR_HINDI.md

diff --git a/UI_CHANGES_SUMMARY.md b/UI_CHANGES_SUMMARY.md
deleted file mode 100644
index 531e7b9..0000000
--- a/UI_CHANGES_SUMMARY.md
+++ /dev/null
@@ -1,151 +0,0 @@
-# UI Changes for Multilingual Support
-
-## Summary
-Added UI controls to allow users to select between the multilingual and English-only TTS models, along with improved language selection and status indicators.
-
-## Changes Made
-
-### 1. Navigation Bar (index.html)
-**Added Model Status Badge** - Shows which model is currently loaded
-- Location: Next to the title in the navigation bar
-- Displays: "🌐 Multilingual (23 Languages)" or "🇬🇧 English Only"
-- Updates automatically based on server configuration
-- Provides visual feedback about the active model
-
-### 2. Generation Parameters Section (index.html)
-**Added Model Type Selector** - Dropdown to choose TTS model
-- Location: Just before the Language selector in the Generation Parameters
-- Options:
-  - "Multilingual (23 Languages)" - Supports 23 languages including Hindi
-  - "English Only" - Smaller model, English only
-- Note: Warns users that server restart is required to apply changes
-
-**Updated Language Selector**
-- Changed default selection from "en" (English) to "hi" (Hindi)
-- Added language codes in parentheses for clarity (e.g., "Hindi (hi)")
-- Updated help text to be more informative
-- Now properly syncs with the config.yaml default language setting
-
-### 3. JavaScript Updates (script.js)
-
-#### Model Type Handling
-- Added `modelTypeSelect` element reference
-- Loads current model type from server config on page load
-- Sets dropdown value based on `config.model.use_multilingual`
-
-#### Save Generation Parameters
-- Extended to save both generation parameters AND model type selection
-- Saves `model.use_multilingual` boolean value to server config
-- Shows warning notification when model type changes
-- Detects if restart is needed and notifies user
-
-#### Model Status Badge Updates
-- Badge text and icon update on page load based on config
-- Shows multilingual status with globe emoji (🌐) or English with flag (🇬🇧)
-- Tooltip provides detailed information about loaded model
-
-#### Change Detection & Warnings
-- Detects when user changes model type from current config
-- Shows persistent notification reminding user to:
-  1. Click "Save Generation Parameters"
-  2. Restart the server for changes to take effect
-- Prevents confusion about why model didn't change
-
-## User Workflow
-
-### To Switch Models:
-1. Open the UI in browser
-2. Look at the Generation Parameters section
-3. Find "TTS Model" dropdown
-4. Select desired model:
-   - "Multilingual (23 Languages)" - for Hindi and other languages
-   - "English Only" - for English-only use case
-5. Click "Save Generation Parameters" button
-6. Notice the warning about server restart
-7. Click "Restart Server" button (or manually restart)
-8. Refresh the page
-9. Verify the model status badge shows the new model
-
-### To Use Hindi (Default):
-1. Model is already set to Multilingual
-2. Language is already set to Hindi (hi)
-3. Simply enter Hindi text and generate
-
-### To Use Other Languages:
-1. Ensure Model Type is "Multilingual"
-2. Select desired language from Language dropdown
-3. Enter text in that language
-4. Generate speech
-
-## Visual Indicators
-
-### Model Status Badge (Top Navigation)
-```
-Current State          Badge Display
-------------------     -------------------------------
-Multilingual loaded -> "🌐 Multilingual (23 Languages)"
-English-only loaded -> "🇬🇧 English Only"
-```
-
-### Model Type Selector (Form)
-```
-Config Value                      Dropdown Shows
---------------------------------  ---------------------------
-use_multilingual: true        ->  "Multilingual (23 Languages)"
-use_multilingual: false       ->  "English Only"
-```
-
-### Language Selector (Form)
-- Shows all 23 supported languages
-- Hindi (hi) is selected by default
-- Language codes shown for clarity
-
-## Benefits
-
-1. **User-Friendly**: Clear visual feedback about which model is loaded
-2. **Flexible**: Easy switching between multilingual and English-only models
-3. **Informative**: Warnings and tooltips guide users through the process
-4. **Consistent**: UI state syncs with server configuration
-5. **Safe**: Clear warnings about server restart requirements
-
-## Technical Details
-
-### Config Values Saved
-```json
-{
-  "model": {
-    "use_multilingual": true  // or false
-  },
-  "generation_defaults": {
-    "language": "hi",  // or any of 23 supported codes
-    "temperature": 0.8,
-    "exaggeration": 0.5,
-    // ... other params
-  }
-}
-```
-
-### Supported Language Codes
-ar, da, de, el, en, es, fi, fr, he, **hi**, it, ja, ko, ms, nl, no, pl, pt, ru, sv, sw, tr, zh
-
-### Files Modified
-1. `ui/index.html` - Added model selector, updated language selector, added status badge
-2. `ui/script.js` - Added model handling logic, status updates, change detection
-3. `config.yaml` - Set default to multilingual model and Hindi language
-4. `config.py` - Added default multilingual setting to DEFAULT_CONFIG
-5. `engine.py` - Added multilingual model support
-6. `server.py` - Added language_id parameter handling
-
-## Testing Checklist
-
-- [ ] Model status badge shows correct model on page load
-- [ ] Model type selector reflects current config
-- [ ] Language selector defaults to Hindi
-- [ ] Changing model type shows notification
-- [ ] Save button updates config.yaml correctly
-- [ ] Restart button triggers server restart
-- [ ] After restart, new model is loaded
-- [ ] Badge updates after model change
-- [ ] Hindi text generates proper speech (not noise)
-- [ ] Other languages work correctly
-- [ ] English still works when English-only model selected
diff --git a/UPGRADE_FOR_HINDI.md b/UPGRADE_FOR_HINDI.md
deleted file mode 100644
index b40200b..0000000
--- a/UPGRADE_FOR_HINDI.md
+++ /dev/null
@@ -1,224 +0,0 @@
-# Upgrade Guide: Enable Hindi & Multilingual Support
-
-## Current Status
-
-Your Chatterbox TTS server is currently running with:
-- **chatterbox-tts version**: 0.1.2 (English-only)
-- **Model**: English-only TTS model
-- **Languages**: English only
-
-## To Enable Hindi & 22 Other Languages
-
-The code has been updated to support multilingual TTS, but you need to upgrade the `chatterbox-tts` package to access the multilingual model.
-
-### Step 1: Upgrade chatterbox-tts Package
-
-```bash
-# Stop the server first (Ctrl+C if running)
-
-# Activate your virtual environment
-source .venv/bin/activate
-
-# Upgrade to the latest version
-pip install --upgrade chatterbox-tts
-
-# Or install a specific version if available
-pip install chatterbox-tts>=0.1.4
-```
-
-### Step 2: Verify the Upgrade
-
-```bash
-# Check the installed version
-pip show chatterbox-tts
-
-# Verify multilingual support is available
-python -c "from chatterbox import ChatterboxMultilingualTTS; print('Multilingual support: Available')" || echo "Multilingual support: Not yet available"
-```
-
-### Step 3: Enable Multilingual in Configuration
-
-Edit `config.yaml`:
-
-```yaml
-model:
-  repo_id: ResembleAI/chatterbox
-  use_multilingual: true  # Change from false to true
-
-generation_defaults:
-  temperature: 0.8
-  exaggeration: 0.5
-  cfg_weight: 0.5
-  seed: 0
-  speed_factor: 1.0
-  language: hi  # Change from 'en' to 'hi' for Hindi
-```
-
-### Step 4: Restart the Server
-
-```bash
-./run.bash
-```
-
-Or manually:
-
-```bash
-source .venv/bin/activate
-python server.py
-```
-
-### Step 5: Verify Hindi Support
-
-Check the server logs for:
-```
-Successfully loaded Multilingual TTS model on mps. Supports 23 languages including Hindi.
-```
-
-Open the web UI and you should see:
-- Model badge showing: "🌐 Multilingual (23 Languages)"
-- Language selector with Hindi (hi) as default
-
-## Current Fallback Behavior
-
-The code has been designed to gracefully handle the missing multilingual model:
-
-1. **If multilingual is requested but not available**:
-   - Server logs a warning
-   - Automatically falls back to English-only model
-   - Server continues to run normally
-
-2. **If you try to generate Hindi audio**:
-   - Server logs a warning: "Language 'hi' requested but multilingual model not available"
-   - Generates English audio instead
-   - No errors or crashes
-
-## Supported Languages (After Upgrade)
-
-Once upgraded, your server will support these 23 languages:
-
-| Code | Language | Code | Language | Code | Language |
-|------|----------|------|----------|------|----------|
-| ar | Arabic | da | Danish | de | German |
-| el | Greek | en | English | es | Spanish |
-| fi | Finnish | fr | French | he | Hebrew |
-| **hi** | **Hindi** | it | Italian | ja | Japanese |
-| ko | Korean | ms | Malay | nl | Dutch |
-| no | Norwegian | pl | Polish | pt | Portuguese |
-| ru | Russian | sv | Swedish | sw | Swahili |
-| tr | Turkish | zh | Chinese | | |
-
-## Troubleshooting
-
-### Issue: Multilingual import still fails after upgrade
-
-**Solution**: The multilingual model might be in a different package or version:
-
-```bash
-# Check if there's a separate multilingual package
-pip search chatterbox-multilingual
-
-# Or check the Chatterbox GitHub for latest installation instructions
-# https://github.com/resemble-ai/chatterbox
-```
-
-### Issue: Model download fails
-
-**Solution**: The multilingual model is larger (~2-3GB). Ensure you have:
-- Sufficient disk space
-- Stable internet connection
-- Access to Hugging Face (not blocked by firewall)
-
-### Issue: Server shows "Multilingual model requested but not available"
-
-This means the package doesn't have the multilingual class yet. Check:
-
-```bash
-# Verify the package contents
-python -c "import chatterbox; print(dir(chatterbox))"
-
-# Look for ChatterboxMultilingualTTS in the output
-```
-
-### Issue: Still generating English audio for Hindi text
-
-Possible causes:
-1. Multilingual model not loaded (check logs)
-2. Config still has `use_multilingual: false`
-3. Model badge still shows "English Only"
-
-**Solution**: Follow all upgrade steps again and restart server.
-
-## Alternative: Use Latest GitHub Version
-
-If PyPI doesn't have the latest multilingual version yet:
-
-```bash
-# Install directly from GitHub
-pip uninstall chatterbox-tts -y
-pip install git+https://github.com/resemble-ai/chatterbox.git
-
-# Or clone and install locally
-git clone https://github.com/resemble-ai/chatterbox.git
-cd chatterbox
-pip install -e .
-```
-
-## Rollback to English-Only
-
-If you prefer to use only English:
-
-1. Edit `config.yaml`:
-   ```yaml
-   model:
-     use_multilingual: false
-   
-   generation_defaults:
-     language: en
-   ```
-
-2. Restart server
-
-The server will use the smaller, faster English-only model.
-
-## Code Changes Summary
-
-The following changes were made to support gradual migration:
-
-### engine.py
-- Added conditional import for multilingual model
-- Falls back gracefully if multilingual not available
-- Warns user to upgrade when multilingual is requested
-- Uses English-only model as fallback
-
-### config.yaml
-- Added `model.use_multilingual` setting (currently `false`)
-- Language default set to `en` (will be `hi` after upgrade)
-
-### UI
-- Model selector dropdown to switch between models
-- Status badge showing currently loaded model
-- Language selector for all 23 languages
-- Automatic warnings when changes require restart
-
-## Benefits After Upgrade
-
-✅ **Full Hindi Support** - Generate natural-sounding Hindi speech
-✅ **22 Additional Languages** - Support for multiple languages
-✅ **Better Quality** - Improved voice quality for non-English languages
-✅ **No Accent Issues** - Native language synthesis without English accent
-✅ **UI Ready** - All UI controls already in place
-
-## Next Steps
-
-1. Upgrade `chatterbox-tts` package
-2. Update `config.yaml` settings
-3. Restart server
-4. Test Hindi generation
-5. Enjoy multilingual TTS! 🎉
-
----
-
-**Need Help?**
-- Check server logs for detailed error messages
-- Review [Chatterbox GitHub](https://github.com/resemble-ai/chatterbox)
-- Check [Chatterbox Multilingual Demo](https://huggingface.co/spaces/ResembleAI/Chatterbox-Multilingual-TTS)
diff --git a/config.yaml b/config.yaml
index 30f827f..4fc64f0 100644
--- a/config.yaml
+++ b/config.yaml
@@ -25,7 +25,7 @@ generation_defaults:
   cfg_weight: 0.5
   seed: 0
   speed_factor: 1.0
-  language: hi
+  language: en
 audio_output:
   format: wav
   sample_rate: 24000