11 changes: 7 additions & 4 deletions docs/introduction.mdx
@@ -3,15 +3,18 @@ title: Introduction
 icon: book
 ---

-Rime provides text-to-speech AI models built specifically for real-time conversation. These models boast a sub-200ms latency that maintains conversational flow with no awkward silences. The models are trained on natural speech patterns to give your AI agents a voice that customers actually want to talk to.
+Rime provides text-to-speech AI models built specifically for real-time conversation. The latest [Arcana v3](https://rime.ai/resources/arcana-v3) model delivers sub 120ms on-prem latency and 200ms via the cloud API. This is fast enough for mid-utterance control and barge-in with no awkward silences.

 Rime offers two flagship models:

-- **Arcana** produces ultra-realistic voices that capture the warmth and rhythm of human speech, including natural elements like laughter and breathing.
+- **Arcana** produces ultra-realistic voices that capture the warmth and rhythm of human speech, including natural elements like laughter and breathing.
 - **Mistv2** prioritizes speed and control, delivering accurate pronunciation with fine-grained customization options for high-volume applications.

-The Rime API supports **English**, **Spanish**, **French**, **German**, and **Hindi**, with voices across different demographics and accents. Rime uses phonetic markup to handle tricky brand names, currencies, and personal details (such as IDs and phone numbers), so you can customize models to create the perfect voice that represents your company and brand.
+Arcana v3 supports **multilingual codeswitching** across: English, Spanish, Hindi, Arabic, French, Portuguese, German, Japanese, Hebrew, and Tamil. This allows your agents to switch languages mid-conversation while maintaining the same voice identity.

 Rime supports flexible deployment options, from cloud API and virtual private cloud, to on-premises, with no concurrency limits.

-Ready to get started with Rime? Follow the [Python quickstart(/api-reference/quickstart-python) to begin generating text-to-speech with Rime's proprietary models in under five minutes.
+Rime uses phonetic markup to handle tricky brand names, currencies, and personal details (such as IDs and phone numbers), so you can customize models to create the perfect voice that represents your company and brand.
+
+Ready to get started with Rime? Follow the [Python quickstart](/api-reference/quickstart-python) to begin generating text-to-speech with Rime's proprietary models in under five minutes.