Sonus 🎧✨

AI-powered, immersive audiobook creation with granular voice control, powered by Murf AI. Transform your stories into lifelike audio, one sentence at a time.

🚀 Overview

Sonus is a platform for creating immersive audiobooks with AI. Upload your manuscript, optionally optimize its content, and generate professional-grade audio with sentence-level voice customization. Sonus uses Murf AI, Facebook BART, and Gemini Flash 2.0 to recommend voice settings for each sentence, while you stay in control of the final result.

✨ Features

📄 Multi-format Upload: Supports .txt, .pdf, and .docx.
🧠 AI Content Optimization: Facebook BART suggests an improved or summarized version of your content.
💸 Cost Transparency: See character usage, cost incurred, and savings instantly.
🎭 Character Recognition: Gemini Flash 2.0 identifies characters and assigns voices.
🎚️ Granular Voice Control: Per-sentence Murf AI voice, pitch, rate, and style recommendations.
🔁 Real-time Preview: Instantly stream audio for each block using Murf's streaming API.
✏️ Block Editing: Update text, delete blocks, and override voice settings.
🎵 Unified Export: Combine all blocks into a single MP3 via Murf's generation API.
🗂️ File Management: Preview, download, delete, and update files easily.
🔒 Secure Auth: User authentication and authorization powered by Clerk.
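The cost-transparency feature above boils down to comparing character counts before and after optimization. The following is a minimal sketch of that arithmetic, not the project's actual implementation: `CostReport`, `estimate_cost`, and the `price_per_1k_chars` rate are hypothetical names and values chosen for illustration, and real pricing depends on your Murf AI plan.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CostReport:
    """Character usage and cost for original vs. optimized text (hypothetical type)."""
    original_chars: int
    optimized_chars: int
    original_cost: float
    optimized_cost: float

    @property
    def chars_saved(self) -> int:
        return self.original_chars - self.optimized_chars

    @property
    def savings(self) -> float:
        return round(self.original_cost - self.optimized_cost, 4)


def estimate_cost(original: str, optimized: str, price_per_1k_chars: float) -> CostReport:
    """Compare the two texts using an assumed flat price per 1,000 characters."""
    def cost(text: str) -> float:
        return round(len(text) / 1000 * price_per_1k_chars, 4)

    return CostReport(len(original), len(optimized), cost(original), cost(optimized))


# Illustrative numbers only: 2,000 characters reduced to 1,500 at an assumed rate.
report = estimate_cost("a" * 2000, "a" * 1500, price_per_1k_chars=0.5)
print(report.chars_saved, report.original_cost, report.optimized_cost, report.savings)
```

Keeping the rate as a parameter avoids hard-coding a price that may change between plans.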

🛠️ Tech Stack

Layer	Technology
Frontend	Next.js
Backend	Flask
AI Models	Facebook BART, Gemini Flash 2.0
TTS Engine	Murf AI
Storage	Supabase Buckets
Auth	Clerk
Deployment	Vercel, Render

🏗️ Workflow

1. Upload your document (.txt, .pdf, or .docx).
2. Choose between the original and the AI-optimized content (with cost analysis).
3. Analyze: Gemini Flash 2.0 identifies characters and recommends Murf AI voice settings per sentence.
4. Review & Edit:
   - Preview each sentence/block with real-time AI voice.
   - Edit text, or override voice, pitch, rate, or style.
5. Export: Combine all blocks into a single MP3.
6. Manage: Preview, download, delete, and update your files.
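The review step in the workflow above implies a per-sentence block that carries recommended settings plus any user overrides. Here is one way that could be modeled, as a rough sketch: the `Block` class, `split_sentences` helper, and the regex-based splitting are assumptions for illustration, not the backend's actual data model or tokenizer.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Block:
    """One sentence-level block (hypothetical structure)."""
    text: str
    recommended: dict                              # settings suggested by the analysis step
    overrides: dict = field(default_factory=dict)  # settings changed by the user

    def effective_settings(self) -> dict:
        # User overrides win over recommendations, key by key.
        return {**self.recommended, **self.overrides}


def split_sentences(text: str) -> list[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


sentences = split_sentences("The door creaked. Who is there? Nobody answered!")
recommended = {"voiceId": "en-UK-aiden", "pitch": -10, "rate": -10, "style": "Narration"}
blocks = [Block(text=s, recommended=dict(recommended)) for s in sentences]

# The user overrides only the pitch of the second block.
blocks[1].overrides["pitch"] = 0
print(blocks[1].effective_settings())
```

A regex splitter like this mishandles abbreviations such as "Dr." and quoted dialogue, so a production pipeline would likely need a proper sentence tokenizer.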

🧩 API Reference

Full documentation & examples in our Postman Collection.

🎙️ Example: Voice Recommendation Response

{
  "config": {
    "voiceId": "en-UK-aiden",
    "pitch": -10,
    "rate": -10,
    "style": "Narration",
    "sampleRate": 44100,
    "channelType": "MONO",
    "format": "MP3",
    "text": "On the edge of a quiet village nestled between silver hills..."
  },
  "analysis": {
    "primary_emotion": "Nostalgia",
    "preferred_accent": "UK",
    "preferred_age": "middle_aged",
    "preferred_gender": "male"
  },
  "reasoning": {
    "pitch_rationale": "Pitch set to -10 based on emotional intensity...",
    "rate_rationale": "Rate set to -10 based on urgency level...",
    "style_rationale": "Selected 'Narration' based on Nostalgia emotion...",
    "voice_choice": "A middle-aged UK male voice conveys warmth and experience."
  }
}
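A client consuming this response will typically pull out the `config` object and sanity-check it before requesting audio. The sketch below shows one possible approach; `extract_voice_config` is a hypothetical helper, and the -50 to 50 limits for pitch and rate are an assumption that should be checked against the Murf AI documentation.

```python
import json

# Assumed limits for pitch and rate; verify against the Murf AI API reference.
PITCH_RANGE = (-50, 50)
RATE_RANGE = (-50, 50)


def extract_voice_config(raw: str) -> dict:
    """Parse a voice recommendation response and return its validated config."""
    payload = json.loads(raw)
    config = payload["config"]

    missing = {"voiceId", "text"} - config.keys()
    if missing:
        raise ValueError(f"missing config fields: {sorted(missing)}")

    for name, (low, high) in (("pitch", PITCH_RANGE), ("rate", RATE_RANGE)):
        value = config.get(name, 0)
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside {low}..{high}")

    return config


# Trimmed version of the example response above.
sample = json.dumps({
    "config": {
        "voiceId": "en-UK-aiden",
        "pitch": -10,
        "rate": -10,
        "style": "Narration",
        "text": "On the edge of a quiet village nestled between silver hills...",
    },
    "analysis": {"primary_emotion": "Nostalgia"},
})

config = extract_voice_config(sample)
print(config["voiceId"], config["pitch"], config["rate"])
```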

🏁 Getting Started

Clone the repo

git clone https://github.com/your-org/sonus.git
cd sonus

Install dependencies

Frontend: cd frontend && npm install
Backend: cd backend && pip install -r requirements.txt

Set up environment variables

Supabase, Clerk, and Murf AI API keys.
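A quick startup check can catch missing keys before the backend begins serving requests. This is only a sketch: the variable names in `REQUIRED_VARS` are hypothetical placeholders, so substitute whatever names the frontend and backend actually read.

```python
import os

# Hypothetical variable names; replace with the ones the project expects.
REQUIRED_VARS = ("MURF_API_KEY", "CLERK_SECRET_KEY", "SUPABASE_URL", "SUPABASE_KEY")


def missing_env_vars(env=os.environ) -> list[str]:
    """Return the required variables that are unset or empty in `env`."""
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        print("Missing environment variables:", ", ".join(missing))
    else:
        print("All required environment variables are set.")
```

Passing the environment as a parameter keeps the check easy to test without touching the real process environment.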

Deploy

Frontend: Vercel
Backend: Render

Import API collection

Use the provided Postman Collection for API testing.

📂 File Structure


sonus/
├── frontend/ # Next.js app
├── backend/ # Flask API
├── Murf.postman_collection.json
└── README.md

🌐 Useful Links

🤝 Contributing

We welcome contributions! Please open issues, submit PRs, or suggest features.
