Skip to content

KDSCRIPT/Sonus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

Sonus 🎧✨

AI-powered, immersive audiobook creation with granular voice control, powered by Murf AI. Transform your stories into lifelike audio, one sentence at a time.


🚀 Overview

Sonus is an innovative platform for creating immersive audiobooks with AI. Upload your manuscript, optimize your content, and generate professional-grade audio with sentence-level voice customization. Sonus leverages Murf AI, Facebook Bart, and Gemini Flash 2.0 to recommend the best voice settings—while you stay in control.


✨ Features

  • 📄 Multi-format Upload: Supports .txt, .pdf, and .docx.
  • 🧠 AI Content Optimization: Facebook Bart suggests improved/summarized content.
  • 💸 Cost Transparency: See character usage, cost incurred, and savings instantly.
  • 🎭 Character Recognition: Gemini Flash 2.0 identifies characters and assigns voices.
  • 🎚️ Granular Voice Control: Per-sentence Murf AI voice, pitch, rate, and style recommendations.
  • 🔁 Real-time Preview: Instantly stream audio for each block using Murf's streaming API.
  • ✏️ Block Editing: Update text, delete blocks, and override voice settings.
  • 🎵 Unified Export: Combine all blocks into a single MP3 via Murf's generation API.
  • 🗂️ File Management: Preview, download, delete, and update files easily.
  • 🔒 Secure Auth: User authentication and authorization powered by Clerk.

🛠️ Tech Stack

Layer Technology
Frontend Next.js
Backend Flask
AI Models Facebook Bart, Gemini Flash 2.0
TTS Engine Murf AI
Storage Supabase Buckets
Auth Clerk
Deployment Vercel, Render

🏗️ Workflow

  1. Upload your document (.txt, .pdf, .docx).
  2. Choose between original or AI-optimized content (with cost analysis).
  3. Analyze: Gemini Flash 2.0 identifies characters and recommends Murf AI voice settings per sentence.
  4. Review & Edit:
    • Preview each sentence/block with real-time AI voice.
    • Edit text, override voice, pitch, rate, or style.
  5. Export: Combine all blocks into a single MP3.
  6. Manage: Preview, download, delete, and update your files.

🧩 API Reference

Full documentation & examples in our Postman Collection.

🎙️ Example: Voice Recommendation Response

{
  "config": {
    "voiceId": "en-UK-aiden",
    "pitch": -10,
    "rate": -10,
    "style": "Narration",
    "sampleRate": 44100,
    "channelType": "MONO",
    "format": "MP3",
    "text": "On the edge of a quiet village nestled between silver hills..."
  },
  "analysis": {
    "primary_emotion": "Nostalgia",
    "preferred_accent": "UK",
    "preferred_age": "middle_aged",
    "preferred_gender": "male"
  },
  "reasoning": {
    "pitch_rationale": "Pitch set to -10 based on emotional intensity...",
    "rate_rationale": "Rate set to -10 based on urgency level...",
    "style_rationale": "Selected 'Narration' based on Nostalgia emotion...",
    "voice_choice": "A middle-aged UK male voice conveys warmth and experience."
  }
}

🏁 Getting Started

  1. Clone the repo

git clone https://github.com/your-org/sonus.git cd sonus

  1. Install dependencies
  • Frontend: cd frontend && npm install
  • Backend: cd backend && pip install -r requirements.txt
  1. Set up environment variables
  • Supabase, Clerk, and Murf AI API keys.
  1. Deploy
  1. Import API collection

📂 File Structure


sonus/
├── frontend/ # Next.js app
├── backend/ # Flask API
├── Murf.postman_collection.json
└── README.md


🌐 Useful Links

Live Demo API Docs


🤝 Contributing

We welcome contributions! Please open issues, submit PRs, or suggest features.


MIT License Built with Murf AI

About

Sonus is an AI-powered platform that brings stories to life by transforming written content into immersive audiobooks—one sentence at a time. By combining advanced NLP models like Facebook BART and Gemini Flash 2.0 with industry-leading voice synthesis from Murf AI, Sonus enables creators to generate high-quality, emotionally rich audio experience

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors