Text-to-Speech Overview
Sarvam AI offers a powerful text-to-speech model:
View our pricing page for detailed information about model-specific pricing and usage tiers.
API Types
Generate speech for short text with immediate response. Best for quick conversions up to 1000 characters.
Stream long or live text into speech with low latency. Ideal for real-time playback, WebSocket-based async use, and efficient resource handling.
Supported Audio Formats & MIME Types
The TTS API support over 8 major audio formats and MIME type variants.Supported formats and MIME types are listed below:
Voice Sample
Female Speakers
Anushka
Vidya
Manisha
Arya
Anushka – Clear and Professional
Audio Text: सरवम एआई की टेक्स्ट-टू-स्पीच सेवा 11 भारतीय भाषाओं में प्राकृतिक और पेशेवर आवाज़ें प्रदान करती है, जो विविध उपयोग मामलों के लिए उपयुक्त हैं।
Best Used For: Audiobooks, Professional Narration, Corporate Training
Male Speakers
Abhilash
Karun
Hitesh
Abhilash – Deep and Authoritative
Audio Text: Warning. Unusual activity detected in Zone 7. Immediate verification is required to maintain system integrity. Proceed with caution.
Best Used For: Security Systems, Announcements, Documentaries
Try it yourself: You can explore different speakers, languages, and styles directly at Sarvam Dashboard. Generate your own audio samples and experiment with custom input!
Next Steps
Need help choosing the right API? Contact us on discord for guidance.