Text-to-Speech Overview
Sarvam AI offers a powerful text-to-speech model.
View our pricing page for detailed information about model-specific pricing and usage tiers.
API Types
Real Time API
Generate speech for short text with immediate response. Best for quick conversions up to 1000 characters.
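Text longer than the 1000-character limit has to be split before it is sent to the Real Time API. The following is a minimal sketch of one way to do that in Python, not an official client. The helper name `split_for_real_time` and the choice to break on whitespace are assumptions made for illustration; no request is sent.

```python
MAX_CHARS = 1000  # character limit stated above for the Real Time API


def split_for_real_time(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Hypothetical helper: breaks on whitespace where possible and
    hard-splits any single word that is longer than the limit.
    """
    chunks: list[str] = []
    current = ""
    for word in text.split():
        # A word longer than the limit cannot fit in any chunk, so cut it.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be submitted as a separate request. Splitting on sentence boundaries instead of whitespace would likely give more natural pauses between chunks.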
Streaming API
Stream long or live text into speech with low latency. Ideal for real-time playback, WebSocket-based async use, and efficient resource handling.
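The choice between the two API types can be reduced to a simple rule based on the descriptions above. This is only an illustrative sketch: the function name `choose_api`, the `live` flag, and the returned labels are hypothetical and are not part of any Sarvam AI SDK.

```python
REAL_TIME_MAX_CHARS = 1000  # limit stated above for the Real Time API


def choose_api(text: str, live: bool = False) -> str:
    """Return which API type suits the input (hypothetical helper).

    Long text, or text that arrives incrementally (`live=True`),
    goes to streaming; short, complete text goes to real time.
    """
    if live or len(text) > REAL_TIME_MAX_CHARS:
        return "streaming"
    return "real-time"
```

For example, `choose_api("Hello")` selects the Real Time API, while a 5,000-character article or a live chat transcript would be routed to the Streaming API.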
Supported Audio Formats & MIME Types
The TTS API supports over 8 major audio formats and MIME type variants. Supported formats and MIME types are listed below:
Check out our detailed API Reference to explore Text To Speech Generation and all available options.
Next Steps
Need help choosing the right API? Contact us on Discord for guidance.