Text-to-Speech Overview

Sarvam AI offers a powerful text-to-speech model:

View our pricing page for detailed information about model-specific pricing and usage tiers.

API Types

Supported Audio Formats & MIME Types

The TTS API support over 8 major audio formats and MIME type variants.Supported formats and MIME types are listed below:

Format GroupSupported MIME Types
MP3 Variantsmp3
WAV Variantswav
AAC Variantsaac
OPUS Formatopus
FLAC Variants (Lossless)flac
PCM LINEAR16pcm
MULAW (μ-law)mulaw
ALAW (A-law)alaw

Voice Sample

Female Speakers

Anushka – Clear and Professional

Audio Text: सरवम एआई की टेक्स्ट-टू-स्पीच सेवा 11 भारतीय भाषाओं में प्राकृतिक और पेशेवर आवाज़ें प्रदान करती है, जो विविध उपयोग मामलों के लिए उपयुक्त हैं।

Best Used For: Audiobooks, Professional Narration, Corporate Training

Male Speakers

Abhilash – Deep and Authoritative

Audio Text: Warning. Unusual activity detected in Zone 7. Immediate verification is required to maintain system integrity. Proceed with caution.

Best Used For: Security Systems, Announcements, Documentaries


Try it yourself: You can explore different speakers, languages, and styles directly at Sarvam Dashboard. Generate your own audio samples and experiment with custom input!

Next Steps

1

Choose Your API

Select the appropriate API type based on your use case.

3

Get API Key

Sign up and get your API key from the dashboard.

4

Go Live

Deploy your integration and monitor usage in the dashboard.

Need help choosing the right API? Contact us on discord for guidance.