Text-to-Speech Rest API
Synchronous Processing
Convert text to speech with immediate response. Best for quick conversions and testing. Features include:- Instant audio generation
- Multiple voice options
- Customizable speech parameters
- Various audio formats
API Features
- Multiple speaker voices
- Adjustable speech parameters
- High-quality audio output
- Natural prosody and intonation
- Multiple audio file formats
- Base64 encoded string
- Configurable sample rates
- Pitch control
- Speech rate adjustment
- Language selection
Model Information
Our flagship text-to-speech model designed for Indian languages and accents.
Key Features:
- Natural-sounding speech with human-like prosody
- Multiple voice personalities
- Multi-language support
- Real-time synthesis capabilities
- Fine-grained control over pitch, pace, and loudness
Supports 11 Indian languages with BCP-47 codes:
Supported Languages:
- English (en-IN)
- Hindi (hi-IN)
- Bengali (bn-IN)
- Tamil (ta-IN)
- Telugu (te-IN)
- Kannada (kn-IN)
- Malayalam (ml-IN)
- Marathi (mr-IN)
- Gujarati (gu-IN)
- Punjabi (pa-IN)
- Odia (od-IN)
Bulbul: Our Text to Speech Model
Bulbul is our state-of-the-art text-to-speech model that excels in generating natural-sounding speech with support for multiple Indian languages and various voice options.
Text to Speech Features
Basic Synthesis
Voice Selection
Advanced Options
Basic Text to Speech Synthesis
Convert text to natural-sounding speech with high quality. Features include:
- Multiple voice options
- Support for Indian languages
- Natural prosody and intonation
- High-quality audio output
- For numbers > 4 digits, use commas (e.g., ‘10,000’)
- Enable preprocessing for better numbers, dates handling
API Response Format
Supported audio formats: WAV (default), MP3, Linear16, Mulaw, Alaw, Opus, FLAC, AAC
Decoding Audio Examples
Python:
JavaScript:
Error Responses
All errors return a JSON object with an error field containing details about what went wrong.
Error Response Structure
Error Codes Reference
Example Error Response
Error Handling Code Example
Check out our detailed API Reference to explore Text to Speech and all available options.
Need help? Contact us on discord for guidance.