
# Text-to-Speech Overview

> Complete overview of the Sarvam AI Text-to-Speech APIs, which use the Bulbul v3 model. Convert text to natural speech with real-time and streaming options for Indian languages.

Sarvam AI's text-to-speech model is [Bulbul v3](/api-reference-docs/getting-started/models/bulbul), which provides 30+ voices and high-quality, natural speech synthesis for Indian languages.

## API Types

Available API types: [REST API](/api-reference-docs/api-guides-tutorials/text-to-speech/rest-api) for quick conversions up to 2500 characters, and [Streaming API](/api-reference-docs/api-guides-tutorials/text-to-speech/streaming-api) for real-time audio via HTTP stream or WebSocket.

**REST API**: Generates speech for short text with an immediate response. Best for quick conversions up to 2500 characters.

**Streaming API**: Streams audio in real time, either via a single HTTP POST for simple pipelines or over a persistent WebSocket connection for interactive voice agents.
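Because the REST API is described as suited to text of up to 2500 characters, longer input has to be divided before it is sent. The following is a minimal sketch of one way to do that in Python. The helper name `split_for_rest` is hypothetical, and the sketch assumes the limit counts characters the same way Python's `len` does, which this page does not confirm.

```python
MAX_REST_CHARS = 2500  # character limit stated for the REST API on this page


def split_for_rest(text: str, limit: int = MAX_REST_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Hypothetical helper: prefers to cut at a space so words stay intact,
    and falls back to a hard cut when a single run has no spaces.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")

    chunks = []
    remaining = text.strip()
    while len(remaining) > limit:
        # Find the last space at or before the limit.
        cut = remaining.rfind(" ", 0, limit + 1)
        if cut <= 0:
            cut = limit  # no usable space: cut mid-word
        chunks.append(remaining[:cut].rstrip())
        remaining = remaining[cut:].lstrip()
    if remaining:
        chunks.append(remaining)
    return chunks
```

Each returned chunk can then be submitted as its own request, and the resulting audio segments concatenated in order. Cutting at sentence boundaries instead of spaces would likely give more natural prosody at the joins.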

Not sure which one fits your latency and interactivity needs? See [Which Text-to-Speech API to Use](/api-reference-docs/api-guides-tutorials/text-to-speech/which-api-to-use) for a side-by-side comparison of REST, HTTP streaming, and WebSocket.
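As a rough illustration of that choice, the rule of thumb described in this section can be written as a small Python function. This is a sketch only: the names `TtsApi` and `choose_tts_api` are hypothetical, and routing text longer than 2500 characters to HTTP streaming is an assumption, since this page does not state the streaming API's own input limits.

```python
from enum import Enum


class TtsApi(Enum):
    """Hypothetical labels for the three options named on this page."""
    REST = "rest"
    HTTP_STREAM = "http-stream"
    WEBSOCKET = "websocket"


REST_CHAR_LIMIT = 2500  # REST is described as suited to text up to this length


def choose_tts_api(text_length: int,
                   interactive: bool = False,
                   needs_streaming: bool = False) -> TtsApi:
    """Pick an API type from coarse requirements (illustrative only)."""
    if interactive:
        # Persistent connection for interactive voice agents.
        return TtsApi.WEBSOCKET
    if needs_streaming or text_length > REST_CHAR_LIMIT:
        # Single HTTP POST that streams audio back; the length rule
        # here is an assumption, not a documented limit.
        return TtsApi.HTTP_STREAM
    return TtsApi.REST
```

For example, `choose_tts_api(120)` selects the REST API, while `choose_tts_api(120, interactive=True)` selects the WebSocket option. The linked comparison page remains the authoritative guide.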

## Supported Audio Formats & MIME Types

The TTS API supports the audio formats listed below, each shown with the identifier used to select it:

| Format Group                 | Supported MIME Types |
| ---------------------------- | -------------------- |
| **MP3 Variants**             | `mp3`                |
| **WAV Variants**             | `wav`                |
| **AAC Variants**             | `aac`                |
| **OPUS Format**              | `opus`               |
| **FLAC Variants (Lossless)** | `flac`               |
| **PCM LINEAR16**             | `pcm`                |
| **MULAW (μ-law)**            | `mulaw`              |
| **ALAW (A-law)**             | `alaw`               |
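A client can check a requested format against the table above before making a call. The sketch below is one way to do that in Python. The function names are hypothetical, and using the format identifier as a file extension is an assumption made for illustration, not something the API prescribes.

```python
# Format identifiers copied from the table above.
SUPPORTED_FORMATS = ("mp3", "wav", "aac", "opus", "flac", "pcm", "mulaw", "alaw")


def normalize_format(name: str) -> str:
    """Return the lowercase identifier, or raise if it is not in the table."""
    key = name.strip().lower()
    if key not in SUPPORTED_FORMATS:
        raise ValueError(
            f"Unsupported audio format {name!r}; expected one of {SUPPORTED_FORMATS}"
        )
    return key


def output_filename(stem: str, audio_format: str) -> str:
    """Build a local file name for saved audio (extension choice is an assumption)."""
    return f"{stem}.{normalize_format(audio_format)}"
```

Validating locally gives a clear error message early instead of relying on the server to reject an unknown value.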

**Experience the voices**: Head to [dashboard.sarvam.ai](https://dashboard.sarvam.ai/text-to-speech) to explore 30+ speaker voices, test different languages, and generate audio samples with custom input.

## Next Steps

1. Select the appropriate API type based on your use case.
2. Sign up and get your API key from the [dashboard](https://dashboard.sarvam.ai).
3. Deploy your integration and monitor usage in the dashboard.

Need help choosing the right API? Contact us on [Discord](https://discord.com/invite/5rAsykttcs) for guidance.