Bulbul

Bulbul v3 is our latest text-to-speech model, designed specifically for Indian languages and accents. It offers improved audio quality and 30+ speaker voices, and it supports up to 2500 characters per request.

Key Features

30+ Speaker Voices

Wide selection of natural-sounding voices including Shubh, Aditya, Ritu, Simran, Anand, Roopa, Priya, and more.

Extended Character Limit

Support for up to 2500 characters per request for longer content generation.

Sample Rate Options

Multiple sample rates: 8 kHz, 16 kHz, 22.05 kHz, and 24 kHz (default). Higher rates (32 kHz, 44.1 kHz, 48 kHz) are available only through the bulbul:v3 REST API.

Language Support

Support for 11 languages (10 Indian + English) with BCP-47 codes. The target language code is primarily used by the pre-TTS text normalization model.

Natural Prosody

Human-like speech patterns with natural intonation and emotional expression.

Pace Control

Adjustable speech speed from 0.5x to 2.0x for customized delivery.
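The limits listed in these features can be checked on the client before a request is sent. The sketch below is a minimal, hypothetical helper (`build_tts_options` is not part of the Sarvam SDK) that validates a sample rate and pace against the values stated on this page. The keyword names `speech_sample_rate` and `pace`, and the default pace of 1.0, are assumptions; confirm them against the API reference before relying on them.

```python
# Values taken from the feature descriptions on this page.
STANDARD_SAMPLE_RATES = {8000, 16000, 22050, 24000}
REST_ONLY_SAMPLE_RATES = {32000, 44100, 48000}  # bulbul:v3 REST API only
DEFAULT_SAMPLE_RATE = 24000
MIN_PACE, MAX_PACE = 0.5, 2.0


def build_tts_options(sample_rate=DEFAULT_SAMPLE_RATE, pace=1.0, rest_api=True):
    """Return validated request options (hypothetical helper, not an SDK call)."""
    allowed = set(STANDARD_SAMPLE_RATES)
    if rest_api:
        allowed |= REST_ONLY_SAMPLE_RATES
    if sample_rate not in allowed:
        raise ValueError(f"Unsupported sample rate: {sample_rate} Hz")
    if not MIN_PACE <= pace <= MAX_PACE:
        raise ValueError(
            f"Pace must be between {MIN_PACE} and {MAX_PACE}, got {pace}"
        )
    # Parameter names are assumed here; check the API reference.
    return {"speech_sample_rate": sample_rate, "pace": pace}


options = build_tts_options(sample_rate=16000, pace=1.25)
print(options)
```

Failing early with a `ValueError` keeps an out-of-range setting from costing a network round trip; whether the service itself rejects or clamps such values is not stated here.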

Language Support

Bulbul v3 supports the following 11 languages (10 Indian languages plus English):

Hindi (hi-IN), Bengali (bn-IN), Tamil (ta-IN), Telugu (te-IN), Gujarati (gu-IN), Kannada (kn-IN), Malayalam (ml-IN), Marathi (mr-IN), Punjabi (pa-IN), Odia (od-IN), English (en-IN)
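When the language is chosen by name (for example from a user setting), a small lookup table avoids typos in the code passed as the target language. The following is a sketch only: the mapping copies the codes listed above, and `language_code` is a hypothetical helper, not an SDK function.

```python
# Language names and codes exactly as listed on this page.
LANGUAGE_CODES = {
    "Hindi": "hi-IN",
    "Bengali": "bn-IN",
    "Tamil": "ta-IN",
    "Telugu": "te-IN",
    "Gujarati": "gu-IN",
    "Kannada": "kn-IN",
    "Malayalam": "ml-IN",
    "Marathi": "mr-IN",
    "Punjabi": "pa-IN",
    "Odia": "od-IN",
    "English": "en-IN",
}


def language_code(name):
    """Look up a code by language name, ignoring case and surrounding spaces."""
    key = name.strip().title()
    try:
        return LANGUAGE_CODES[key]
    except KeyError:
        raise ValueError(f"{name!r} is not in the Bulbul v3 language list") from None


print(language_code("tamil"))  # ta-IN
```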

Available Speakers

Bulbul v3 offers 30+ speaker voices:

Speakers: Shubh (default), Aditya, Ritu, Priya, Neha, Rahul, Pooja, Rohan, Simran, Kavya, Amit, Dev, Ishita, Shreya, Ratan, Varun, Manan, Sumit, Roopa, Kabir, Aayan, Ashutosh, Advait, Anand, Tanya, Tarun, Sunny, Mani, Gokul, Vijay, Shruti, Suhani, Mohit, Kavitha, Rehan, Soham, Rupali

Use the speaker parameter to select specific voices for your use case. Each speaker has unique characteristics suitable for different applications.
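As an illustration of the `speaker` parameter, the sketch below validates a voice name against the list above and passes it to the same `text_to_speech.convert` call used in the basic example. Two things here are assumptions rather than documented behavior: that speaker identifiers are sent in lowercase, and that `convert` accepts `speaker` as a keyword argument. `speaker_id` and `synthesize_with_speaker` are hypothetical helper names.

```python
# Speaker names as listed on this page.
SPEAKERS = [
    "Shubh", "Aditya", "Ritu", "Priya", "Neha", "Rahul", "Pooja", "Rohan",
    "Simran", "Kavya", "Amit", "Dev", "Ishita", "Shreya", "Ratan", "Varun",
    "Manan", "Sumit", "Roopa", "Kabir", "Aayan", "Ashutosh", "Advait",
    "Anand", "Tanya", "Tarun", "Sunny", "Mani", "Gokul", "Vijay", "Shruti",
    "Suhani", "Mohit", "Kavitha", "Rehan", "Soham", "Rupali",
]
DEFAULT_SPEAKER = "Shubh"
_KNOWN = {name.lower() for name in SPEAKERS}


def speaker_id(name=DEFAULT_SPEAKER):
    """Return the speaker name in lowercase (assumed ID format) if it is listed."""
    candidate = name.strip().lower()
    if candidate not in _KNOWN:
        raise ValueError(f"Unknown speaker: {name!r}")
    return candidate


def synthesize_with_speaker(client, text, speaker, language="en-IN"):
    """Call the TTS endpoint with an explicit voice.

    `client` is expected to be a SarvamAI client created as in the basic example.
    """
    return client.text_to_speech.convert(
        text=text,
        target_language_code=language,
        model="bulbul:v3",
        speaker=speaker_id(speaker),  # assumed keyword name
    )
```

Passing the client in as an argument keeps the helper easy to exercise with a stub object before a real API key is involved.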

Key Capabilities

Basic Usage

Convert text to speech with default settings. This is the simplest way to get started with Bulbul v3.

from sarvamai import SarvamAI
from sarvamai.play import play, save

client = SarvamAI(
    api_subscription_key="YOUR_SARVAM_API_KEY"
)

response = client.text_to_speech.convert(
    text="Hello, how are you today?",
    target_language_code="en-IN",
    model="bulbul:v3"
)

# Play the audio
play(response)

# Save the response to a file
save(response, "output.wav")
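Text longer than the 2500-character limit has to be split across several requests. The sketch below shows one simple way to do that, assuming that breaking on whitespace is acceptable for your content; `split_text` is a hypothetical helper, not part of the SDK. It collapses runs of whitespace (including newlines) into single spaces.

```python
MAX_CHARS = 2500  # per-request limit stated for Bulbul v3


def split_text(text, limit=MAX_CHARS):
    """Split text into chunks of at most `limit` characters, breaking on whitespace."""
    chunks = []
    current = ""
    for word in text.split():
        # A single word longer than the limit is cut into fixed-size pieces.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            # Adding this word would exceed the limit, so start a new chunk.
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


long_text = "word " * 1000  # 5000 characters of sample input
chunks = split_text(long_text)
print(len(chunks), max(len(chunk) for chunk in chunks))
```

Each chunk can then be sent through `client.text_to_speech.convert` in turn and the resulting audio joined in order. Splitting at sentence boundaries instead of arbitrary whitespace would likely give more natural pauses between chunks, at the cost of a slightly more involved splitter.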

Next Steps

Developer quickstart

Learn how to integrate the Bulbul v3 API within your application.

API Reference

Complete API documentation for text to speech endpoints.

Cookbook

Step-by-step tutorial for text-to-speech implementation.