How to set the sample rate (audio quality)

The speech_sample_rate parameter controls the audio quality and size of the generated output.

It is optional — if omitted, the default value is 24000 Hz (premium quality).

Supported values:

Sample Rate (Hz)Audio Quality DescriptionAvailability
8000Basic telephony qualityAll models & modes
16000Good quality voiceAll models & modes
22050High-quality audio (default for v2)All models & modes
24000Premium audio quality (default for v3)All models & modes
32000Broadcast qualitybulbul:v3 REST API only
44100CD quality audiobulbul:v3 REST API only
48000Professional/Studio qualitybulbul:v3 REST API only

Important: Sample rates above 24000 Hz (32000, 44100, 48000) are only available with bulbul:v3 via the REST API. They are not supported in streaming mode.

Example Code

1from sarvamai import SarvamAI
2from sarvamai.play import save
3
4# Initialize the REST client
5client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")
6
7# Generate speech using REST
8audio = client.text_to_speech.convert(
9 text="Welcome to Sarvam AI!",
10 model="bulbul:v3",
11 target_language_code="en-IN",
12 speech_sample_rate=24000 # Set premium audio quality
13)
14save(audio, "output1.wav")