How to set the sample rate (audio quality)

The speech_sample_rate parameter controls the audio quality and size of the generated output.

It is optional — if omitted, the default value is 22050 Hz (high quality).

Supported values:

Sample Rate (Hz)Audio Quality Description
8000Basic telephony quality
16000Good quality voice
22050Default high-quality audio
24000Premium audio quality (larger size)
1# Import SarvamAI client and save helper
2from sarvamai import SarvamAI
3from sarvamai.play import save
1# Initialize the SarvamAI client with your API key
2client = SarvamAI(api_subscription_key="YOUR_API_SUBSCRIPTION_KEY")
1# Example 1: Using default sample rate (22050 Hz)
2audio = client.text_to_speech.convert(
3 text="Hello, how are you today?",
4 model="bulbul:v2",
5 target_language_code="en-IN",
6 speech_sample_rate=22050
7)
1#Example 2: Using selected sample rate (24000 Hz)
2audio = client.text_to_speech.convert(
3 text="Hello, how are you today?",
4 model="bulbul:v2",
5 target_language_code="en-IN",
6 speech_sample_rate=24000 # Set premium audio quality
7)
1# Save the generated audio to a file
2save(audio, "output_pitch_up.wav")