REST Stream
Authentication
Request
The language code in BCP-47 format.
The speaker voice to be used for the output audio.
Default: Shubh (for bulbul:v3), Anushka (for bulbul:v2)
Note: Speaker selection must match the chosen model version.
Controls the pitch of the audio. Range: -0.75 to 0.75. Default is 0.0.
Note: Only supported for bulbul:v2.
Controls the loudness of the audio. Range: 0.3 to 3.0. Default is 1.0.
Note: Only supported for bulbul:v2.
Specifies the sample rate of the output audio. Default is 22050 Hz.
Note: The OPUS codec supports only 8000, 12000, 16000, 24000, and 48000 Hz.
Specifies the model to use for text-to-speech conversion. Default is bulbul:v2.
Controls the randomness of the output. Range: 0.01 to 1.0. Default is 0.6.
Note: Only supported for bulbul:v3.
Specifies the codec for the streamed output audio (e.g., ‘mp3’).
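The constraints listed above (model-specific options, value ranges, and the OPUS sample-rate restriction) can be checked on the client before a request is sent. The following is a minimal sketch, not the API's official client. The field names in the payload (`text`, `language_code`, `speaker`, `pitch`, `loudness`, `sample_rate`, `model`, `temperature`, `codec`) are hypothetical placeholders, since this page does not show the real parameter names, and the `"mp3"` codec default is an assumption taken from the example value above.

```python
from typing import Any, Dict, Optional

# Values taken from the parameter descriptions on this page.
OPUS_SAMPLE_RATES = {8000, 12000, 16000, 24000, 48000}
DEFAULT_SPEAKERS = {"bulbul:v2": "Anushka", "bulbul:v3": "Shubh"}


def _check_range(name: str, value: float, low: float, high: float) -> None:
    """Raise ValueError if value falls outside the inclusive range."""
    if not low <= value <= high:
        raise ValueError(f"{name} must be between {low} and {high}, got {value}")


def build_tts_payload(
    text: str,
    language_code: str,
    model: str = "bulbul:v2",
    speaker: Optional[str] = None,
    pitch: Optional[float] = None,
    loudness: Optional[float] = None,
    temperature: Optional[float] = None,
    sample_rate: int = 22050,
    codec: str = "mp3",  # assumed default; the page gives 'mp3' only as an example
) -> Dict[str, Any]:
    """Build a request body with hypothetical field names, validating documented limits."""
    if model not in DEFAULT_SPEAKERS:
        raise ValueError(f"unknown model: {model}")

    payload: Dict[str, Any] = {
        "text": text,
        "language_code": language_code,  # BCP-47, e.g. "hi-IN"
        "model": model,
        "speaker": speaker or DEFAULT_SPEAKERS[model],
        "sample_rate": sample_rate,
        "codec": codec,
    }

    # pitch and loudness are documented as bulbul:v2 only.
    if pitch is not None:
        if model != "bulbul:v2":
            raise ValueError("pitch is only supported for bulbul:v2")
        _check_range("pitch", pitch, -0.75, 0.75)
        payload["pitch"] = pitch
    if loudness is not None:
        if model != "bulbul:v2":
            raise ValueError("loudness is only supported for bulbul:v2")
        _check_range("loudness", loudness, 0.3, 3.0)
        payload["loudness"] = loudness

    # temperature is documented as bulbul:v3 only.
    if temperature is not None:
        if model != "bulbul:v3":
            raise ValueError("temperature is only supported for bulbul:v3")
        _check_range("temperature", temperature, 0.01, 1.0)
        payload["temperature"] = temperature

    # OPUS accepts only a fixed set of sample rates.
    if codec.lower() == "opus" and sample_rate not in OPUS_SAMPLE_RATES:
        raise ValueError(f"OPUS does not support a sample rate of {sample_rate} Hz")

    return payload
```

Note that with this sketch, choosing `codec="opus"` while leaving the sample rate at its 22050 Hz default raises an error, because 22050 is not in the OPUS list; the caller has to pick a supported rate such as 24000 explicitly.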
Response
Success. Returns a streamed audio response in the requested format (e.g., audio/mpeg for MP3, audio/wav for WAV).
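Because the audio arrives as a stream, a client can write it out chunk by chunk instead of buffering the whole body. The sketch below shows one way to do that using only the standard library. The helper names `extension_for` and `write_audio_stream` are hypothetical, and the mapping covers only the two content types named above; anything else falls back to `.bin`. The chunk iterable could come from any HTTP client that exposes the body incrementally (for example, `iter_content` on a `requests` response opened with `stream=True`).

```python
from typing import BinaryIO, Iterable

# Content types named in the response description; extend as needed.
CONTENT_TYPE_EXTENSIONS = {"audio/mpeg": ".mp3", "audio/wav": ".wav"}


def extension_for(content_type: str) -> str:
    """Map a Content-Type header value to a file extension."""
    # Drop any parameters such as "; charset=..." before the lookup.
    media_type = content_type.split(";", 1)[0].strip().lower()
    return CONTENT_TYPE_EXTENSIONS.get(media_type, ".bin")


def write_audio_stream(chunks: Iterable[bytes], sink: BinaryIO) -> int:
    """Write each non-empty chunk to sink as it arrives; return bytes written."""
    total = 0
    for chunk in chunks:
        if not chunk:  # skip empty keep-alive chunks
            continue
        sink.write(chunk)
        total += len(chunk)
    return total
```

A caller would typically open the output file in binary mode, pick the extension from the response's Content-Type header, and pass the chunk iterator straight to `write_audio_stream`.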