How to select output mode

Saaras v3 supports multiple output modes to handle different transcription and translation needs. Use the mode parameter to specify how you want the audio processed.

The mode parameter is only available for Saaras v3. For legacy Saarika v2.5, only basic transcription is supported.

Output Mode Comparison

For the same input audio saying: “मेरा फोन नंबर है 9840950950” (My phone number is 9840950950)

ModeDescriptionExample Output
transcribeStandard transcription with number normalizationमेरा फोन नंबर है 9840950950
translateTranslate to EnglishMy phone number is 9840950950
verbatimExact word-for-word, preserves spoken numbersमेरा फोन नंबर है नौ आठ चार zero नौ पांच zero नौ पांच zero
translitRomanized/Latin scriptmera phone number hai 9840950950
codemixEnglish words in English, Indic words in native scriptमेरा phone number है 9840950950

When to Use Each Mode

ModeBest For
transcribeCall recordings, meetings, voice notes, general transcription
translateAnalytics dashboards, English-only systems, international teams
verbatimLegal transcriptions, compliance, preserving exact spoken content
translitSystems that only support Latin characters, search indexing
codemixHinglish conversations, mixed-language customer support

Example Code

1from sarvamai import SarvamAI
2
3client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")
4
5# Standard transcription in original language
6response = client.speech_to_text.transcribe(
7 file=open("audio.wav", "rb"),
8 model="saaras:v3",
9 language_code="hi-IN",
10 mode="transcribe" # Default mode
11)
12
13print(response.transcript)
14# Output: मेरा फोन नंबर है 9840950950