For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
CommunityAPI StatusAPI PricingSign Up
DocumentationAPI ReferencesCookbookIntegrationDeveloper Tools
DocumentationAPI ReferencesCookbookIntegrationDeveloper Tools
  • Getting Started
    • Welcome
    • Quickstart
    • SDKs & Libraries
    • Building for Indian Languages
    • Models
    • Credits & Rate Limits
    • Errors & Troubleshooting
    • Talk to us
    • Pricing
    • Changelog
  • API Guides & Tutorials
      • Overview
      • Which API to Use
      • Rest API
      • Batch API
      • Streaming API
        • Select Output Mode
        • Specify Language Codes
        • Enable Speaker Diarization
      • FAQs
LogoLogo
CommunityAPI StatusAPI PricingSign Up
On this page
  • Output Mode Comparison
  • When to Use Each Mode
  • Example Code
API Guides & TutorialsSpeech to TextHow-to

How to select output mode

||View as Markdown|
Was this page helpful?
Previous

How to specify language codes

Next
Built with

Saaras v3 supports multiple output modes to handle different transcription and translation needs. Use the mode parameter to specify how you want the audio processed.

The mode parameter is only available for Saaras v3. For legacy Saarika v2.5, only basic transcription is supported.

Output Mode Comparison

For the same input audio saying: “मेरा फोन नंबर है 9840950950” (My phone number is 9840950950)

ModeDescriptionExample Output
transcribeStandard transcription with number normalizationमेरा फोन नंबर है 9840950950
translateTranslate to EnglishMy phone number is 9840950950
verbatimExact word-for-word, preserves spoken numbersमेरा फोन नंबर है नौ आठ चार zero नौ पांच zero नौ पांच zero
translitRomanized/Latin scriptmera phone number hai 9840950950
codemixEnglish words in English, Indic words in native scriptमेरा phone number है 9840950950

When to Use Each Mode

ModeBest For
transcribeCall recordings, meetings, voice notes, general transcription
translateAnalytics dashboards, English-only systems, international teams
verbatimLegal transcriptions, compliance, preserving exact spoken content
translitSystems that only support Latin characters, search indexing
codemixHinglish conversations, mixed-language customer support

Example Code

Transcribe
Translate
Verbatim
Translit
Codemix
1from sarvamai import SarvamAI
2
3client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")
4
5# Standard transcription in original language
6response = client.speech_to_text.transcribe(
7 file=open("audio.wav", "rb"),
8 model="saaras:v3",
9 language_code="hi-IN",
10 mode="transcribe" # Default mode
11)
12
13print(response.transcript)
14# Output: मेरा फोन नंबर है 9840950950