Sarvam AI provides a purpose-built AI stack for building applications in Indian languages. Our models span speech-to-text, speech translation, text translation, and high-quality text-to-speech—designed specifically for India’s linguistic diversity, accents, and real-world usage patterns.
Each model is trained and evaluated on Indian languages and culturally grounded data, enabling higher accuracy in production scenarios. With simple, well-documented APIs and predictable performance, developers can build, deploy, and scale India-first AI experiences without managing model complexity.
New to building for Indian languages? Start with Building for Indian Languages — a practical guide to language coverage, code-mixing, scripts, native numerals, 8kHz telephony audio, and pronunciation control.
Available models: Saaras v3 — Speech to Text, Bulbul v3 — Text to Speech, Mayura — Text Translation, Sarvam-Translate — Extended Translation, Sarvam-30B — Chat LLM, Sarvam-105B — Flagship Chat LLM, Sarvam Vision — Document Intelligence.
State-of-the-art ASR with 23 language support (22 Indian + English) and multiple output modes: transcribe, translate, verbatim, translit, codemix.
Natural-sounding voices for 11 languages (10 Indian + English) with customizable pitch, pace, and speaker options.
High-quality translation between 11 languages (10 Indian + English) with context preservation.
Extended translation support for all 23 languages (22 Indian + English) with superior accuracy.
30B parameter multilingual chat model delivering strong reasoning and conversational capabilities at a balanced performance-to-cost ratio.
105B parameter flagship model — our most capable chat model for the highest quality Indian language understanding, reasoning, and generation.
Extract and digitize content from documents in 23 languages with accurate OCR and structured output.
mode="transcribe" to convert user speech to textPerfect for customer service, smart home devices, and accessibility applications.
The following models are still available but are being phased out. We recommend migrating to the newer models listed above.
24B parameter multilingual chat model with hybrid reasoning. Deprecated and no longer available through the API — migrate to Sarvam-30B or Sarvam-105B.
Legacy ASR model supporting 11 Indian languages. Migrate to Saaras v3 for improved accuracy and 23 language support.