Models
Sarvam AI provides a purpose-built AI stack for building applications in Indian languages. Our models span speech-to-text, speech translation, text translation, and high-quality text-to-speech—designed specifically for India’s linguistic diversity, accents, and real-world usage patterns.
Each model is trained and evaluated on Indian languages and culturally grounded data, enabling higher accuracy in production scenarios. With simple, well-documented APIs and predictable performance, developers can build, deploy, and scale India-first AI experiences without managing model complexity.
Model Selection Guide
State-of-the-art ASR supporting 23 languages (22 Indian + English), with multiple output modes: transcribe, translate, verbatim, translit, codemix.
Natural-sounding voices for 11 languages (10 Indian + English) with customizable pitch, pace, and speaker options.
High-quality translation between 11 languages (10 Indian + English) with context preservation.
Extended translation support for all 23 languages (22 Indian + English) with superior accuracy.
Multilingual chat model with advanced reasoning capabilities for Indian language conversations.
Extract and digitize content from documents in 23 languages with accurate OCR and structured output.
Language Support Overview
Saaras v3 & Sarvam Translate: Complete 23 Language Support (22 Indian + English)
Model Language Support Summary
Use Cases
Voice Assistant
Content Localization
Call Center Analytics
Educational Platform
Document Processing
Build a multilingual voice assistant
- Speech Input: Use Saaras v3 with mode="transcribe" to convert user speech to text
- Understanding: Process with Sarvam-M for intelligent responses
- Speech Output: Convert responses to speech with Bulbul
Perfect for customer service, smart home devices, and accessibility applications.
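The three stages of the voice assistant can be sketched as a small pipeline. This is a minimal illustration of the flow only, not the Sarvam SDK: the VoiceAssistant class, its field names, and the fake_* stand-in functions are all hypothetical. In a real application, each stand-in would be replaced by a call to the corresponding model (Saaras v3 with mode="transcribe", Sarvam-M, and Bulbul) as described in the API documentation.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceAssistant:
    """Hypothetical wrapper that chains the three stages of one voice turn."""

    transcribe: Callable[[bytes], str]   # stage 1: speech -> text (e.g. Saaras v3)
    respond: Callable[[str], str]        # stage 2: text -> reply (e.g. Sarvam-M)
    synthesize: Callable[[str], bytes]   # stage 3: reply -> speech (e.g. Bulbul)

    def handle_turn(self, audio_in: bytes) -> bytes:
        """Run one user utterance through all three stages."""
        text = self.transcribe(audio_in)
        if not text.strip():
            # Nothing usable was recognized, so skip the later stages.
            raise ValueError("no speech recognized")
        reply = self.respond(text)
        return self.synthesize(reply)


# Stand-in stages (assumptions for illustration; they do not call any API).
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_respond(text: str) -> str:
    return f"You said: {text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


if __name__ == "__main__":
    assistant = VoiceAssistant(fake_transcribe, fake_respond, fake_synthesize)
    audio_out = assistant.handle_turn("नमस्ते".encode("utf-8"))
    print(audio_out.decode("utf-8"))
```

Passing the stages in as plain callables keeps the pipeline independent of any particular client library, so each model call can be swapped or mocked in tests without changing the turn-handling logic.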
Legacy Models
The following models are still available but are being phased out. We recommend migrating to the newer models listed above.