Welcome to Sarvam AI Documentation

Sarvam provides models and APIs across the stack to help developers build applications. The APIs cover text translation, speech-to-text transcription, combined speech recognition and translation, text-to-speech, and question answering over recorded calls and written text.

Key Features

Translate Text

Use the /translate endpoint to translate text from one language to another. It supports 10 Indic languages along with English.
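As a minimal sketch of what a call to /translate might look like, the snippet below builds (but does not send) a JSON POST request using only the Python standard library. Only the /translate path comes from this page. The base URL, the `api-subscription-key` header name, the JSON field names, and the language codes are all assumptions made for illustration; check the Authentication Guide and the endpoint reference for the actual values.

```python
import json
import urllib.request

# Assumption: the base URL is not stated on this page.
BASE_URL = "https://api.sarvam.ai"


def build_translate_request(text, source_lang, target_lang, api_key):
    """Build (without sending) a POST request for the /translate endpoint.

    The field names and the header name below are assumptions, not
    confirmed by this page.
    """
    payload = {
        "input": text,
        "source_language_code": source_lang,
        "target_language_code": target_lang,
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/translate",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "api-subscription-key": api_key,  # assumed header name
        },
        method="POST",
    )


# Example: inspect the request before deciding to send it.
request = build_translate_request("Hello", "en-IN", "hi-IN", "YOUR_API_KEY")
print(request.get_method(), request.full_url)
```

Building the request separately from sending it makes it easy to inspect the URL and body; passing the result to `urllib.request.urlopen` would perform the actual call.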

Speech to Text

Use the /speech-to-text endpoint to convert spoken language into written text. The transcript is returned in the same language as the input audio.

Speech to Text Translate

Use the /speech-to-text-translate endpoint to combine speech recognition and translation, allowing you to convert spoken language directly into translated text.

Text to Speech

Use the /text-to-speech endpoint to convert written text into spoken words. It supports natural-sounding voices across 10 languages.

Call Analytics

Use the /call-analytics endpoint to answer questions about recorded calls or conversations.

Text Analytics

Use the /text-analytics endpoint to answer questions about written text.
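The endpoints listed in this section can be collected into a small lookup table. The sketch below is illustrative only: the endpoint paths come from this page, while the task names used as dictionary keys, the `endpoint_url` helper, and the base URL passed to it are hypothetical.

```python
# Endpoint paths as listed on this page; the dictionary keys are illustrative names.
ENDPOINTS = {
    "translate": "/translate",
    "speech_to_text": "/speech-to-text",
    "speech_to_text_translate": "/speech-to-text-translate",
    "text_to_speech": "/text-to-speech",
    "call_analytics": "/call-analytics",
    "text_analytics": "/text-analytics",
}


def endpoint_url(task: str, base_url: str) -> str:
    """Join a caller-supplied base URL (an assumption) with a known endpoint path."""
    if task not in ENDPOINTS:
        raise KeyError(f"Unknown task {task!r}; expected one of {sorted(ENDPOINTS)}")
    return base_url.rstrip("/") + ENDPOINTS[task]


# Example with a placeholder base URL.
print(endpoint_url("text_to_speech", "https://api.example.com"))
```

Keeping the paths in one place means a typo in an endpoint name fails early with a clear error instead of producing a malformed URL.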

Getting Started

To get started with Sarvam APIs, follow these steps:

  1. Authenticate: Learn how to authenticate your API requests in the Authentication Guide.
  2. Meta Prompt: Give our Meta Prompt to any AI chat model to supply the context it needs to use Sarvam’s APIs effectively. An example is available on AI Studio with Gemini’s latest model.
  3. Try Examples: Use the Usage Guides to see examples and best practices.
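
As a rough sketch of step 1, a script might read the API key from the environment instead of hard-coding it. The environment variable name `SARVAM_API_KEY`, the `auth_headers` helper, and the `api-subscription-key` header name are assumptions made for illustration; the Authentication Guide is the authoritative source for how requests are authenticated.

```python
import os


def auth_headers(env=None):
    """Return request headers carrying the API key.

    Assumptions: the key lives in an environment variable named
    SARVAM_API_KEY and is sent in an 'api-subscription-key' header.
    """
    env = os.environ if env is None else env
    key = env.get("SARVAM_API_KEY")
    if not key:
        raise RuntimeError("Set SARVAM_API_KEY before calling the API.")
    return {"api-subscription-key": key}


# Example: pass a mapping explicitly, which is convenient in tests.
headers = auth_headers({"SARVAM_API_KEY": "YOUR_API_KEY"})
```

Accepting the environment as a parameter keeps the helper easy to test and keeps the key out of source control.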

Need Help?

If you have any questions or need assistance, visit our Help section or reach out to us on our Discord.