Speech-to-Text APIs | Sarvam API Docs

API Types

Real-time API

Process short audio files synchronously with immediate response. Best for files under 1 minute.

Batch API

Handle large audio files asynchronously. Ideal for long recordings.

Streaming API

Real-time audio streaming with instant results. Perfect for live transcription.

Coming soon

Code Examples

Real-time API

Batch API

Streaming API

Synchronous Processing

Process short audio files with immediate response. Best for quick transcriptions and testing. Features include:

Instant results
Simple integration
Support for multiple audio formats
Maximum duration: 30 seconds

Python

JavaScript

cURL

1 from sarvamai import SarvamAI
2 
3 client = SarvamAI(
4   api_subscription_key="YOUR_SARVAM_API_KEY",
5 )
6 
7 response = client.speech_to_text.transcribe(
8   file=open("audio.wav", "rb"),
9   model="saarika:v2.5",
10   language_code="gu-IN"
11 )
12 
13 print(response)

API Features

Language Support

Multiple Indian languages and English support
Automatic language detection
High accuracy transcription

API Types

Real-Time API (under 30 seconds) - Batch API for longer files

Advanced Features

Speaker diarization (Batch API only)
Separate pricing for diarization
Real-time transcription

Next Steps

Choose Your API

Select the appropriate API type based on your use case.

Get API Key

Go Live

Deploy your integration and monitor usage in the dashboard.

Need help choosing the right API? Contact us on discord for guidance.