Speech-to-Text APIs

API Types

Real-time API

Process short audio files synchronously with immediate response. Best for files under 1 minute.

Batch API

Handle large audio files asynchronously with webhook callbacks. Ideal for long recordings.

Streaming API

Real-time audio streaming with instant results. Perfect for live transcription.

Coming soon

Code Examples

Synchronous Processing

Process short audio files with immediate response. Best for quick transcriptions and testing. Features include:

  • Instant results
  • Simple integration
  • Support for multiple audio formats
  • Maximum duration: 1 minute
1from sarvamai import SarvamAI
2
3client = SarvamAI(
4api_subscription_key="YOUR_SARVAM_API_KEY",
5)
6
7response = client.speech_to_text.transcribe(
8file=open("audio.wav", "rb"),
9model="saarika:v1",
10language_code="gu-IN"
11)
12
13print(response)

API Features

Language Support
  • Multiple Indian languages and English support
  • Automatic language detection
  • High accuracy transcription
API Types
  • Real-Time API (under 30 seconds) - Batch API for longer files - Immediate results for Real-Time
Advanced Features
  • Speaker diarization (Batch API only)
  • Separate pricing for diarization
  • Real-time transcription

Next Steps

1

Choose Your API

Select the appropriate API type based on your use case.

3

Get API Key

Sign up and get your API key from the dashboard.

4

Go Live

Deploy your integration and monitor usage in the dashboard.

Need help choosing the right API? Contact us on discord for guidance.