Libraries & SDKs | Sarvam API Docs

The official SDKs wrap every Sarvam AI API — Speech-to-Text, Text-to-Speech, Translation, Chat Completion, and Document Digitization — behind a typed, ergonomic client. They handle auth, serialization, multipart uploads, and error mapping so you don’t have to hand-roll HTTP calls.

Language	Package	Install
Python	`sarvamai` on PyPI	`pip install sarvamai`
JavaScript / TypeScript	`sarvamai` on npm	`npm install sarvamai`

Official SDKs vs. generated request snippets. The Python and JavaScript SDKs above are hand-tested, fully supported clients. The per-endpoint snippets you see in other languages (cURL, Go, Swift, etc.) on API Reference pages are auto-generated request examples — useful as a starting point, but only Python and JavaScript are first-class SDKs.

Initialize the client

Pass your API key directly, or let the SDK read it from the SARVAM_API_KEY environment variable.

1 from sarvamai import SarvamAI
2 
3 client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")
4 
5 response = client.text.translate(
6     input="Hello, how are you?",
7     source_language_code="auto",
8     target_language_code="hi-IN",
9 )
10 print(response)

Async usage

Both SDKs support fully asynchronous calls — use them when you need concurrency (e.g. fanning out many requests, or inside an async web server or voice agent).

1 import asyncio
2 from sarvamai import AsyncSarvamAI
3 
4 client = AsyncSarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")
5 
6 async def main():
7     # Run several translations concurrently
8     inputs = ["Good morning", "How are you?", "Thank you"]
9     results = await asyncio.gather(*[
10         client.text.translate(
11             input=text,
12             source_language_code="en-IN",
13             target_language_code="hi-IN",
14         )
15         for text in inputs
16     ])
17     for r in results:
18         print(r)
19 
20 asyncio.run(main())

In JavaScript, every method is already Promise-based — await it directly. In Python, the synchronous SarvamAI and asynchronous AsyncSarvamAI expose the exact same method names and arguments, so you can switch with a one-line change.

Timeouts and retries

Configure timeouts and automatic retries either globally (on the client) or per request. Per-request options override the client defaults.

1 from sarvamai import SarvamAI
2 
3 # Client-level: applies to every request
4 client = SarvamAI(
5     api_subscription_key="YOUR_SARVAM_API_KEY",
6     timeout=30.0,  # seconds
7 )
8 
9 # Per-request: override timeout and set automatic retries
10 response = client.text.translate(
11     input="Hello, how are you?",
12     source_language_code="auto",
13     target_language_code="hi-IN",
14     request_options={
15         "timeout_in_seconds": 60,
16         "max_retries": 3,
17     },
18 )

Retries use exponential backoff and apply to transient failures (HTTP 429, 5xx, and connection errors). For a full retry/backoff helper and idempotency guidance, see Errors & Troubleshooting.

Handling errors

The SDKs raise typed exceptions mapped to HTTP status codes, so you can catch exactly the failure you care about. Catch the base class to handle everything.

1 from sarvamai import SarvamAI, TooManyRequestsError
2 from sarvamai.core.api_error import ApiError
3 
4 client = SarvamAI(api_subscription_key="YOUR_SARVAM_API_KEY")
5 
6 try:
7     response = client.text.translate(
8         input="Hello, how are you?",
9         source_language_code="auto",
10         target_language_code="hi-IN",
11     )
12 except TooManyRequestsError:
13     print("Rate limited — back off and retry")
14 except ApiError as e:
15     # Base class for every API error
16     print(f"API error {e.status_code}: {e.body}")

Exception reference

HTTP status	Python exception	JavaScript exception
`400` Bad request	`BadRequestError`	`SarvamAI.BadRequestError`
`403` Forbidden / invalid key¹	`ForbiddenError`	`SarvamAI.ForbiddenError`
`404` Not found	`NotFoundError`	`SarvamAI.NotFoundError`
`413` Payload too large	`ContentTooLargeError`	`SarvamAI.ContentTooLargeError`
`422` Unprocessable entity	`UnprocessableEntityError`	`SarvamAI.UnprocessableEntityError`
`429` Rate limited / quota	`TooManyRequestsError`	`SarvamAI.TooManyRequestsError`
`500` Server error	`InternalServerError`	`SarvamAI.InternalServerError`
`503` Service unavailable	`ServiceUnavailableError`	`SarvamAI.ServiceUnavailableError`
Any of the above (base)	`ApiError`	`SarvamAIError`
Client-side timeout	`httpx.TimeoutException`	`SarvamAITimeoutError`

¹ Auth failures return HTTP 403 (invalid_api_key_error), not 401 — see the auth status-code note.

Streaming support

Several APIs stream results instead of returning a single response. Here’s what each SDK supports:

Capability	Method	Python	JavaScript
TTS over HTTP stream	`text_to_speech.convert_stream` / `textToSpeech.convertStream`	✅	✅
TTS over WebSocket	`text_to_speech_streaming` / `textToSpeechStreaming`	✅	✅
STT over WebSocket	`speech_to_text_streaming` / `speechToTextStreaming`	✅	✅
STT-translate over WebSocket (legacy — prefer STT WebSocket with `mode="translate"`)	`speech_to_text_translate_streaming` / `speechToTextTranslateStreaming`	✅	✅

HTTP streaming returns an iterable/BinaryResponse of raw audio bytes — see the HTTP Streaming guide.
WebSocket streaming uses the async clients (AsyncSarvamAI / SarvamAIClient) with an event-driven connection — see the TTS WebSocket and STT WebSocket guides.

Versioning

The SDKs follow semantic versioning. Pin a version in production and review the API changelog before upgrading across a major version. Always develop against the latest release:

$ pip install --upgrade sarvamai

Resources

Resource	Link
Example notebooks & agents	GitHub cookbook
Errors, retries & exceptions	Errors & Troubleshooting
Community support	Discord
Machine-readable API schema	`https://docs.sarvam.ai/openapi.json` · `https://docs.sarvam.ai/asyncapi.json`