Credits & Rate Limits
Sarvam offers ₹100 worth of free credits for every user on signup. These credits can be used across any of our APIs — explore, prototype, and build without upfront cost.
Credits are universal and never expire. Once exhausted, add more credits or upgrade your plan from the Sarvam Dashboard.
Rate limits restrict the number of API requests your account can make within a given time window. Key points:
Rate limits vary significantly by API type and plan. Review the limits for each API below before building your integration.
For batch endpoints, implement a minimum 5ms delay between consecutive status polling requests to avoid hitting rate limits unnecessarily.
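The polling guidance above could be sketched roughly as follows. This is a minimal illustration, not Sarvam's client: `poll_until_done`, the `get_status` callable, and the status strings `"completed"` and `"failed"` are hypothetical placeholders for whatever your batch job's status call actually returns. The only value taken from this page is the 5 ms minimum delay.

```python
import time
from typing import Callable, FrozenSet

MIN_POLL_DELAY_S = 0.005  # the 5 ms minimum delay stated in the guidance above


def poll_until_done(
    get_status: Callable[[], str],
    terminal_states: FrozenSet[str] = frozenset({"completed", "failed"}),  # assumed names
    delay_s: float = MIN_POLL_DELAY_S,
    timeout_s: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call get_status() repeatedly, pausing between calls, until a terminal state."""
    # Never poll faster than the documented minimum, even if the caller asks to.
    delay_s = max(delay_s, MIN_POLL_DELAY_S)
    deadline = time.monotonic() + timeout_s
    while True:
        status = get_status()
        if status in terminal_states:
            return status
        if time.monotonic() + delay_s > deadline:
            raise TimeoutError(f"job still {status!r} after {timeout_s} s")
        sleep(delay_s)
```

In practice a longer delay (for example one second or more) is usually kinder to your request budget, since every status poll presumably counts against the same limits as other requests.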
For the bulbul:v3 model specifically, the Starter rate limit is 30 req/min. Pro and Business limits are the same as the default above.
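One way to stay under a per-minute limit such as the 30 req/min Starter limit is to throttle on the client side. The sketch below uses a sliding-window counter; `SlidingWindowLimiter` is a hypothetical helper, and this page does not say how the server measures its window, so treat the sliding-window choice as an assumption rather than a description of Sarvam's enforcement.

```python
import time
from collections import deque
from typing import Callable, Deque


class SlidingWindowLimiter:
    """Allow at most max_requests calls in any window_s-second span (client side)."""

    def __init__(
        self,
        max_requests: int = 30,   # Starter limit for bulbul:v3 quoted above
        window_s: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self._max_requests = max_requests
        self._window_s = window_s
        self._clock = clock
        self._sleep = sleep
        self._sent: Deque[float] = deque()  # timestamps of recent requests

    def acquire(self) -> float:
        """Block until a request slot is free; return the seconds spent waiting."""
        waited = 0.0
        while True:
            now = self._clock()
            # Forget requests that have aged out of the window.
            while self._sent and now - self._sent[0] >= self._window_s:
                self._sent.popleft()
            if len(self._sent) < self._max_requests:
                self._sent.append(now)
                return waited
            # Window is full: wait until the oldest request expires.
            pause = self._window_s - (now - self._sent[0])
            self._sleep(pause)
            waited += pause
```

Call `limiter.acquire()` immediately before each API request. Because limits are shared across all keys on an account, a single limiter instance per process only helps if that process is the account's sole caller.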
For the bulbul:v3 model specifically, the Starter rate limit is 30 concurrent connections. Pro and Business limits are the same as the default above.
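A concurrency limit differs from a per-minute limit: it caps how many connections are open at once, not how many are started over time. A minimal sketch of capping in-flight work with `asyncio.Semaphore` follows. `run_bounded` is a hypothetical helper, and each job stands in for whatever coroutine opens and uses a streaming connection in your code.

```python
import asyncio
from typing import Any, Awaitable, Callable, List, Sequence


async def run_bounded(
    jobs: Sequence[Callable[[], Awaitable[Any]]],
    max_concurrent: int = 30,  # Starter concurrency limit for bulbul:v3 quoted above
) -> List[Any]:
    """Run every job, keeping at most max_concurrent of them in flight at once."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(job: Callable[[], Awaitable[Any]]) -> Any:
        # A job only starts once a slot is free; the slot is released when it ends.
        async with semaphore:
            return await job()

    # gather preserves the order of the input jobs in its results.
    return await asyncio.gather(*(guarded(job) for job in jobs))
```

Setting `max_concurrent` a little below the account limit leaves headroom for other processes that share the same account pool.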
These large models have lower limits due to their compute requirements.
Applies to: sarvam-30b, sarvam-105b
Vision API limits are uniform across all plans (Starter, Pro, and Business). Upgrading your plan does not increase Vision limits.
Rate limits are measured per account, not per API key. All keys under an account share the same limit pool. Your current limits are visible on the Dashboard → Rate Limits page.
View plans and upgrade directly from the dashboard. Rate limits update instantly.
Need higher rate limits, dedicated infrastructure, or custom SLAs? Talk to our team.
If your credits are exhausted, API requests will return errors. You can add credits at any time — adding credits does not change your plan or rate limits.