Credits & Rate Limits
Credits
Sarvam offers ₹1,000 worth of free credits for every user on signup. These credits can be used across any of our APIs — explore, prototype, and build without upfront cost.
Credits are universal and never expire. Once exhausted, add more credits or upgrade your plan from the Sarvam Dashboard.
How Rate Limits Work
Rate limits restrict the number of API requests your account can make within a given time window. Key points:
- Per-account enforcement — limits apply to your account as a whole, not individual API keys. All keys share the same rate limit pool.
- Continuous replenishment — capacity refills steadily over the window period rather than resetting all at once (token bucket model). Short bursts may still trigger limits.
- Per-API granularity — each API has its own concurrency limits across three modes (provisioned, burst, and high throughput), configured based on your plan tier.
Rate Limit Tiers
Rate limits are applied per account based on your subscription plan. Your current tier is visible on the Dashboard → Rate Limits page.
Concurrency limits are measured per account, not per API key. All keys under an account share the same limit pool. Each API has its own provisioned, burst, and high throughput limits visible on the dashboard.
Per-API Concurrency Limits
Each API has its own concurrency limits across three modes. These are configured per account and visible on the Dashboard → Limits page.
The following APIs each have independent concurrency limits configured per account:
- Speech to Text (Real-time)
- Speech to Text (Streaming)
- Speech to Text (Batch)
- Text to Speech (Real-time)
- Text to Speech (Streaming)
- Translate & Text Services
- Chat Completion
Your exact per-API limits (provisioned, burst, and high throughput) are shown on the Dashboard → Limits page. Limits vary by plan and update instantly when you upgrade.
For batch endpoints (Speech-to-Text, Speech-to-Text-Translate), implement a minimum 5ms delay between consecutive status polling requests to avoid hitting rate limits unnecessarily.
Upgrading Your Limits
View plans and upgrade directly from the dashboard. Rate limits update instantly.
Need higher rate limits, dedicated infrastructure, or custom SLAs? Talk to our team.
Managing Your Credits
If your credits are exhausted, API requests will return errors. You can add credits at any time — adding credits does not change your plan or rate limits.
-
Add Credits — Top up from the Billing page at any time. Credits never expire.
-
Upgrade Your Plan — Higher plans include bonus credits and increased rate limits.
-
Enterprise — For volume discounts and custom billing arrangements, email developer@sarvam.ai.