Saaras-v2
Overview
Saaras-v2 is our flagship domain-aware speech recognition model, designed for production environments requiring high accuracy and robust performance.
Key Features
Advanced prompting system for domain-specific translation and hotword retention, ensuring accurate context preservation.
Optimized for 8 kHz telephony audio with enhanced multi-speaker recognition capabilities.
Preserves proper nouns and entities accurately across languages, maintaining context and meaning.
Built-in Language Identification (LID) with confidence scores for automatic language detection.
Provides diarized outputs with precise timestamps for multi-speaker conversations through the batch API.
Key Capabilities
Basic Usage
Code-Mixed Speech
Automatic Language Detection
Domain Prompting
Basic transcription with a specified language code. Best suited to single-language content with clear audio.
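As a rough illustration of basic usage, the Python sketch below assembles a transcription request with a fixed language code and an optional domain prompt. It is not the official client: the endpoint URL, the `api-subscription-key` header name, the `saaras:v2` model identifier, the form field names (`model`, `language_code`, `prompt`, `file`), and the `hi-IN` code format are all assumptions made for illustration. Take the real values from the API Reference.

```python
from typing import Dict, Optional

# All constants below are placeholders for illustration, not confirmed values.
API_URL = "https://api.example.com/speech-to-text"  # hypothetical endpoint
API_KEY_HEADER = "api-subscription-key"             # assumed header name
MODEL_NAME = "saaras:v2"                            # assumed model identifier


def build_transcription_request(
    audio_path: str,
    language_code: str,
    api_key: str,
    prompt: Optional[str] = None,
) -> Dict[str, object]:
    """Assemble the parts of a multipart transcription request without sending it."""
    if not language_code:
        raise ValueError("language_code is required for basic transcription")
    data = {"model": MODEL_NAME, "language_code": language_code}
    if prompt:
        # Optional domain prompt (assumed field name).
        data["prompt"] = prompt
    return {
        "url": API_URL,
        "headers": {API_KEY_HEADER: api_key},
        "data": data,
        "audio_path": audio_path,
    }


def send_transcription_request(request: Dict[str, object]) -> dict:
    """Send the request with the third-party `requests` library and return the parsed JSON."""
    import requests  # imported lazily so the builder works without it installed

    with open(request["audio_path"], "rb") as audio_file:
        response = requests.post(
            request["url"],
            headers=request["headers"],
            data=request["data"],
            files={"file": audio_file},  # assumed upload field name
            timeout=60,
        )
    response.raise_for_status()
    return response.json()
```

A call might look like `send_transcription_request(build_transcription_request("call.wav", "hi-IN", api_key))`. Keeping request construction separate from the network call makes it possible to inspect the payload before any audio is uploaded.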
For detailed API documentation and advanced usage, visit our API Reference.