Speech To Text Translate WebSocket
WebSocket channel for real-time speech to text streaming with English translation.
Note: This API Reference page is provided for informational purposes only. The Try It playground may not provide the best experience for streaming audio. For optimal streaming performance, please use the SDK or implement your own WebSocket client.
HandshakeTry it
Headers
Query parameters
Speech to text model to use (defaults to “saaras:v2.5” if not specified)
Audio codec/format of the input file. Our API automatically detects all codec formats, but for PCM files specifically (pcm_s16le, pcm_l16, pcm_raw), you must pass this parameter. PCM files supports sample rate 16000 and 8000.
Enable high VAD (Voice Activity Detection) sensitivity
Send
Send audio data for real-time speech to text streaming with translation
Receive
Receive real-time transcription and translation results from the WebSocket