POST /speech-to-text-translate

Headers

api-subscription-key
string
default:

Body

multipart/form-data
file
file
required

The audio file to transcribe. Supported formats are WAV (.wav) and MP3 (.mp3). Works best at a 16 kHz sample rate. Multiple channels are merged.

prompt
string | null

Prompt to assist the transcription

model
enum<string>
default: saaras:v1

Model used to convert speech to text in the target language

Available options:
saaras:v1
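
Example request

The following is a minimal sketch of how the header and multipart fields above might be assembled in Python. The base URL is a placeholder (this reference does not state it), and the helper names `build_request` and `transcribe` are hypothetical. Sending uses the third-party `requests` package; the content types chosen for .wav and .mp3 are an assumption, not something this reference specifies.

```python
from pathlib import Path

# Placeholder: the base URL is not given in this reference; substitute the real one.
BASE_URL = "https://api.example.com"
ENDPOINT = "/speech-to-text-translate"

# Assumed content types for the two documented formats.
CONTENT_TYPES = {".wav": "audio/wav", ".mp3": "audio/mpeg"}


def build_request(audio_path, api_key, prompt=None, model="saaras:v1"):
    """Assemble the header, form fields, and file part for the request."""
    path = Path(audio_path)
    suffix = path.suffix.lower()
    if suffix not in CONTENT_TYPES:
        raise ValueError(f"unsupported audio format: {suffix!r}")
    headers = {"api-subscription-key": api_key}
    data = {"model": model}
    if prompt is not None:
        data["prompt"] = prompt  # optional field, left out when None
    file_part = (path.name, path.read_bytes(), CONTENT_TYPES[suffix])
    return headers, data, {"file": file_part}


def transcribe(audio_path, api_key, prompt=None, model="saaras:v1"):
    """Send the multipart/form-data request and return the decoded JSON body."""
    import requests  # imported here so build_request works without it installed

    headers, data, files = build_request(audio_path, api_key, prompt, model)
    resp = requests.post(
        BASE_URL + ENDPOINT, headers=headers, data=data, files=files, timeout=60
    )
    resp.raise_for_status()
    return resp.json()
```

Passing `files=` lets `requests` set the `multipart/form-data` content type and boundary itself, so the only header set by hand is `api-subscription-key`.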

Response

200 - application/json
transcript
string
required

Transcript of the provided speech

language_code
enum<string> | null
required

The BCP-47 code of the language spoken in the input. If multiple languages are detected, this is the code of the most predominant spoken language. If no language is detected, this is null.

Available options:
hi-IN,
bn-IN,
kn-IN,
ml-IN,
mr-IN,
od-IN,
pa-IN,
ta-IN,
te-IN,
gu-IN,
en-IN
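
Example response handling

As a sketch of how a client might read the 200 response described above, the snippet below checks the two required fields and allows `language_code` to be null. The helper name `parse_response` and the sample bodies in the usage note are hypothetical, not output captured from the API.

```python
import json

# Language codes listed in this reference for `language_code`.
LANGUAGE_CODES = {
    "hi-IN", "bn-IN", "kn-IN", "ml-IN", "mr-IN", "od-IN",
    "pa-IN", "ta-IN", "te-IN", "gu-IN", "en-IN",
}


def parse_response(body):
    """Return (transcript, language_code) from a JSON response body."""
    payload = json.loads(body)
    transcript = payload["transcript"]        # required
    language_code = payload["language_code"]  # required, but may be null
    if not isinstance(transcript, str):
        raise ValueError("transcript must be a string")
    if language_code is not None and language_code not in LANGUAGE_CODES:
        raise ValueError(f"unexpected language_code: {language_code!r}")
    return transcript, language_code
```

For instance, `parse_response('{"transcript": "hello", "language_code": null}')` returns `("hello", None)`, which is the case where no language was detected.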