Speech to Text AI Marketplace
AI can transcribe speech into text, supporting multiple file types such as mp3, wav, and flac.
Header Parameters |
---|
X-API-Key string — REQUIREDYour API key |
Request Body |
---|
files binary — REQUIREDAudio speech file |
num_speakers number(optional) Number of speakers diarization in 1-4 speakers |
word_timestamps string(optional) true or false // default 'false' |
speaking_rate string(optional) true or false // default 'false' |
decoder_type string(optional) Greedy, BeamSearch, or LMBeamSearch // default 'LMBeamSearch' |
word_list string(optional) List of terminology ex. ['word', 'word', ...] |
Responses | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
200 Return list of transcribed text that corresponds to speech detected in audio.
| ||||||||||||||||
204 No content | No result of transcription | ||||||||||||||||
400 No audio file | Not found audio file or Bad requests | Server cannot or will not process the request | ||||||||||||||||
401 Unauthorized | Incorrect X-API-Key or X-API-Key not have access to this model | ||||||||||||||||
415 Can't decode [filename] | Unsupported file format |