Speech to Text AI Marketplace
AI can transcribe speech into text, supporting multiple file types such as mp3, wav, and flac.
| Header Parameters |
|---|
X-API-Key string — REQUIREDYour API key |
| Request Body |
|---|
files binary — REQUIREDAudio speech file |
num_speakers number(optional) Number of speakers diarization in 1-4 speakers |
word_timestamps string(optional) true or false // default 'false' |
speaking_rate string(optional) true or false // default 'false' |
decoder_type string(optional) Greedy, BeamSearch, or LMBeamSearch // default 'LMBeamSearch' |
word_list string(optional) List of terminology ex. ['word', 'word', ...] |
| Responses | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
200Return list of transcribed text that corresponds to speech detected in audio.
| ||||||||||||||||
204No content | No result of transcription | ||||||||||||||||
400No audio file | Not found audio file or Bad requests | Server cannot or will not process the request | ||||||||||||||||
401Unauthorized | Incorrect X-API-Key or X-API-Key not have access to this model | ||||||||||||||||
415Can't decode [filename] | Unsupported file format |