Synchronous Speech to Text API
Header
Your API key
form-data body
Request Body
- files Filerequired
Audio raw files in a form of multi-part form data using the key name files.
The maximum size for each file is 50 MB.
The duration of each file should less than 1 minute.
optionalSend with the form of multi-part form data
Speaker file Maximum 5 files can be provided. Each file size must not exceed 20 MB.
Enhances recognition accuracy for specific words. Maximum 10 words can be provided. e.g., สวัสดี
- 200
Array [
object
success
| failed
The status of the transcription process.
The total duration of the audio file in seconds (e.g., 20.856)
.
File name
result Array [
object
The start time of the segment in the audio file, in seconds (e.g., 1.6382252559726962)
.
The end time of the segment in the audio file, in seconds (e.g., 1.6382252559726962)
.
The identifier of the speaker in SPEAKER_{number}
format
The transcribed text
[
{
"filename": "Record.wav",
"status": "success",
"result": [
{
"speaker": "SPEAKER_00",
"transcript": "วิสัย",
"start_time": 1.6382252559726962,
"end_time": 3.9761092150170647
},
{
"speaker": "SPEAKER_00",
"transcript": "บริษัทผู้พัฒนาแพลตฟอร์ม มีเป้าหมายหลักในการเป็นศูนย์กลางการให้บริการปัญญาประดิษฐ์",
"start_time": 4.658703071672355,
"end_time": 20.870307167235495
}
],
"duration": 20.856
}
]