Synchronous Speech to Text API
Header
Your API key
form-data body
Request Body
- files Filerequired
Audio raw files in a form of multi-part form data using the key name files.
The maximum size for each file is 50 MB.
The duration of each file should less than 1 minute.
 
optionalSend with the form of multi-part form data
Speaker file Maximum 5 files can be provided. Each file size must not exceed 20 MB.
Enhances recognition accuracy for specific words. Maximum 10 words can be provided. e.g., สวัสดี
- 200
 
Array [
object
success | failed
The status of the transcription process.
The total duration of the audio file in seconds (e.g., 20.856).
File name
result Array [
object
The start time of the segment in the audio file, in seconds (e.g., 1.6382252559726962).
The end time of the segment in the audio file, in seconds (e.g., 1.6382252559726962).
The identifier of the speaker in SPEAKER_{number} format
The transcribed text
[
  {
    "filename": "Record.wav",
    "status": "success",
    "result": [
      {
        "speaker": "SPEAKER_00",
        "transcript": "วิสัย",
        "start_time": 1.6382252559726962,
        "end_time": 3.9761092150170647
      },
      {
        "speaker": "SPEAKER_00",
        "transcript": "บริษัทผู้พัฒนาแพลตฟอร์ม มีเป้าหมายหลักในการเป็นศูนย์กลางการให้บริการปัญญาประดิษฐ์",
        "start_time": 4.658703071672355,
        "end_time": 20.870307167235495
      }
    ],
    "duration": 20.856
  }
]