Speech To Text

The API is highly scalable and supports a range of audio formats. User can choose between two primary API options based on the length of the audio input and the use case:

Synchronous API

Synchronous API returns the recognized text for short audio (less than 1 minutes).

To process a speech recognition request for audio longer than 1 minutes, use Asynchronous API.

Synchronous API

Asynchronous API

Asynchronous API starts a long running audio processing operation. Use asynchronous API to transcribe audio that is longer than 1 minutes. The upper limit for asynchronous API is 300 minutes.

For shorter audio, synchronous API is faster and simpler. The upper limit for asynchronous API is 300 minutes.

Asynchronous API

Speech To Text

Synchronous API​

Asynchronous API​

Synchronous API

Asynchronous API