Below you’ll find answers to some commonly asked questions. If you can’t see an answer to your question, get in touch.
For the best possible results, try to avoid file formats that use compression.
For a full list of features including file input formats, please see our product sheets.
Other factors that can affect results include multiple languages in a single audio file.
Check out the product sheets to see the various deployment options available.
Alignment is only available on our SaaS.
Which of our solutions is the best fit depends on your use case. For example, for real-time captioning you may want to consider the real-time engine, whereas batch processing is better suited to transcribing an archive of data.
Our batch ASR will transcribe a file with a real-time factor (RTF) of 0.5, meaning processing takes up to half the duration of the audio. For example, a 10-minute file will be transcribed within 5 minutes.
We cannot guarantee an RTF of 0.5 for files shorter than 3 minutes.
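As a rough illustration of the arithmetic above, the short Python sketch below estimates the turnaround implied by an RTF of 0.5. The function and constant names are hypothetical, chosen only for this example, and are not part of any Speechmatics API.

```python
# Illustrative only: these helpers are hypothetical, not a Speechmatics API.
BATCH_RTF = 0.5              # real-time factor quoted above
MIN_GUARANTEED_MINUTES = 3   # below this length the RTF is not guaranteed


def expected_max_minutes(audio_minutes: float, rtf: float = BATCH_RTF) -> float:
    """Processing time implied by the RTF: audio duration multiplied by RTF."""
    return audio_minutes * rtf


def rtf_guaranteed(audio_minutes: float) -> bool:
    """True when the file is long enough for the quoted RTF to apply."""
    return audio_minutes >= MIN_GUARANTEED_MINUTES


print(expected_max_minutes(10))  # 5.0
print(rtf_guaranteed(2))         # False
```

A 10-minute file gives 10 × 0.5 = 5 minutes, matching the example above, while a 2-minute file falls below the 3-minute threshold, so the estimate should be treated as indicative only.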
Unless agreed otherwise with Speechmatics, the following behaviour will be considered acceptable use of the Cloud Services ASR (for V2 customers only). Speechmatics reserve the right to change the rate limits at any time in order to ensure continuity of service for all customers of the Cloud.
a. The Customer shall limit the rate of submission of files to a maximum of 2 jobs per second with a maximum of 600 jobs in progress at any one time.
b. The Customer shall limit the rate of polling for the status of submitted jobs to a maximum of 20 queries per second (across all jobs).
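One way a client might stay within the limits in (a) and (b) is to pace its own requests. The following is a minimal Python sketch under stated assumptions: the class `MinIntervalLimiter`, the helper `can_submit`, and the constant names are hypothetical, and the code makes no calls to the Cloud Services API. It spaces calls evenly, which is a simple and conservative pacing strategy and not a method prescribed by Speechmatics.

```python
import time

# Limits taken from items (a) and (b) above; names are illustrative.
MAX_SUBMITS_PER_SECOND = 2
MAX_JOBS_IN_PROGRESS = 600
MAX_POLLS_PER_SECOND = 20


class MinIntervalLimiter:
    """Spaces calls at least 1/max_per_second seconds apart."""

    def __init__(self, max_per_second, clock=time.monotonic, sleep=time.sleep):
        self.interval = 1.0 / max_per_second
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = clock()

    def wait(self):
        """Block until the next call is allowed; return the seconds waited."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay > 0:
            self._sleep(delay)
        # Schedule the following slot one interval after this one.
        self._next_allowed = max(now, self._next_allowed) + self.interval
        return delay


def can_submit(jobs_in_progress, limit=MAX_JOBS_IN_PROGRESS):
    """True while the number of jobs in progress is below the cap."""
    return jobs_in_progress < limit


submit_limiter = MinIntervalLimiter(MAX_SUBMITS_PER_SECOND)
poll_limiter = MinIntervalLimiter(MAX_POLLS_PER_SECOND)
```

In use, a client would call `submit_limiter.wait()` before each job submission (after checking `can_submit` against its own count of unfinished jobs) and `poll_limiter.wait()` before each status query, sharing one `poll_limiter` across all jobs because the polling limit applies across all of them.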