Our speech-to-text API is quick and easy to use. We give you all the tools you need to deploy, operate, and manage our technology.
1import requests
2
3url = "https://asr.api.speechmatics.com/v2/jobs"
4payload = {'config': '{"type": "transcription", "transcription_config": {"language": "en"}}'}
5files = [('data_file', ('file', open('/path/to/file', 'rb'), 'application/octet-stream'))]
6headers = {'Authorization': 'Bearer insert-api-key-here'}
7
8response = requests.request("POST", url, headers=headers, data=payload, files=files)
9
10print(response.text)
Our unique speech-to-text API enables businesses to accurately transcribe human-level speech into text, regardless of demographic, age, gender, dialect or location.
Our technology covers the majority of the global population's native languages with unsurpassed accuracy. When you combine this precision with our accent and dialect robustness, you know you’re getting the best speech-to-text out there.
By using the latest machine learning approaches, we're constantly finding new ways to better understand human speech. Automatic Language Identification and Speaker Labels are just some of the many features we offer.
The fastest way to get started is to use our cloud service. If you want security-sensitive data to remain within your own environment, no problem. We offer on-premises deployment. Since we have common APIs, you can also pick a hybrid deployment option. Whatever you choose, rest assured that we provide a secure, robust service.
Key takeaways from Whisper and what it means for future AI research.
Turn a single node training setup into a robust, platform agnostic, multinodal one.
What the alternatives to WER are, as well as arguing a case for more useful approaches.
Sign up to our free speech-to-text SaaS Portal and we’ll guide you through the integration of our API. We pride ourselves on offering the best support for your business needs. If you have any questions, just ask.