Apr 18, 2023 | Read time 3 min

Speechmatics to launch pioneering real-time speech translation capabilities in 69 language pairs

Speechmatics, the leading speech recognition technology scaleup, unveils plans to amplify its real-time transcription capabilities by providing real-time speech translation in an all-in-one API.
Real-Time Translation
Speechmatics
SpeechmaticsEditorial team

This offering will integrate real-time translation with its industry-beating real-time transcription in an all-in-one API  Speechmatics, the leading speech recognition technology scaleup, unveils plans to amplify its real-time transcription capabilities by providing real-time translation in an all-in-one API. Breaking down language barriers enables more people to consume content regardless of industry and unlocks the ability to automatically translate live content from multiple regions. This combined offering enables customers to use the world’s most accurate speech-to-text engine and translate speech for 69 language pairs*.

Real-time translation follows on a month from Speechmatics’ launch of Ursa – the world’s most accurate speech-to-text engine, which is 25% more accurate than OpenAI’s Whisper and 38% more accurate than Google. Speechmatics has doubled down on these capabilities to develop real-time translation, offering language pairs to and from English*, including German, Spanish, and Vietnamese. The all-in-one API can also translate multiple languages in one request – for example, a single audio stream can provide real-time English transcription and translation to Japanese, French, Hindi, Mandarin, and Korean simultaneously.

Speechmatics’ real-time transcription and now translation delivers the same level of accuracy as its pre-recorded (batch) service, as well as providing a sliding scale to enable customers to tailor the speed (latency) and/or accuracy to meet their needs. The all-in-one API streamlines processes and speeds up workflows for businesses by combining real-time transcription and translation in one API.

Businesses can reach a wider geographical audience across multiple industries where translating in real-time has previously been a challenging and costly task when completed manually by humans. Particularly for the broadcast industry – valued at over $300 billion in the US alone in 2022 – generating quick and highly accurate translated speech in one API unlocks the ability to caption live stream content and news for viewers from around the world. Similarly, for contact centres where scale is essential, contact centres can scale operations to handle multiple languages using cost-effective automation technology and offer improved customer experiences in native languages.

Damir Derd, Head of Sales Engineering at Speechmatics, said, “This is a landmark development for speech recognition technology, and we are proud to remain at the forefront of innovation, demonstrating the commitment to our mission to understand every voice. This new offering opens up a truly global market for our customers with almost instant translation from the spoken word. As demand from viewers in different regions increases for TV shows and broadcast, sports, events, podcasts, game streaming, YouTube and social media videos, the need for captioned videos in multiple languages has too. We are excited to launch this capability to our customers in the next few weeks and will be continuing to work towards adding even more languages and enabling the engine to translate between languages, so the default isn’t always English.”

Ken Frommert, President of ENCO, said, “Speechmatics provides the most accurate speech-to-text on the market for pre-recorded files and live streams. Adding real-time translation to its all-in-one API is game-changing for live broadcast captions. The ability to not only transcribe but now leverage Speechmatics to translate in real-time to provide highly accurate captions globally.”

Real-time translation will be demoed at NAB Show, Booth N2960, 16th - 19th April 2023, and will be launching later this month. Sign up for free early access and early bird offer here. 

*Also includes Bokmål > Nynorsk language pair.

Latest Articles

Carousel slide image
Technical

How to build a microbatching workflow with the Speechmatics API

Build a cleaner path between batch and real time. Learn when micro-batching makes sense, how to chunk audio, submit jobs, stitch JSON, and scale safely with the Speechmatics API.

Speechmatics
SpeechmaticsEditorial Team
Carousel slide image
Product

Alphanumeric speech recognition: why voice assistants mangle SKUs (and how to fix it)

A guide for voice AI engineers, ecommerce platforms and warehouse teams on SKU recognition accuracy voice assistant deployments depend on: why speech recognition systems produce transcription errors on product codes, what to measure when error rates matter, and the fixes that move the needle on order picking, voice ordering and customer-facing voice AI.

Speechmatics
SpeechmaticsEditorial Team
Carousel slide image
Technical

The Adobe story: How we made cloud-grade AI work on your laptop

Behind the build: what it takes to make cloud-grade speech recognition work inside Adobe Premiere, and why Whisper raised the stakes.

Andrew Innes
Andrew InnesChief Architect
Carousel slide image
Company

Adobe and Speechmatics deliver cloud-grade speech recognition on-device for Premiere

Adobe Premiere users can run the most accurate on-device transcription locally; efficient enough for a laptop, powerful enough for professional work.

Speechmatics
SpeechmaticsEditorial Team
Carousel slide image
Use Cases

Best speech-to-text AI guide: APIs, platforms and services compared

Speech-to-text has moved from novelty to enterprise infrastructure. Here's how the leading platforms stack up in 2026 — and how to pick the right one.

Tom Young
Tom YoungDigital Specialist
Speechmatics x Thymia combine medical-grade speech-to-text with clinical-grade voice biomarker intelligence to identify health signals.
News

AI can now understand health signals from 15 seconds of your voice, including fatigue, stress and type 2 diabetes

The joint platform returns transcription and health signals in real time, with no additional hardware required.

Speechmatics
SpeechmaticsEditorial Team