What does Speechmatics do?

Speechmatics provides speech technology and Voice AI for enterprises, offering accurate Speech-to-Text, Text-to-Speech, and Voice Agent solutions. Our models understand every voice and accent across 56+ languages, helping businesses unlock the full potential of voice data.

How accurate is Speechmatics Speech-to-Text?

Speechmatics delivers best-in-market accuracy, achieving up to 99% word accuracy and 96% medical keyword recall in industry benchmarks. Our models handle multiple accents, noisy environments, and multi speakers with ease.

What makes Speechmatics Text-to-Speech different?

Our low-latency Text-to-Speech (TTS) delivers lifelike, human-sounding voices with sub-150ms latency that is ideal for real-time conversations. Developers can stream natural speech in multiple voices and deploy it in the cloud, hybrid, or on-prem for privacy and control.

Can I build real-time voice agents with Speechmatics?

Our voice AI enables developers to build real-time voice agents that listen, understand, and respond naturally. Plug in fast with a flexible API and native integrations to power your AI voice agents.

Which industries use Speechmatics?

Speechmatics is trusted by organizations in media, healthcare, contact center, medical, finance, legal, education, and accessibility. Our technology powers transcription, translation, call analytics, and voice AI applications worldwide.

Speechmatics sets new standard for real-time medical transcription with German and Nordic roll-out

Speechmatics today expanded its Medical Model with German, Danish and Norwegian, bringing the total language count to seven alongside its industry-leading English model.

The expansion, trained on over 2 billion words of medical data, delivers significant accuracy improvements and gives healthcare organizations deployment choice across on-premises, private cloud and SaaS infrastructure.

Each new language undergoes rigorous testing and optimization before release, ensuring the models can handle the demands that define real healthcare environments.

Trained on 2 billion words of medical data

The new language models are trained on over 2 billion words of medical conversations, clinical documentation and healthcare interactions, adding to the 14 billion words of medical data in Speechmatics' existing models.

This training scale enables the models to understand the complexity of real clinical environments: rapid multi-speaker dialogue, medical abbreviations, drug dosages, and diverse accents.

That scale enables the models to handle what generic speech recognition systems miss: the difference between "hypertension" and "hypotension" in a noisy emergency room, a pharmaceutical name spoken with a regional accent, or overlapping speech between clinician and patient during a consultation.

The result is accuracy that changes clinical workflows.

Accuracy improvements for new language additions

The three new models demonstrate substantial Key Word Error Rate (KWER) reductions using our specially tailored medical keyword test set. This test set was designed to evaluate our models on challenging terminology, across a broad range of scenarios.

On average, Speechmatics has improved Word Error Rate (WER) on medical test sets by around 30–50% across German, Danish and Norwegian compared with previous Speechmatics models. The new models are also around 5–20% lower in word error rate than the closest evaluated competitor on medical test sets for most languages.

German shows one of the most notable uplifts, with error rates reduced by roughly a third versus Speechmatics' previous German Enhanced model on internal medical tests. That improvement is critical in a language dense with compound terms and specialist vocabulary, where a single misplaced token can change clinical meaning.

These numbers position Speechmatics ahead of evaluated competitors on medical test sets, with the German Medical Model showing particularly strong performance.

These accuracy gains translate directly to fewer corrections, cleaner EHR integration and reduced friction in patient interactions.

Across our newest medical models, our medical Keyword Error Rates (KWER) performance:

Language	KWER
German	5.43
Danish	6.17
Norwegian	8.02

Nordic expansion strengthens regional coverage

The addition of Danish and Norwegian builds on Speechmatics' Nordic medical coverage alongside Finnish, enabling providers across the region to standardize on a single Voice AI platform while working in their native languages.

The Nordic healthcare market is moving fast on Voice AI adoption, and they expect technology that works without compromise. That requires the rigorous testing and optimization that made our English Medical Model the industry benchmark. We don't compromise on quality when we add new languages.
Yahia Abaza, Product Manager, Speechmatics

Deployment flexibility: on your infrastructure and terms

The new multilingual Medical Model is available across on-premises, private cloud and SaaS infrastructure, giving healthcare organizations the flexibility to choose the deployment model that fits their compliance requirements, IT infrastructure and operational priorities.

This flexibility has proven critical for Speechmatics' expanding global medical client base. Whether a healthcare provider in Germany needs on-premises deployment for data residency, a telehealth platform in Spain wants private cloud, or an AI scribe company in the Netherlands prefers cloud-native SaaS, organizations can adopt Voice AI without compromising on either performance or their specific regulatory and operational requirements.

Real-time first, built for the pace of care

Real-time performance is at the center of the release. The medical models are designed to power live ambient scribes, telehealth, clinical contact centers and in-room assistants without forcing developers to trade accuracy for latency.

Above typical real-time latency thresholds, the models remain close to batch accuracy, and under one second they perform strongly compared with competing systems. That allows clinicians to see transcripts and summaries emerge as they speak, while back-office workflows can use the same models for high-volume file processing.

Ambient AI only helps if it keeps up with real clinical conversations. We built these models for fast, overlapping dialogue, non-native speakers, accents and imperfect audio, not just clean test clips. Real-time is the default use case, not an afterthought.
Stuart Wood, Senior Product Manager, Speechmatics

Built on proven industry-leading clinical accuracy

The seven language models build on the foundation of Speechmatics' English Medical Model, which set industry benchmarks in September 2025: 93% general real-time accuracy (7% WER), 96% medical keyword recall, and a keyword error rate 50% lower than the nearest competitor.

All models are optimized using NVIDIA GPU infrastructure, delivering the same level of performance across languages and handling the full complexity of clinical environments. Whether processing real-time ambient scribes or high-volume batch transcription, the models maintain consistent accuracy without forcing organizations to choose between speed and precision.

Availability

The expanded Medical Model with German, Danish and Norwegian is now available for production usage. Access is available through:

Speechmatics Portal for direct testing and evaluation
API integration for production deployment across real-time and batch workflows
On-premises and private cloud deployment for regulated healthcare environments

Healthcare technology partners can begin testing today. For more information, to provide feedback, or to schedule a technical demo, visit Speechmatics’ website or contact the team directly.

Dec 16, 2025 | Read time 6 min

Speechmatics sets new standard for real-time medical transcription with German and Nordic roll-out

TL;DR — Key Takeaways:

Trained on 2 billion words of medical data

Accuracy improvements for new language additions

Nordic expansion strengthens regional coverage

Deployment flexibility: on your infrastructure and terms

Real-time first, built for the pace of care

Built on proven industry-leading clinical accuracy

Availability

Read also

Related Articles

What’s next for ambient scribes? Healthcare's chaos zones

Speechmatics Medical Model launches in Spanish

What is Ambient AI? How Voice-First Tech is Transforming Healthcare

Latest Articles

Stenograph and Speechmatics Announce Industry-First On-Device Integration for CATalyst VP

Speaker Focus: Fixing Voice AI for the real world

From a Parked Side Project to 30 Teams Running Real Sales Calls on Speechmatics

A Simpler Way to Pay: Speechmatics Is Moving to Credits

Dutch doctors spend a quarter of their day on admin. Wellcom has built the fix.

A Practical Guide to Building Voice AI Applications With Real-Time Transcription in 2026