Dec 16, 2025 | Read time 6 min

Speechmatics sets new standard for real-time medical transcription with German and Nordic roll-out

New German, Danish and Norwegian Medical Models deliver up to 50% lower error rates, real-time accuracy under one second, and full deployment flexibility.
New real-time medical transcription with German and Nordic roll-out
Speechmatics
SpeechmaticsEditorial Team

TL;DR — Key Takeaways:

  • Launched: Medical Models for German, Danish, Norwegian (now 7 languages total).

  • Impact: Up to 50% lower error rates on medical speech; built for real-time use.

  • Deploy anywhere: SaaS, private cloud, or on-prem for regulated healthcare.

Speechmatics today expanded its Medical Model with German, Danish and Norwegian, bringing the total language count to seven alongside its industry-leading English model.

The expansion, trained on over 2 billion words of medical data, delivers significant accuracy improvements and gives healthcare organizations deployment choice across on-premises, private cloud and SaaS infrastructure.

Each new language undergoes rigorous testing and optimization before release, ensuring the models can handle the demands that define real healthcare environments.

Trained on 2 billion words of medical data

The new language models are trained on over 2 billion words of medical conversations, clinical documentation and healthcare interactions, adding to the 14 billion words of medical data in Speechmatics' existing models. 

This training scale enables the models to understand the complexity of real clinical environments: rapid multi-speaker dialogue, medical abbreviations, drug dosages, and diverse accents.

That scale enables the models to handle what generic speech recognition systems miss: the difference between "hypertension" and "hypotension" in a noisy emergency room, a pharmaceutical name spoken with a regional accent, or overlapping speech between clinician and patient during a consultation.

The result is accuracy that changes clinical workflows.

Accuracy improvements for new language additions

The three new models demonstrate substantial Key Word Error Rate (KWER) reductions using our specially tailored medical keyword test set. This test set was designed to evaluate our models on challenging terminology, across a broad range of scenarios. 

On average, Speechmatics has improved Word Error Rate (WER) on medical test sets by around 30–50% across German, Danish and Norwegian compared with previous Speechmatics models. The new models are also around 5–20% lower in word error rate than the closest evaluated competitor on medical test sets for most languages.

German shows one of the most notable uplifts, with error rates reduced by roughly a third versus Speechmatics' previous German Enhanced model on internal medical tests. That improvement is critical in a language dense with compound terms and specialist vocabulary, where a single misplaced token can change clinical meaning.

These numbers position Speechmatics ahead of evaluated competitors on medical test sets, with the German Medical Model showing particularly strong performance. 

These accuracy gains translate directly to fewer corrections, cleaner EHR integration and reduced friction in patient interactions. 

Across our newest medical models, our medical Keyword Error Rates (KWER) performance:

Language

KWER

German

5.43

Danish

6.17

Norwegian

8.02

Nordic expansion strengthens regional coverage

The addition of Danish and Norwegian builds on Speechmatics' Nordic medical coverage alongside Finnish, enabling providers across the region to standardize on a single Voice AI platform while working in their native languages.

The Nordic healthcare market is moving fast on Voice AI adoption, and they expect technology that works without compromise. That requires the rigorous testing and optimization that made our English Medical Model the industry benchmark. We don't compromise on quality when we add new languages.

Yahia Abaza, Product Manager, Speechmatics

Deployment flexibility: on your infrastructure and terms

The new multilingual Medical Model is available across on-premises, private cloud and SaaS infrastructure, giving healthcare organizations the flexibility to choose the deployment model that fits their compliance requirements, IT infrastructure and operational priorities.

This flexibility has proven critical for Speechmatics' expanding global medical client base. Whether a healthcare provider in Germany needs on-premises deployment for data residency, a telehealth platform in Spain wants private cloud, or an AI scribe company in the Netherlands prefers cloud-native SaaS, organizations can adopt Voice AI without compromising on either performance or their specific regulatory and operational requirements.

Real-time first, built for the pace of care

Real-time performance is at the center of the release. The medical models are designed to power live ambient scribes, telehealth, clinical contact centers and in-room assistants without forcing developers to trade accuracy for latency.

Above typical real-time latency thresholds, the models remain close to batch accuracy, and under one second they perform strongly compared with competing systems. That allows clinicians to see transcripts and summaries emerge as they speak, while back-office workflows can use the same models for high-volume file processing.

Ambient AI only helps if it keeps up with real clinical conversations. We built these models for fast, overlapping dialogue, non-native speakers, accents and imperfect audio, not just clean test clips. Real-time is the default use case, not an afterthought.

Stuart Wood, Senior Product Manager, Speechmatics

Built on proven industry-leading clinical accuracy

The seven language models build on the foundation of Speechmatics' English Medical Model, which set industry benchmarks in September 2025: 93% general real-time accuracy (7% WER), 96% medical keyword recall, and a keyword error rate 50% lower than the nearest competitor.

All models are optimized using NVIDIA GPU infrastructure, delivering the same level of performance across languages and handling the full complexity of clinical environments. Whether processing real-time ambient scribes or high-volume batch transcription, the models maintain consistent accuracy without forcing organizations to choose between speed and precision.

Availability

The expanded Medical Model with German, Danish and Norwegian is now available for production usage. Access is available through:

  • Speechmatics Portal for direct testing and evaluation

  • API integration for production deployment across real-time and batch workflows

  • On-premises and private cloud deployment for regulated healthcare environments

Healthcare technology partners can begin testing today. For more information, to provide feedback, or to schedule a technical demo, visit Speechmatics’ website or contact the team directly.

Latest Articles

Carousel slide image
Product

Alphanumeric speech recognition: why voice assistants mangle SKUs (and how to fix it)

A guide for voice AI engineers, ecommerce platforms and warehouse teams on SKU recognition accuracy voice assistant deployments depend on: why speech recognition systems produce transcription errors on product codes, what to measure when error rates matter, and the fixes that move the needle on order picking, voice ordering and customer-facing voice AI.

Speechmatics
SpeechmaticsEditorial Team
Carousel slide image
Technical

The Adobe story: How we made cloud-grade AI work on your laptop

Behind the build: what it takes to make cloud-grade speech recognition work inside Adobe Premiere, and why Whisper raised the stakes.

Andrew Innes
Andrew InnesChief Architect
Carousel slide image
Company

Adobe and Speechmatics deliver cloud-grade speech recognition on-device for Premiere

Adobe Premiere users can run the most accurate on-device transcription locally; efficient enough for a laptop, powerful enough for professional work.

Speechmatics
SpeechmaticsEditorial Team
Carousel slide image
Use Cases

Best speech-to-text AI guide: APIs, platforms and services compared

Speech-to-text has moved from novelty to enterprise infrastructure. Here's how the leading platforms stack up in 2026 — and how to pick the right one.

Tom Young
Tom YoungDigital Specialist
Speechmatics x Thymia combine medical-grade speech-to-text with clinical-grade voice biomarker intelligence to identify health signals.
News

AI can now understand health signals from 15 seconds of your voice, including fatigue, stress and type 2 diabetes

The joint platform returns transcription and health signals in real time, with no additional hardware required.

Speechmatics
SpeechmaticsEditorial Team
[alt: Concentric circles radiate outward from a central orange icon with a white Speechmatics logo. The background is dark blue, enhancing the orange glow. A thin green line runs horizontally across the lower part of the image.]
Technical

Speed you can trust: The STT metrics that matter for voice agents

What “fast” actually means for voice agents — and why Pipecat’s TTFS + semantic accuracy is the clearest benchmark we’ve seen.

Archie McMullan
Archie McMullanSpeechmatics Graduate