What does Speechmatics do?

Speechmatics provides speech technology and Voice AI for enterprises, offering accurate Speech-to-Text, Text-to-Speech, and Voice Agent solutions. Our models understand every voice and accent across 56+ languages, helping businesses unlock the full potential of voice data.

How accurate is Speechmatics Speech-to-Text?

Speechmatics delivers best-in-market accuracy, achieving up to 99% word accuracy and 96% medical keyword recall in industry benchmarks. Our models handle multiple accents, noisy environments, and multi speakers with ease.

What makes Speechmatics Text-to-Speech different?

Our low-latency Text-to-Speech (TTS) delivers lifelike, human-sounding voices with sub-150ms latency that is ideal for real-time conversations. Developers can stream natural speech in multiple voices and deploy it in the cloud, hybrid, or on-prem for privacy and control.

Can I build real-time voice agents with Speechmatics?

Our voice AI enables developers to build real-time voice agents that listen, understand, and respond naturally. Plug in fast with a flexible API and native integrations to power your AI voice agents.

Which industries use Speechmatics?

Speechmatics is trusted by organizations in media, healthcare, contact center, medical, finance, legal, education, and accessibility. Our technology powers transcription, translation, call analytics, and voice AI applications worldwide.

Speechmatics and Sully.ai partner to scale healthcare AI infrastructure globally

Cambridge, UK — 12 January 2025 — Speechmatics, the voice AI company on a mission to understand every voice, today announced a strategic partnership with Sully.ai to power the next generation of autonomous healthcare agents and scribes.

Built on NVIDIA AI infrastructure, the collaboration combines best in class medical-grade speech models with autonomous agent workflows to deliver AI receptionists and clinical scribes that handle real operational tasks and deliver tangible ROI across clinical settings.

The partnership arrives as global healthcare faces acute staffing shortages and mounting administrative costs. Sully, which scaled from single-doctor clinics to enterprise customers with 500+ providers in under a year, is already driving significant efficiency back into healthcare.

Their north star metric of Minutes Added to Workforce (MAW) measures how agentic AI drives efficiency within healthcare use cases. As of December 2025, Sully has added more than 30 million minutes back to the healthcare workforce and current customers include Oshi Health, Tebra and Midi.

21x ROI: Autonomous agents delivering measurable workforce impact, soon to benefit more patients around the globe

Sully's partners are seeing 21x ROI in early case studies with autonomous operating systems capitalizing on a suite of multiple agents working in coordination.

Typical ROIs include: 5%+ increase in patient retention, 2.4+ hours time saving per physician per day and 18.5% increase in capacity for patient appointments.

Building upon these early positive signals, Sully is seeing rapid expansion across large, multi-site provider networks, hence the requirement for Speechmatics - highly accurate, medical grade speech models that scale across different clinical environments.

Sully.ai selected Speechmatics after extensive internal testing across multiple speech model providers. Evaluation focused on clinical accuracy, handling ambiguous medical pronunciations, and real-time performance in noisy environments.

Speechmatics' English Medical Model set benchmark results in 2025, delivering 93% general real-time accuracy (7% WER) and 96% medical keyword recall, with a medical keyword error rate 50% lower than the nearest competitor. These gains reduce corrections, improve downstream documentation quality, and support more reliable automation in patient access and clinical workflows.

"We needed best-in-class speech models that work in real clinical environments: complex medical terminology, fast overlapping dialogue, accents, imperfect audio, not just clean test clips. Speechmatics has been the most responsive provider with solutioning for us, and we've seen them handle medications better on our troublesome audio than any competitor." — Ahmed Omar, Founder and CEO, Sully.ai

Speechmatics and Sully.ai plan to expand into new global markets including the Middle East following the launch of an English-Arabic bilingual model in early 2026. Bilingual, code-switching conversations are expected to be a defining requirement for voice automation in care delivery and patient access within this region.

Speechmatics' Arabic capabilities are designed to perform across Modern Standard Arabic as well as Egyptian, Gulf, and Levantine dialects, supporting consistent performance across varied speakers and accents.

Trained on 16 billion words of real medical conversations

Trained on over 16 billion words of medical conversations, clinical documentation, and healthcare interactions, the models deliver keyword error rates 5-20% lower than evaluated competitors on medical test sets across most languages.

This training scale enables the models to distinguish between "hypertension" and "hypotension" in noisy emergency rooms, understand pharmaceutical names with regional accents, handle overlapping clinician-patient speech, and parse medical abbreviations, drug dosages, and ICD-10 codes, all while maintaining near-batch accuracy at sub-second latency.

"High-accuracy, low-latency speech recognition is a core enabler for autonomous agents that 'actually listen' and operate safely in mission-critical environments. Together with Sully.ai, we're enabling healthcare organizations to deploy ambient scribes and AI agents across multiple languages and global markets without compromising on quality, security, or speed."
— Katy Wigdahl, CEO, Speechmatics

Built on NVIDIA, deployed on healthcare's terms

The model is deployed using NVIDIA AI infrastructure, optimized through NVIDIA Triton Inference Server and NVIDIA CUDA libraries. This enables high-throughput, low-latency processing at scale, with deployment flexibility across data center, private cloud, and edge environments. This architecture supports rapid customization and horizontal scaling for diverse healthcare deployments: from telehealth platforms and contact centers to EHR-connected scribes and bedside tools.

Unlike cloud-only competitors, Speechmatics supports on-premises, private cloud, and SaaS deployment, critical for organizations navigating data residency requirements, HIPAA compliance, and regulatory frameworks. This flexibility allows enterprises to keep sensitive patient data within their own infrastructure while accessing state-of-the-art speech technology.

About Speechmatics

Speechmatics is the voice AI company on a mission to understand every voice. The company's speech-to-text technology delivers industry-leading accuracy across 56+ languages, with specialized medical models trained on over 16 billion words of clinical data. Speechmatics' real-time and batch transcription APIs power applications for healthcare, media, contact centers, and voice agent organizations worldwide, with customers including AI Media, Content Guru and boost.ai. Founded in Cambridge, UK, Speechmatics has deployment options spanning on-premises, private cloud, SaaS infrastructure and On-Device. Learn more at www.speechmatics.com.

About Sully.ai

Sully.ai is transforming healthcare operations with autonomous AI agents that handle mission-critical workflows across multi-doctor practices and large provider networks. The company's full suite (from voice AI receptionists to clinical scribes) is built on an agentic operating system designed for the complexity of real healthcare environments. Founded by a team bringing together medical expertise, technical founders, and entrepreneurial leadership, Sully scaled from single-doctor clinics to enterprise customers with 500+ providers in under a year. Learn more at www.sully.ai.

Media Contact

Mieke Smith Content Lead, Speechmatics mieke.smith@speechmatics.com +44 7713 014319

Jan 12, 2026 | Read time 4 min

Speechmatics and Sully.ai partner to scale healthcare AI infrastructure globally

21x ROI: Autonomous agents delivering measurable workforce impact, soon to benefit more patients around the globe

Trained on 16 billion words of real medical conversations

Built on NVIDIA, deployed on healthcare's terms

About Speechmatics

About Sully.ai

Media Contact

Read also

Related Articles

What’s next for ambient scribes? Healthcare's chaos zones

Speechmatics sets new standard for real-time medical transcription with German and Nordic roll-out

The next generation of AI medical speech recognition is here

Latest Articles

Speechmatics launches Medical Model for real-time clinical transcription

Speaker Focus: Fixing Voice AI for the real world

Stenograph and Speechmatics Announce Industry-First On-Device Integration for CATalyst VP

A Simpler Way to Pay: Speechmatics Is Moving to Credits

From a Parked Side Project to 30 Teams Running Real Sales Calls on Speechmatics

Dutch doctors spend a quarter of their day on admin. Wellcom has built the fix.