What does Speechmatics do?

Speechmatics provides speech technology and Voice AI for enterprises, offering accurate Speech-to-Text, Text-to-Speech, and Voice Agent solutions. Our models understand every voice and accent across 55+ languages, helping businesses unlock the full potential of voice data.

How accurate is Speechmatics Speech-to-Text?

Speechmatics delivers best-in-market accuracy, achieving up to 99% word accuracy and 96% medical keyword recall in industry benchmarks. Our models handle multiple accents, noisy environments, and multi speakers with ease.

What makes Speechmatics Text-to-Speech different?

Our low-latency Text-to-Speech (TTS) delivers lifelike, human-sounding voices with sub-150ms latency that is ideal for real-time conversations. Developers can stream natural speech in multiple voices and deploy it in the cloud, hybrid, or on-prem for privacy and control.

Can I build real-time voice agents with Speechmatics?

Our voice AI enables developers to build real-time voice agents that listen, understand, and respond naturally. Plug in fast with a flexible API and native integrations to power your AI voice agents.

Which industries use Speechmatics?

Speechmatics is trusted by organizations in media, healthcare, contact center, finance, education, and accessibility. Our technology powers transcription, translation, call analytics, and voice AI applications worldwide.

Speechmatics launches industry-first Global Spanish language pack for automatic speech-to-text transcription at scale

Testing found that for all variations of Spanish, Speechmatics’ GS language pack was the best option in every instance.

Speechmatics, a UK leader in any-context speech recognition technology, today launches the first Global Spanish language pack on the market that supports all major Spanish accents.

Global Spanish (GS) is a single Spanish language pack trained on data drawn across a wide range of diverse sources – specifically those from Latin America – making it the most accurate and comprehensive accent-independent Spanish language pack for speech-to-text.

Compared directly, GS was between 3% and 20% more accurate than all Google’s Beta accent-specific language packs and between 4% and 13% more accurate than Microsoft’s Video Indexer accent-specific language packs. * Speechmatics utilized the latest advancements in machine learning and applied proprietary language training techniques to create the GS language pack. With Speechmatics’ GS model, businesses using speech recognition for Spanish voice data won’t have to jump between multiple language packs to optimize the accuracy of their transcriptions, reducing costs and streamlining operations.

The Speechmatics approach delivers a simpler user experience with no additional software or complex processes needed. Following the success of its Global English language pack, Global Spanish is fast, accurate, reliable and more flexible, convenient and inclusive.

Ian Firth, VP Products at Speechmatics comments:

“With Spanish being the second most natively spoken language across the world, it was time that businesses had access to an all-encompassing language pack to help streamline the transcription process and increase accuracy.
Our pack does just that by accommodating multiple accents, dialects and regional variations. By defying the industry convention of single accent-centric packs, Speechmatics is on a journey to make all languages truly accessible to businesses with our global language approach.”

Lee Worth, ASR & Live Captioning Operational Excellence Lead at Red Bee Media comments:

“At Red Bee Media, we regularly assess all ASR solutions to ensure our captioning services are built on the strongest possible foundations. Speechmatics’ engine has shown recent average improvements of 10% in English and 20% in Spanish, on top of already-excellent accuracy.
It’s clear Speechmatics have worked hard to increase their Global English and Spanish engines’ recognition of an increasing range of accents and dialects, which will enable us to further improve both our pre-recorded captioning workflows and our market-leading automated live captioning services.”

With approximately 500 million speakers globally, Spanish is the fourth most spoken language overall. Approximately 90% of Spanish spoken in the world is in the United States, Mexico, Central and South America, with the remaining 10% in Spain.

*Test sets comprised of almost 8.5 hours of diverse audio and transcribed text covering multiple use cases. Accented test files included variations in gender, age, region and ethnicity of speakers.

Want to see more content like this? Sign up for our newsletter!

Nov 12, 2020 | Read time 2 min

Speechmatics launches industry-first Global Spanish language pack for automatic speech-to-text transcription at scale

Testing found that for all variations of Spanish, Speechmatics’ GS language pack was the best option in every instance.

Latest Articles

What Word Error Rate Is Acceptable for Legal Transcription?

The court reporter shortage crisis: data, causes, and what legal teams are doing about it

Speechmatics achieves a world first in bilingual Voice AI with new Arabic–English model

Your voice agent speaks perfect Arabic. That's the problem.

How Nvidia Dominates the HuggingFace Leaderboards in This Key Metric

Why AI-native EHR platforms will treat speech as core infrastructure in 2026