Nov 12, 2020 | Read time 2 min

Speechmatics launches industry-first Global Spanish language pack for automatic speech-to-text transcription at scale

Speechmatics’ Global Spanish speech recognition lanaguage pack rivals competitors in accuracy across a range of accents and dialects.
languages 4
Speechmatics
SpeechmaticsEditorial team
Testing found that for all variations of Spanish, Speechmatics’ GS language pack was the best option in every instance.

Speechmatics, a UK leader in any-context speech recognition technology, today launches the first Global Spanish language pack on the market that supports all major Spanish accents.

Global Spanish (GS) is a single Spanish language pack trained on data drawn across a wide range of diverse sources – specifically those from Latin America – making it the most accurate and comprehensive accent-independent Spanish language pack for speech-to-text.

Compared directly, GS was between 3% and 20% more accurate than all Google’s Beta accent-specific language packs and between 4% and 13% more accurate than Microsoft’s Video Indexer accent-specific language packs. * Speechmatics utilized the latest advancements in machine learning and applied proprietary language training techniques to create the GS language pack. With Speechmatics’ GS model, businesses using speech recognition for Spanish voice data won’t have to jump between multiple language packs to optimize the accuracy of their transcriptions, reducing costs and streamlining operations.

The Speechmatics approach delivers a simpler user experience with no additional software or complex processes needed. Following the success of its Global English language pack, Global Spanish is fast, accurate, reliable and more flexible, convenient and inclusive.

Ian Firth, VP Products at Speechmatics comments:

“With Spanish being the second most natively spoken language across the world, it was time that businesses had access to an all-encompassing language pack to help streamline the transcription process and increase accuracy.

Our pack does just that by accommodating multiple accents, dialects and regional variations. By defying the industry convention of single accent-centric packs, Speechmatics is on a journey to make all languages truly accessible to businesses with our global language approach.”

Lee Worth, ASR & Live Captioning Operational Excellence Lead at Red Bee Media comments:

“At Red Bee Media, we regularly assess all ASR solutions to ensure our captioning services are built on the strongest possible foundations. Speechmatics’ engine has shown recent average improvements of 10% in English and 20% in Spanish, on top of already-excellent accuracy.

It’s clear Speechmatics have worked hard to increase their Global English and Spanish engines’ recognition of an increasing range of accents and dialects, which will enable us to further improve both our pre-recorded captioning workflows and our market-leading automated live captioning services.”

With approximately 500 million speakers globally, Spanish is the fourth most spoken language overall. Approximately 90% of Spanish spoken in the world is in the United States, Mexico, Central and South America, with the remaining 10% in Spain.

*Test sets comprised of almost 8.5 hours of diverse audio and transcribed text covering multiple use cases. Accented test files included variations in gender, age, region and ethnicity of speakers.

Want to see more content like this? Sign up for our newsletter!

Latest Articles

Carousel slide image
Use Cases

What Word Error Rate Is Acceptable for Legal Transcription?

Word error rate for legal transcription has no single acceptable threshold. But knowing how accuracy, audio quality, and review obligations connect to real legal risk is what separates a reliable transcript from a costly one.

Mieke Smith
Mieke SmithSenior Writer
Carousel slide image
Use Cases

The court reporter shortage crisis: data, causes, and what legal teams are doing about it

The court reporter shortage is reshaping litigation. Explore data, causes, and how legal teams are using digital reporting and AI transcription to adapt.

Tom Young
Tom YoungDigital Specialist
[alt: Bilingual medical model featuring terms related to various health conditions and medications in Arabic and English. Key terms include "Chronic kidney disease," "Heart attack," "Diabetes," and "Insulin," among others, displayed in an organized layout.]
Product

Speechmatics achieves a world first in bilingual Voice AI with new Arabic–English model

Sets a new accuracy bar for real-world code-switching: 35% fewer errors than the closest competitor.

Speechmatics
SpeechmaticsEditorial Team
[alt: Illuminated ancient mud-brick structures stand against a dusk sky, showcasing architectural details and textures. Palm trees are in the foreground, adding to the setting's ambiance. Visually captures a historic site in twilight.]
Product

Your voice agent speaks perfect Arabic. That's the problem.

Most voice AI models are trained on formal Arabic, but real conversations across the Middle East mix dialects and English in ways those systems aren’t built to handle.

Yahia Abaza
Yahia AbazaSenior Product Manger
new blog image header
Technical

How Nvidia Dominates the HuggingFace Leaderboards in This Key Metric

A technical deep-dive into Token Duration Transducers (TDT) — the frame-skipping architecture behind Nvidia's Parakeet models. Covers inference mechanics, training with forward-backward algorithm, and how TDT achieves up to 2.82x faster decoding than standard RNN-T.

Oliver Parish
Oliver Parish Machine Learning Engineer
[alt: Healthcare professionals in scrubs and lab coats walk briskly down a hospital corridor. A nurse uses a tablet while others carry patient charts and attend to a gurney. The setting conveys a busy, clinical environment focused on patient care.]
Use Cases

Why AI-native EHR platforms will treat speech as core infrastructure in 2026

As clinical workflows become automated and AI-driven, real-time speech is shifting from a transcription feature to the foundational intelligence layer inside modern EHR systems.

Vamsi Edara
Vamsi EdaraFounder and CEO, Edvak EHR