Mar 2, 2022 | Read time 3 min

Understanding Every Voice: The Aim of Speechmatics

Learn more about Speechmatics with a deep dive into one of our core aims - understanding every voice through our products and company values.
Speechmatics Editorial team

At Speechmatics, our aim is to make voice technology inclusive and accessible to everyone in the world, regardless of demographic, age, gender, accent, dialect, or location. In its simplest terms, we want to understand every voice. While we have a focus on how our technology gets us to this goal, something we must always consider is the human angle.

Our industry-leading Autonomous Speech Recognition (ASR) is pioneering for several reasons, not least because of its ability to understand the widest range of dialects. We’ve developed a Global speech-to-text language pack for English and Spanish, which means the technology can understand all accents and dialects within the language, negating the need for more complicated and less effective multi-layered systems.

It’s this people-first thinking that puts us ahead of our competitors. Understanding every voice must be led by people, and it is a core value that must be embodied to be believed. At Speechmatics, we’re continually looking at how we can build understanding every voice into every aspect of what we do.

Our First Steps

As you’d expect, understanding every voice takes time. The tech is still evolving, and the company is still growing. Naturally, it wasn’t always like this. The values at the start of Speechmatics’ growth journey were ‘Live for the Wow’, ‘Build Authentic Relationships’, and ‘Be the Adventure’.

While a good start, these values don’t reflect our business as it is today. That’s why we have new ones:

  • People First: Hiring the best candidate for the role, not necessarily the most qualified.

  • Move Fast: Empowering employees to “fail fast and move on.” Creating an environment where we can “debate freely, make timely decisions, and commit to outcomes.”

  • Care Deeply: This is where the ‘understanding every voice’ aim originates. We want to ensure we put people first, especially when it comes to the impact our actions have on the world.

  • Be Ambitious: Attempting to make change within the industry through big goals and breakthroughs.

Understanding every voice is the glue that holds our present values together. Plus, these values are much more relatable for new recruits, giving us a better chance of hiring the best person for a role, which is essential as we grow.

Shaping the Company

Speechmatics’ growth, while rapid, mirrors human life. Starting out, we had one aim: to grow. But soon we’ll hit the business equivalent of puberty, which means it’s time to take a thorough look at the path ahead. Much like in real life, our aims and values have ebbed and flowed over the course of our development.

The company was very much tech and machine learning-focused while still having that ‘people first’ culture. Over five years later, we are getting close to 200 employees. The more Speechlings there are, the more aspects of human life we need to consider. Nowadays, there is much more of an onus on everyone at the company to take our global approach to ASR and embed it in the hiring process.

If it’s people’s needs leading our direction, we must understand them.

Our Plans to Truly Understand Every Voice

Currently, our ASR can transcribe 31 languages. While there is still some way to go, we are constantly looking to expand. Language, after all, is universal. One of the best things about this is that we do not need a retrospective, hindsight-laden analysis to fully comprehend the power of understanding every voice.

We see it every day in the work we do, in the data we collect, and in the products we release. It’s this inclusivity we are looking to push even further.

After all, change happens one word at a time.
