What does Speechmatics do?

Speechmatics provides speech technology and Voice AI for enterprises, offering accurate Speech-to-Text, Text-to-Speech, and Voice Agent solutions. Our models understand every voice and accent across 56+ languages, helping businesses unlock the full potential of voice data.

How accurate is Speechmatics Speech-to-Text?

Speechmatics delivers best-in-market accuracy, achieving up to 99% word accuracy and 96% medical keyword recall in industry benchmarks. Our models handle varied accents, noisy environments, and multiple speakers with ease.

What makes Speechmatics Text-to-Speech different?

Our low-latency Text-to-Speech (TTS) delivers lifelike, human-sounding voices with sub-150ms latency, making it ideal for real-time conversations. Developers can stream natural speech in multiple voices and deploy it in the cloud, hybrid, or on-prem for privacy and control.

Can I build real-time voice agents with Speechmatics?

Our voice AI enables developers to build real-time voice agents that listen, understand, and respond naturally. Plug in fast with a flexible API and native integrations to power your AI voice agents.

Which industries use Speechmatics?

Speechmatics is trusted by organizations in media, healthcare, contact centers, finance, legal, education, and accessibility. Our technology powers transcription, translation, call analytics, and voice AI applications worldwide.

Speechmatics unlocks accurate understanding of financial terms with new language pack

Engine trained on 200,000 hours of earnings call transcripts to reduce errors by 40%

Speechmatics, the leading speech recognition technology scaleup, has launched an English language pack specific to the finance industry. The pack has been built for use cases including compliance, fraud identification, analytics, financial news and earnings calls. The world’s most accurate and inclusive speech-to-text engine can now identify finance terminology in conversation, helping to avoid confusion over abbreviations, acronyms and finance-specific terms.

The financial services sector is notoriously jargon-heavy, with terms that are either unique to the industry or easily confused with commonly used phrases. Acronyms such as VAT or SEC can trip up standard speech-to-text engines, as can abbreviations that sound like everyday words: GAAP (Generally Accepted Accounting Principles), for example, is often mistaken for the word ‘gap’. Speechmatics can now capture the speech data as intended, turning unstructured audio data into usable information. More accurate transcripts in turn make downstream tasks more consistent and streamlined for users.

Global experts in deep learning and speech recognition, Speechmatics has built the most accurate and inclusive speech-to-text engine available. Historically, training data had to be manually tagged, classified or ‘labelled’. This has resulted in engines trained on narrow datasets, which fail to represent the diversity of voices that use them. In contrast, Speechmatics’ speech-to-text engine is trained through exposure to hundreds of thousands of individual voices using millions of hours of unlabelled, more representative voice data. This has enabled a paradigm shift in accuracy, dramatically reducing both AI bias and errors in speech recognition. Given the broad range of demographics that exist within financial services, Speechmatics’ new offering will be key to supporting and sustaining inclusivity in the sector.

Katy Wigdahl, CEO, Speechmatics, said, “Our aim is to understand every voice regardless of race, gender or accent and I’m proud that Speechmatics has overcome significant challenges that traditional speech-to-text engines have struggled with.

“However, we wanted to go even further and dive into the complexities that specific industries present. Some sectors are known for complex terms and jargon that, if added to our global models, risk making the technology less effective for other users. This led to our approach for domain-specific packs that can directly address the needs of individual sectors. Financial services was an obvious place to start but we hope our language pack will set a blueprint for every high-stakes industry where the financial, reputational and social cost of misunderstanding is high.”

Customers are already using the finance language pack to transcribe financial news and earnings calls, and to support call centre analysts and traders. It is the first industry-specific pack from Speechmatics and paves the way for packs covering industries with equally complex terminology, such as medicine and law.

Jul 12, 2022 | Read time 2 min
