Aug 31, 2022 | Read time 2 min

Speechmatics launches Language Identification, allowing users to automatically determine the predominant language in a media file

Speechmatics Editorial team

Latest addition to Speechmatics engine saves time on manually reviewing files and is applicable to a wide variety of use cases

Speechmatics, the leading autonomous speech recognition technology scaleup, has now added Language Identification (Language ID) to its industry-leading speech-to-text engine. This latest addition allows customers to automatically identify the predominant language spoken in any media file. Customers will save time and effort on manually reviewing files, safe in the knowledge that they will be provided with an accurate transcription of any media file.

Language ID drives efficiency by removing the manual step of selecting which language pack should be used when the language is not explicitly stated on the file. Often requested, it not only helps users identify unknown languages, but also adds useful metadata about the language of the spoken audio. Media and broadcast organizations have extensive archives of audio, the content of which is often unknown. Instead of manually listening to hours of speech – and relying on human interpretation to label it – Speechmatics Language ID confirms the language pre-transcription. For contact centers, being able to identify the predominant language spoken (especially when callers switch languages) is a huge benefit to those conducting call analysis.

Speechmatics has built the most accurate and inclusive speech-to-text engine available. Historically, training data had to be manually tagged, classified or ‘labelled’. This has resulted in engines that have been trained on narrow datasets, which fail to represent the diversity of voices that use them. In contrast, Speechmatics’ speech-to-text engine is trained through exposure to hundreds of thousands of individual voices using millions of hours of unlabelled, more representative voice data. Speechmatics has applied this technique to identifying predominant spoken languages on a diverse set of audio data.

Commenting on this rollout of Language ID, CEO Katy Wigdahl said, “Up until now, identifying languages without human intervention has been costly and time-consuming for users of speech-to-text. However, with our new Language ID, this will be a thing of the past and allow customers to swiftly identify and transcribe media files – with less hassle and more efficiency. We can’t wait for our customers to use this Language ID and see it deliver accurate and valuable results.”

This latest update can be used with pre-recorded media files, works with up to 12 languages and adds a confidence score to show the certainty of the predominant language. Supported languages are English, German, Spanish, French, Hindi, Italian, Japanese, Korean, Mandarin, Dutch, Portuguese, and Russian.
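
As a rough illustration of how a confidence score might be used downstream, the Python sketch below picks a language from a set of per-language scores and returns nothing when the top score is too low, so the file can be sent for manual review instead. The input shape (a mapping of language code to score), the `choose_language` helper, the language codes and the 0.5 threshold are all assumptions made for this example; none of them are taken from the Speechmatics API.

```python
from typing import Dict, Optional

# Codes for the 12 supported languages listed above. The codes themselves
# are an assumption for illustration, not confirmed Speechmatics identifiers.
SUPPORTED = {"en", "de", "es", "fr", "hi", "it", "ja", "ko", "cmn", "nl", "pt", "ru"}


def choose_language(scores: Dict[str, float], threshold: float = 0.5) -> Optional[str]:
    """Return the highest-scoring supported language, or None if unsure.

    `scores` is a hypothetical mapping of language code to confidence (0 to 1).
    """
    # Ignore any language that is not in the supported set.
    candidates = {lang: score for lang, score in scores.items() if lang in SUPPORTED}
    if not candidates:
        return None
    best = max(candidates, key=candidates.get)
    # Below the threshold, defer to a human rather than guess.
    return best if candidates[best] >= threshold else None


if __name__ == "__main__":
    print(choose_language({"en": 0.91, "fr": 0.06, "de": 0.03}))
    print(choose_language({"es": 0.41, "pt": 0.38}))
```

In a workflow like this, a `None` result would route the file to manual review, while any other result would select the language pack used for transcription.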
