Jul 8, 2023 | Read time 4 min

YouTube’s Captions Represent the Dire Need for Speech-to-Text Innovation

YouTube’s automated captioning service is notoriously unreliable and represents the dire need for innovation within the speech-to-text industry. Find out what we’re doing about it.
Benedetta Cevoli, Senior Machine Learning Engineer