Self-Supervised Learning: Do Believe the Hype

Posted on 1.8.2022

Each year Gartner®, a company that delivers actionable, objective insight to executives and their teams, publishes Hype Cycles, ‘a graphic representation of the maturity and adoption of technologies and applications.’ In 2022’s Hype Cycle™ for Data Science and Machine Learning, the Gartner® report explains the many advantages to self-supervised learning – benefits we experience every day with our Autonomous Speech Recognition (ASR) engine.

Gartner Hype Cycle

“Self-supervised learning is an approach to machine learning in which labeled data is created from the data itself, without having to rely on historical outcome data or external (human) supervisors that provide labels or feedback. It is inspired by the way humans learn through observation, gradually building up general knowledge about concepts, events and their relations, or spatiotemporal associations in the real world.”

At Speechmatics, our award-winning (ASR) engine needs vast quantities of data to keep improving and innovating. To put it into perspective, we’ve used self-supervised learning to train our technology on 1.1 million hours of audio – resulting in a more comprehensive understanding of voices.

The Many Benefits of Self-Supervised Learning

Fundamentally, self-supervised learning does what it says on the tin. The Gartner® report tells us that there’s no need for human supervision. “In self-supervised learning, labels can be generated automatically from the data itself, without the need for human annotation. In essence, this is done by masking elements in the available data (e.g., a part of an image, a sensor reading in a time series, a frame in a video or a word in a sentence) and then training a model to “predict” the missing element.”

If you’ve seen our ASR at work, you’ll notice the transcription might initially be incorrect, only for the AI to correct or ‘predict’ the missing word. From there, the model can fine-tune the data, deriving more value from it and developing a learning relationship.

From there, the Gartner® report tells that “Self-supervised learning has the potential to bring AI closer to the way humans learn. This occurs mainly via observation and association, building up general knowledge about the world through abstractions and then using this knowledge as a foundation for new learning tasks, thus incrementally building up ever-more knowledge that in future AI scenarios may serve as common sense.”

We believe that encapsulates how we innovate – by learning more about how humans talk, we can continue to grow our ASR and make it as accessible as possible. The more data we gather, the more knowledge we build. Consequently, our ASR understands voices with more common sense – a distinctly human approach.

See how great self-supervised learning is for yourself with our revamped SaaS Portal, or download the report to learn more.

Gartner Reports

John Hughes, Accuracy Lead, Speechmatics

Gartner, Hype Cycle for Data Science and Machine Learning, 2022, By Farhan Choudhary, Peter Krensky, 29 June 2022
Hype Cycle for Data Science and Machine Learning, 2022 – read the Gartner® report

Gartner and Hype Cycle are registered trademarks of Gartner, Inc. and/or its affiliates in the U.S. and internationally and are used herein with permission. All rights reserved.

Gartner does not endorse any vendor, product or service depicted in its research publications and does not advise technology users to select only those vendors with the highest ratings or other designation. Gartner research publications consist of the opinions of Gartner’s Research & Advisory organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose.

Ready to Try Speechmatics?

Sign up for our free speech-to-text SaaS Portal and we'll guide you through the implementation of our API. We pride ourselves on offering the best support for your business needs. If you have any questions, just ask.