Enabling 100,000+ developers with leading speech recognition
Pairing LiveKit’s flexible agent framework with Speechmatics to build world-class agents
Noise-resistant recognition
Accurately transcribe speech even in challenging acoustic environments with background noise, echo, and interference.
Accent & dialect mastery
Trained on diverse global speech patterns to understand heavy accents, regional dialects, and non-native speakers.
Multi-speaker intelligence
Identify and separate different speakers in conversations, meetings, and complex audio scenarios.
Real-time processing
Low-latency streaming recognition perfect for live conversations, calls, and interactive applications.
Enterprise security
Bank-grade encryption and compliance with GDPR, SOC 2, HIPAA and other security standards.
Advanced analytics
Detailed insights into speech patterns, confidence scores, and conversation analytics.

Your coupon: BUILD200
1) 👤 Log in to or sign up for the Speechmatics Portal
2) 💳 Add a valid payment card (no charge until credit is used)
3) ✅ Complete billing setup to enable the coupon
4) 🔑 Enter your code: BUILD200
5) 🚀 Start building with $200 free credit
Hurry, offer ends soon.
Voice AI Agent Resources
![[alt: Vapi integration launch blog social asset]](/_next/image?url=https%3A%2F%2Fimages.ctfassets.net%2Fyze1aysi0225%2F5rvEvjLDjyosWx3mVI7L76%2Fbacc01b541e87a90558373ca7b16d539%2FVapi-blog-assets-V1-Social-sharing.png&w=3840&q=75)
Vapi and Speechmatics: Build agents that understand every voice
Ship Voice AI agents that stay readable in real time, even in noisy, multi-speaker calls.
![[alt: Why we built our text to speech image]](/_next/image?url=https%3A%2F%2Fimages.ctfassets.net%2Fyze1aysi0225%2F6mXsRaEBqOQwN3XL6hsKhF%2Fa9a5730fc2c7855de6e1f6e446644892%2FTTS-preview-widecarousel_1200x480_1x.webp&w=3840&q=75)
Why we built our low-latency Text-to-Speech
Most TTS sounds great in demos but breaks in real conversations. We built ours for sub-150ms latency, natural voices, and global scale.
![[alt: Livekit and Speechmatics partnership]](/_next/image?url=https%3A%2F%2Fimages.ctfassets.net%2Fyze1aysi0225%2F55uo621nIAzecVIcDsrrGX%2Fa81809b4dcf9acd1883ce628f8a10552%2FLiveKit-blog_assets-V1_-_Header_16-9.webp&w=3840&q=75)
Introducing real-time, speaker-aware Voice Agents with LiveKit + Speechmatics
Speechmatics brings speaker diarization to LiveKit agents, enabling them to understand not just what was said, but who said it.
![[alt: Like a dalek social share]](/_next/image?url=https%3A%2F%2Fimages.ctfassets.net%2Fyze1aysi0225%2F5DW0xag6Zfydn7FBot26mh%2Fc3df41cc1f668b240a90e599e4b9ec34%2FTTS-Carousel-1.webp&w=3840&q=75)
Non-English TTS still sounds like a Dalek
Why most voices sound natural in English but still robotic in other languages, and how to fix it.