Models

Explore models and compare pricing across providers.

Google's latest speech recognition model with improved accuracy across 100+ languages.

OpenAI's latest transcription model, with lower error rates than Whisper. Recommended over Whisper for API use.

Real-time speech-to-text (STT) specialist with sub-300 ms latency, a streaming WebSocket API, and domain-specific vocabulary support.

Speech-language model with multilingual streaming, safety guardrails, and LLM gateway integration.

ElevenLabs speech recognition and transcription service.

Benchmark-leading accuracy at ~8.4% word error rate (WER), with 30% fewer hallucinations than Whisper. Full audio intelligence suite.

Open-weight speech recognition supporting 50+ languages. Handles accents, noise, and technical language.