Catalog
Models
Explore models and compare pricing across providers.
All Creators
Alibaba Amazon Anthropic AssemblyAI BAAI Black Forest Labs ByteDance Cartesia Cohere DeepSeek Deepgram ElevenLabs
Open Weight Only
Chirp 2
GoogleGoogle's latest speech recognition model with improved accuracy across 100+ languages.
1 provider
GPT-4o Transcribe
OpenAILatest OpenAI transcription with lower error rates than Whisper. Recommended over Whisper for API use.
1 provider
Nova 2
DeepgramReal-time STT specialist with sub-300ms latency, streaming WebSocket API, and domain-specific vocabulary.
No providers yet
Slam-1
AssemblyAISpeech-language model with multilingual streaming, safety guardrails, and LLM gateway integration.
No providers yet
Speech to Text
ElevenLabsElevenLabs speech recognition and transcription service.
3 providers
Universal-2
AssemblyAIBenchmark-leading accuracy at ~8.4% WER with 30% fewer hallucinations than Whisper. Full audio intelligence suite.
No providers yet
Whisper Large V3
OpenAIOpen-weight speech recognition supporting 50+ languages. Handles accents, noise, and technical language.
1.5B Open
5 providers