Models
Explore models and compare pricing across providers.
Claude 3.5 Haiku
AnthropicFast and affordable Claude model for high-throughput tasks.
Claude 4.5 Haiku
AnthropicLatest Haiku tier with improved capabilities at fast speed and low cost.
Claude 4.5 Opus
AnthropicPrevious Opus generation with strong reasoning and coding.
Claude 4.5 Sonnet
AnthropicHigh-capability Claude model balancing intelligence and speed.
Claude 4.6 Opus
AnthropicAnthropic's most capable model with 1M token context and advanced reasoning.
Claude 4.6 Sonnet
AnthropicLatest Sonnet with Opus-tier capabilities at Sonnet pricing.
Command A
CohereCohere's latest flagship model for enterprise RAG, tool use, and agents.
Command R+
CohereScalable enterprise model optimized for RAG and multilingual tasks.
DeepSeek R1
DeepSeekReasoning-focused model with chain-of-thought capabilities rivaling o1.
DeepSeek R1 0528
DeepSeekUpdated R1 with improved reasoning accuracy and reduced hallucination.
DeepSeek V3
DeepSeekOpen-weight 671B MoE model with strong coding and reasoning at low cost.
DeepSeek V3.1
DeepSeekUpdated DeepSeek V3 with improved coding and reasoning performance.
DeepSeek V3.2
DeepSeekLatest DeepSeek V3 with improved reasoning and coding. 671B MoE (37B active), MIT licensed, 164K context.
GLM 4.5
Zhipu AIStrong reasoning and coding with 106B total, 12B active MoE architecture.
GLM 4.6
Zhipu AIOpen-source frontier model with 355B parameters. MIT licensed.
GLM 4.7
Zhipu AIOptimized for coding, reasoning, and tool use.
GLM 5
Zhipu AIFrontier 744B model trained on Huawei Ascend chips. Open source with strong agentic capabilities.
GLM 5.1
Z.aiCoding-focused frontier model scoring 94% of Claude Opus 4.6. 744B MoE trained on Huawei Ascend 910B. #1 on SWE-Bench Pro (open source).
GPT OSS 120B
OpenAIOpen-weight 117B MoE model (5.1B active) achieving near o4-mini reasoning. Apache 2.0 licensed, runs on a single 80GB GPU.
GPT-4o
OpenAIOpenAI's flagship multimodal model with strong reasoning, coding, and vision capabilities.
GPT-4o Mini
OpenAICost-efficient smaller GPT-4o variant for lightweight tasks.
GPT-5 Codex
OpenAIGPT-5 variant optimized for code generation and software engineering.
GPT-5 Mini
OpenAICompact GPT-5 variant for lightweight tasks and rapid prototyping.
GPT-5 Nano
OpenAIUltra-lightweight GPT-5 for high-speed, low-cost text generation.
GPT-5.1 Codex
OpenAIGPT-5.1 code-optimized variant.
GPT-5.2
OpenAIGPT-5.2 general-purpose model.
GPT-5.2 Codex
OpenAIGPT-5.2 code-optimized variant.
GPT-5.3 Codex
OpenAIGPT-5.3 code-optimized variant.
GPT-5.4
OpenAIOpenAI's latest frontier model combining reasoning, coding, and agentic workflows.
GPT-5.4 Codex
OpenAILatest GPT-5.4 code-optimized variant with industry-leading coding capabilities.
GPT-5.4 Mini
OpenAICompact GPT-5.4 variant balancing capability and cost.
GPT-5.4 Nano
OpenAIUltra-lightweight GPT-5.4 for high-speed, low-cost tasks.
GPT-5.4 Pro
OpenAIHighest capability GPT-5.4 tier with maximum reasoning depth. Premium pricing.
Gemini 2.0 Flash
GoogleFast and efficient Gemini model for high-throughput workloads.
Gemini 2.5 Flash
GoogleSpeed-optimized Gemini with strong reasoning and multimodal capabilities.
Gemini 2.5 Pro
GoogleHigh-capability Gemini model for complex reasoning and coding tasks.
Gemini 3 Flash
GoogleFast and efficient Gemini 3 model for high-throughput workloads.
Gemini 3 Pro
GoogleHigh-capability Gemini 3 model. Deprecated in favor of 3.1 Pro.
Gemini 3.1 Pro
GoogleGoogle's current flagship model with top benchmark scores and 1M context.
Gemma 3 12B
GoogleMid-size open-weight Gemma model with vision support.
Gemma 3 27B
GoogleLargest Gemma 3 model with strong reasoning and instruction following.
Gemma 3 4B
GoogleCompact open-weight model for edge and mobile deployment.
Gemma 4 12B
GoogleLatest Gemma generation optimized for reasoning and agentic workflows.
Gemma 4 27B
GoogleMost capable open Gemma model with best intelligence-per-parameter.
Grok 3
xAIxAI's flagship LLM trained on 200K+ GPUs with real-time web and X integration.
Grok 3 Mini
xAILightweight Grok optimized for cost-efficient reasoning.
Grok 4
xAILatest Grok with improved instruction following and reduced hallucination.
Kimi K2
Moonshot AIState-of-the-art 1T MoE model with 32B active parameters. Strong coding and agentic capabilities.
Kimi K2.5
Moonshot AIOpen-weight multimodal model with agent swarm mode supporting up to 100 parallel sub-agents.
Llama 3.3 70B
MetaWidely deployed open-weight model with strong general capabilities.
Llama 4 Maverick
MetaLargest open Llama 4 with 128 experts. 400B total, 17B active. Beats GPT-4o on benchmarks.
Llama 4 Scout
MetaNatively multimodal MoE model with 10M context. 109B total, 17B active. Fits single H100.
MiniMax M2.5
MiniMaxMiniMax general-purpose LLM with competitive reasoning and coding capabilities.
MiniMax M2.7
MiniMaxLatest MiniMax general-purpose LLM with improved reasoning.
Ministral 3 8B
Mistral AIEdge-optimized model with vision support. Apache 2.0 licensed.
Mistral Large 3
Mistral AIMistral's most capable model. 675B MoE with 41B active parameters.
Mistral Small 4
Mistral AIUnified model combining fast instruct, deep reasoning, and multimodal chat. 119B params.
Nova Lite
AmazonAmazon Nova Lite for fast, cost-efficient tasks.
Nova Premier
AmazonAmazon's most capable LLM for complex reasoning and enterprise tasks.
Nova Pro
AmazonAmazon Nova Pro for balanced capability and cost.
Qwen 3 235B
AlibabaLargest Qwen 3 model with hybrid thinking modes for flexible reasoning control.
Qwen 3 32B
AlibabaMid-size Qwen 3 with strong coding and math capabilities. Open weight.
Qwen 3 8B
AlibabaCompact Qwen 3 for edge and single-GPU deployment. Open weight.
Qwen 3 Max
AlibabaAlibaba's highest capability Qwen 3 model.
Qwen 3 Max Thinking
AlibabaQwen 3 Max with extended reasoning and chain-of-thought capabilities.
Qwen 3.5 122B
AlibabaLarge Qwen 3.5 MoE model with 122B total, 10B active parameters.
Qwen 3.5 35B
AlibabaMid-size Qwen 3.5 MoE model with 35B total, 3B active parameters.
Qwen 3.5 397B
AlibabaLargest Qwen 3.5 MoE model with 397B total, 17B active parameters.
Qwen 3.5 72B
AlibabaNative multimodal Qwen with text, image, and video processing.
Qwen 3.5 9B
AlibabaCompact Qwen 3.5 for single-GPU deployment.
Qwen 3.6 Plus
AlibabaAlibaba's latest flagship with 1M context and advanced agentic coding.
o3
OpenAIReasoning-focused model with step-by-step deliberation for complex math, coding, and science tasks.
o3 Pro
OpenAIMost capable reasoning model in OpenAI's lineup with extended thinking for maximum reliability.
o4 Mini
OpenAILightweight reasoning model balancing chain-of-thought rigor with speed and cost efficiency.