Chat / Instruct
Browse models for chat / instruct and compare pricing across providers.
Gemini 3 Flash
Google: Fast and efficient Gemini 3 model for high-throughput workloads.
Gemma 3 4B
Google: Compact open-weight model for edge and mobile deployment.
Gemma 3 12B
Google: Mid-size open-weight Gemma model with vision support.
Qwen 3.5 9B
Alibaba: Compact Qwen 3.5 for single-GPU deployment.
GPT-5 Nano
OpenAI: Ultra-lightweight GPT-5 for high-speed, low-cost text generation.
Qwen 3 8B
Alibaba: Compact Qwen 3 for edge and single-GPU deployment. Open weight.
GLM 4.7
Zhipu AI: Optimized for coding, reasoning, and tool use.
Nova Lite
Amazon: Amazon Nova Lite for fast, cost-efficient tasks.
GPT OSS 120B
OpenAI: Open-weight 117B MoE model (5.1B active) achieving near o4-mini reasoning. Apache 2.0 licensed, runs on a single 80GB GPU.
Gemma 3 27B
Google: Largest Gemma 3 model with strong reasoning and instruction following.
Llama 4 Scout
Meta: Natively multimodal MoE model with 10M context. 109B total, 17B active. Fits on a single H100.
Qwen 3 32B
Alibaba: Mid-size Qwen 3 with strong coding and math capabilities. Open weight.
Gemini 2.5 Flash
Google: Speed-optimized Gemini with strong reasoning and multimodal capabilities.
Mistral Small 4
Mistral AI: Unified model combining fast instruct, deep reasoning, and multimodal chat. 119B params.
Gemini 2.0 Flash
Google: Fast and efficient Gemini model for high-throughput workloads.
Llama 3.3 70B
Meta: Widely deployed open-weight model with strong general capabilities.
Ministral 3 8B
Mistral AI: Edge-optimized model with vision support. Apache 2.0 licensed.
Qwen 3 235B
Alibaba: Largest Qwen 3 model with hybrid thinking modes for flexible reasoning control.
Qwen 3.5 35B
Alibaba: Mid-size Qwen 3.5 MoE model with 35B total, 3B active parameters.
MiniMax M2.5
MiniMax: MiniMax general-purpose LLM with competitive reasoning and coding capabilities.
Gemma 4 27B
Google: Most capable open Gemma model with best intelligence-per-parameter.
GLM 4.5
Zhipu AI: Strong reasoning and coding with 106B total, 12B active MoE architecture.
DeepSeek V3.1
DeepSeek: Updated DeepSeek V3 with improved coding and reasoning performance.
GPT-4o Mini
OpenAI: Cost-efficient smaller GPT-4o variant for lightweight tasks.
Llama 4 Maverick
Meta: Largest open Llama 4 with 128 experts. 400B total, 17B active. Beats GPT-4o on benchmarks.
Kimi K2
Moonshot AI: State-of-the-art 1T MoE model with 32B active parameters. Strong coding and agentic capabilities.
DeepSeek V3
DeepSeek: Open-weight 671B MoE model with strong coding and reasoning at low cost.
GPT-5.4 Nano
OpenAI: Ultra-lightweight GPT-5.4 for high-speed, low-cost tasks.
Grok 3 Mini
xAI: Lightweight Grok optimized for cost-efficient reasoning.
Qwen 3.5 72B
Alibaba: Native multimodal Qwen with text, image, and video processing.
Kimi K2.5
Moonshot AI: Open-weight multimodal model with agent swarm mode supporting up to 100 parallel sub-agents.
GPT-5 Mini
OpenAI: Compact GPT-5 variant for lightweight tasks and rapid prototyping.
DeepSeek V3.2
DeepSeek: Latest DeepSeek V3 with improved reasoning and coding. 671B MoE (37B active), MIT licensed, 164K context.
DeepSeek R1
DeepSeek: Reasoning-focused model with chain-of-thought capabilities rivaling o1.
Qwen 3.5 122B
Alibaba: Large Qwen 3.5 MoE model with 122B total, 10B active parameters.
MiniMax M2.7
MiniMax: Latest MiniMax general-purpose LLM with improved reasoning.
Claude 4.5 Haiku
Anthropic: Latest Haiku tier with improved capabilities at fast speed and low cost.
Gemini 2.5 Pro
Google: High-capability Gemini model for complex reasoning and coding tasks.
GLM 4.6
Zhipu AI: Open-source frontier model with 355B parameters. MIT licensed.
GPT-5.4 Mini
OpenAI: Compact GPT-5.4 variant balancing capability and cost.
Qwen 3.6 Plus
Alibaba: Alibaba's latest flagship with 1M context and advanced agentic coding.
GPT-5.2
OpenAI: GPT-5.2 general-purpose model.
DeepSeek R1 0528
DeepSeek: Updated R1 with improved reasoning accuracy and reduced hallucination.
Gemini 3 Pro
Google: High-capability Gemini 3 model. Deprecated in favor of 3.1 Pro.
GPT-5 Codex
OpenAI: GPT-5 variant optimized for code generation and software engineering.
GPT-5.1 Codex
OpenAI: GPT-5.1 code-optimized variant.
Qwen 3.5 397B
Alibaba: Largest Qwen 3.5 MoE model with 397B total, 17B active parameters.
GPT-5.2 Codex
OpenAI: GPT-5.2 code-optimized variant.
GPT-5.3 Codex
OpenAI: GPT-5.3 code-optimized variant.
GPT-5.4
OpenAI: OpenAI's latest frontier model combining reasoning, coding, and agentic workflows.
GPT-5.4 Codex
OpenAI: Latest GPT-5.4 code-optimized variant with industry-leading coding capabilities.
GLM 5
Zhipu AI: Frontier 744B model trained on Huawei Ascend chips. Open source with strong agentic capabilities.
Claude 3.5 Haiku
Anthropic: Fast and affordable Claude model for high-throughput tasks.
Nova Pro
Amazon: Amazon Nova Pro for balanced capability and cost.
o4 Mini
OpenAI: Lightweight reasoning model balancing chain-of-thought rigor with speed and cost efficiency.
Claude 4.5 Sonnet
Anthropic: High-capability Claude model balancing intelligence and speed.
Claude 4.6 Sonnet
Anthropic: Latest Sonnet with Opus-tier capabilities at Sonnet pricing.
Mistral Large 3
Mistral AI: Mistral's most capable model. 675B MoE with 41B active parameters.
Qwen 3 Max
Alibaba: Alibaba's highest-capability Qwen 3 model.
Qwen 3 Max Thinking
Alibaba: Qwen 3 Max with extended reasoning and chain-of-thought capabilities.
Gemini 3.1 Pro
Google: Google's current flagship model with top benchmark scores and 1M context.
GLM 5.1
Z.ai: Coding-focused frontier model scoring 94% of Claude Opus 4.6. 744B MoE trained on Huawei Ascend 910B. #1 on SWE-Bench Pro (open source).
Claude 4.5 Opus
Anthropic: Previous Opus generation with strong reasoning and coding.
Claude 4.6 Opus
Anthropic: Anthropic's most capable model with 1M token context and advanced reasoning.
Grok 4
xAI: Latest Grok with improved instruction following and reduced hallucination.
o3
OpenAI: Reasoning-focused model with step-by-step deliberation for complex math, coding, and science tasks.
Command A
Cohere: Cohere's latest flagship model for enterprise RAG, tool use, and agents.
Command R+
Cohere: Scalable enterprise model optimized for RAG and multilingual tasks.
GPT-4o
OpenAI: OpenAI's flagship multimodal model with strong reasoning, coding, and vision capabilities.
Nova Premier
Amazon: Amazon's most capable LLM for complex reasoning and enterprise tasks.
Grok 3
xAI: xAI's flagship LLM trained on 200K+ GPUs with real-time web and X integration.
o3 Pro
OpenAI: Most capable reasoning model in OpenAI's lineup with extended thinking for maximum reliability.
GPT-5.4 Pro
OpenAI: Highest-capability GPT-5.4 tier with maximum reasoning depth. Premium pricing.
Gemma 4 12B
Google: Latest Gemma generation optimized for reasoning and agentic workflows.