Vision
Browse models for vision and compare pricing across providers.
Gemini 3 Flash
GoogleFast and efficient Gemini 3 model for high-throughput workloads.
Gemma 3 4B
GoogleCompact open-weight model for edge and mobile deployment.
Gemma 3 12B
GoogleMid-size open-weight Gemma model with vision support.
Qwen 3.5 9B
AlibabaCompact Qwen 3.5 for single-GPU deployment.
Nova Lite
AmazonAmazon Nova Lite for fast, cost-efficient tasks.
Gemma 3 27B
GoogleLargest Gemma 3 model with strong reasoning and instruction following.
Llama 4 Scout
MetaNatively multimodal MoE model with 10M context. 109B total, 17B active. Fits single H100.
Gemini 2.5 Flash
GoogleSpeed-optimized Gemini with strong reasoning and multimodal capabilities.
Mistral Small 4
Mistral AIUnified model combining fast instruct, deep reasoning, and multimodal chat. 119B params.
Gemini 2.0 Flash
GoogleFast and efficient Gemini model for high-throughput workloads.
Ministral 3 8B
Mistral AIEdge-optimized model with vision support. Apache 2.0 licensed.
Qwen 3.5 35B
AlibabaMid-size Qwen 3.5 MoE model with 35B total, 3B active parameters.
Gemma 4 27B
GoogleMost capable open Gemma model with best intelligence-per-parameter.
GPT-4o Mini
OpenAICost-efficient smaller GPT-4o variant for lightweight tasks.
Llama 4 Maverick
MetaLargest open Llama 4 with 128 experts. 400B total, 17B active. Beats GPT-4o on benchmarks.
Grok 3 Mini
xAILightweight Grok optimized for cost-efficient reasoning.
Qwen 3.5 72B
AlibabaNative multimodal Qwen with text, image, and video processing.
Kimi K2.5
Moonshot AIOpen-weight multimodal model with agent swarm mode supporting up to 100 parallel sub-agents.
GPT-5 Mini
OpenAICompact GPT-5 variant for lightweight tasks and rapid prototyping.
Qwen 3.5 122B
AlibabaLarge Qwen 3.5 MoE model with 122B total, 10B active parameters.
Claude 4.5 Haiku
AnthropicLatest Haiku tier with improved capabilities at fast speed and low cost.
Gemini 2.5 Pro
GoogleHigh-capability Gemini model for complex reasoning and coding tasks.
GPT-5.4 Mini
OpenAICompact GPT-5.4 variant balancing capability and cost.
Qwen 3.6 Plus
AlibabaAlibaba's latest flagship with 1M context and advanced agentic coding.
GPT-5.2
OpenAIGPT-5.2 general-purpose model.
Gemini 3 Pro
GoogleHigh-capability Gemini 3 model. Deprecated in favor of 3.1 Pro.
Qwen 3.5 397B
AlibabaLargest Qwen 3.5 MoE model with 397B total, 17B active parameters.
GPT-5.4
OpenAIOpenAI's latest frontier model combining reasoning, coding, and agentic workflows.
Claude 3.5 Haiku
AnthropicFast and affordable Claude model for high-throughput tasks.
Nova Pro
AmazonAmazon Nova Pro for balanced capability and cost.
o4 Mini
OpenAILightweight reasoning model balancing chain-of-thought rigor with speed and cost efficiency.
Claude 4.5 Sonnet
AnthropicHigh-capability Claude model balancing intelligence and speed.
Claude 4.6 Sonnet
AnthropicLatest Sonnet with Opus-tier capabilities at Sonnet pricing.
Mistral Large 3
Mistral AIMistral's most capable model. 675B MoE with 41B active parameters.
Gemini 3.1 Pro
GoogleGoogle's current flagship model with top benchmark scores and 1M context.
Claude 4.5 Opus
AnthropicPrevious Opus generation with strong reasoning and coding.
Claude 4.6 Opus
AnthropicAnthropic's most capable model with 1M token context and advanced reasoning.
Grok 4
xAILatest Grok with improved instruction following and reduced hallucination.
o3
OpenAIReasoning-focused model with step-by-step deliberation for complex math, coding, and science tasks.
GPT-4o
OpenAIOpenAI's flagship multimodal model with strong reasoning, coding, and vision capabilities.
Nova Premier
AmazonAmazon's most capable LLM for complex reasoning and enterprise tasks.
Grok 3
xAIxAI's flagship LLM trained on 200K+ GPUs with real-time web and X integration.
o3 Pro
OpenAIMost capable reasoning model in OpenAI's lineup with extended thinking for maximum reliability.
GPT-5.4 Pro
OpenAIHighest capability GPT-5.4 tier with maximum reasoning depth. Premium pricing.
Gemma 4 12B
GoogleLatest Gemma generation optimized for reasoning and agentic workflows.