Catalog

Models

Explore models and compare pricing across providers.

BGE-M3

BAAI
Embedding

Most popular open-source multilingual embedding model. Supports dense, sparse, and multi-vector retrieval.

Open
1 provider

DeepSeek R1

DeepSeek
LLM

Reasoning-focused model with chain-of-thought capabilities rivaling o1.

671B MoE 128k ctx Open
8 providers

DeepSeek R1 0528

DeepSeek
LLM

Updated R1 with improved reasoning accuracy and reduced hallucination.

671B MoE 128k ctx Open
8 providers

DeepSeek V3

DeepSeek
LLM

Open-weight 671B MoE model with strong coding and reasoning at low cost.

671B MoE 128k ctx Open
10 providers

DeepSeek V3.1

DeepSeek
LLM

Updated DeepSeek V3 with improved coding and reasoning performance.

671B MoE 128k ctx Open
7 providers

DeepSeek V3.2

DeepSeek
LLM

Latest DeepSeek V3 with improved reasoning and coding. 671B MoE (37B active), MIT licensed, 164K context.

671B MoE (37B active) 164k ctx Open
10 providers

Flux 1 Dev

Black Forest Labs
Image Gen

Open-weight development model for high-quality image generation. 12B parameters.

Open
5 providers

Flux 1 Schnell

Black Forest Labs
Image Gen

Fastest Flux model optimized for speed. 12B parameters, Apache 2.0 licensed.

Open
5 providers

Flux 2 Dev

Black Forest Labs
Image Gen

Open-weight Flux 2 development model.

Open
4 providers

Flux 2 Klein

Black Forest Labs
Image Gen

Ultra-fast Flux 2 model generating images in under 0.5 seconds. Available in 4B and 9B.

Open
4 providers

Flux Kontext Dev

Black Forest Labs
Image Gen

Open-weight context-aware image editing model.

Open
4 providers

GLM 4.5

Zhipu AI
LLM

Strong reasoning and coding with 106B total, 12B active MoE architecture.

106B MoE (12B active) 128k ctx Open
5 providers

GLM 4.6

Zhipu AI
LLM

Open-source frontier model with 355B parameters. MIT licensed.

355B 128k ctx Open
5 providers

GLM 5

Zhipu AI
LLM

Frontier 744B model trained on Huawei Ascend chips. Open source with strong agentic capabilities.

744B 128k ctx Open
11 providers

GPT OSS 120B

OpenAI
LLM

Open-weight 117B MoE model (5.1B active) achieving near o4-mini reasoning. Apache 2.0 licensed, runs on a single 80GB GPU.

117B MoE (5.1B active) 131k ctx Open
7 providers

Gemma 3 12B

Google
LLM

Mid-size open-weight Gemma model with vision support.

12B 128k ctx Open
4 providers

Gemma 3 27B

Google
LLM

Largest Gemma 3 model with strong reasoning and instruction following.

27B 128k ctx Open
5 providers

Gemma 3 4B

Google
LLM

Compact open-weight model for edge and mobile deployment.

4B 32k ctx Open
3 providers

Gemma 4 12B

Google
LLM

Latest Gemma generation optimized for reasoning and agentic workflows.

12B 128k ctx Open
No providers yet

Gemma 4 27B

Google
LLM

Most capable open Gemma model with best intelligence-per-parameter.

27B 128k ctx Open
4 providers

HiDream I1

HiDream
Image Gen

Open-source 17B parameter image model with sparse DiT architecture. MIT licensed.

Open
2 providers

HunyuanVideo 1.5

Tencent
Video Gen

Open-source 8.3B parameter video model with state-of-the-art visual quality on consumer GPUs.

Open
3 providers

Jina Embeddings V3

Jina AI
Embedding

Multilingual text embedding model with Matryoshka representation learning.

Open
No providers yet

Jina Embeddings V4

Jina AI
Embedding

Multimodal embedding model supporting text, images, and PDFs. Built on Qwen2.5-VL-3B with LoRA adapters.

3.8B Open
No providers yet

Kimi K2

Moonshot AI
LLM

State-of-the-art 1T MoE model with 32B active parameters. Strong coding and agentic capabilities.

1T MoE (32B active) 128k ctx Open
9 providers

Kimi K2.5

Moonshot AI
LLM

Open-weight multimodal model with agent swarm mode supporting up to 100 parallel sub-agents.

128k ctx Open
12 providers

Kolors

Kuaishou
Image Gen

Open-source bilingual text-to-image model trained on billions of pairs. Apache 2.0 licensed.

Open
2 providers

Llama 3.3 70B

Meta
LLM

Widely deployed open-weight model with strong general capabilities.

70B 128k ctx Open
13 providers

Llama 4 Maverick

Meta
LLM

Largest open Llama 4 with 128 experts. 400B total, 17B active. Beats GPT-4o on benchmarks.

400B (17B active) 1M ctx Open
6 providers

Llama 4 Scout

Meta
LLM

Natively multimodal MoE model with 10M context. 109B total, 17B active. Fits single H100.

109B (17B active) 10M ctx Open
6 providers

Ministral 3 8B

Mistral AI
LLM

Edge-optimized model with vision support. Apache 2.0 licensed.

8B 128k ctx Open
3 providers

Mistral Large 3

Mistral AI
LLM

Mistral's most capable model. 675B MoE with 41B active parameters.

675B MoE (41B active) 128k ctx Open
3 providers

Mistral Small 4

Mistral AI
LLM

Unified model combining fast instruct, deep reasoning, and multimodal chat. 119B params.

119B 256k ctx Open
4 providers

Qwen 3 235B

Alibaba
LLM

Largest Qwen 3 model with hybrid thinking modes for flexible reasoning control.

235B MoE 128k ctx Open
10 providers

Qwen 3 32B

Alibaba
LLM

Mid-size Qwen 3 with strong coding and math capabilities. Open weight.

32B 128k ctx Open
6 providers

Qwen 3 8B

Alibaba
LLM

Compact Qwen 3 for edge and single-GPU deployment. Open weight.

8B 128k ctx Open
6 providers

Qwen 3 TTS

Alibaba
TTS

Qwen 3 text-to-speech model with voice cloning support.

Open
2 providers

Qwen 3.5 122B

Alibaba
LLM

Large Qwen 3.5 MoE model with 122B total, 10B active parameters.

122B MoE (10B active) 128k ctx Open
3 providers

Qwen 3.5 35B

Alibaba
LLM

Mid-size Qwen 3.5 MoE model with 35B total, 3B active parameters.

35B MoE (3B active) 128k ctx Open
3 providers

Qwen 3.5 397B

Alibaba
LLM

Largest Qwen 3.5 MoE model with 397B total, 17B active parameters.

397B MoE (17B active) 128k ctx Open
3 providers

Qwen 3.5 72B

Alibaba
LLM

Native multimodal Qwen with text, image, and video processing.

72B 128k ctx Open
2 providers

Qwen 3.5 9B

Alibaba
LLM

Compact Qwen 3.5 for single-GPU deployment.

9B 128k ctx Open
2 providers

Qwen3 Embedding 0.6B

Alibaba
Embedding

Compact Qwen3 embedding for edge and low-resource deployment. Apache 2.0.

0.6B Open
2 providers

Qwen3 Embedding 4B

Alibaba
Embedding

Mid-size Qwen3 embedding balancing performance and efficiency. Apache 2.0.

4B Open
1 provider

Qwen3 Embedding 8B

Alibaba
Embedding

#1 on MTEB multilingual leaderboard. Best open-source embedding model. Apache 2.0.

8B Open
1 provider

SDXL 1.0

Stability AI
Image Gen

Stable Diffusion XL — widely adopted open-weight image generation model.

Open
2 providers

Stable Diffusion 3.5 Large

Stability AI
Image Gen

Stability AI's largest SD3 model with best quality. Open weight.

Open
1 provider

Stable Diffusion 3.5 Large Turbo

Stability AI
Image Gen

Distilled SD3 Large for faster generation with minimal quality loss. Open weight.

Open
No providers yet

Stable Diffusion 3.5 Medium

Stability AI
Image Gen

Balanced SD3 model for quality and speed. Open weight.

Open
2 providers

Voxtral TTS

Mistral AI
TTS

Open-weight 4B TTS model. 9 languages, ~90ms TTFA, voice cloning from 3s reference. CC BY NC 4.0.

4B Open
No providers yet

Wan 2.2

Alibaba
Video Gen

Top open-source video model with MoE architecture. Trained on 1.5B videos and 10B images.

Open
6 providers

Wan 2.5

Alibaba
Video Gen

Previous generation Wan video model with 720p generation 30% faster than 2.2.

Open
5 providers

Wan 2.6

Alibaba
Video Gen

Updated Wan video model with improved quality and speed.

Open
6 providers

Wan 2.7 Video

Alibaba
Video Gen

Latest Alibaba Wan video model with editing, extending, and reference-to-video capabilities.

Open
3 providers

Whisper Large V3

OpenAI
STT

Open-weight speech recognition supporting 50+ languages. Handles accents, noise, and technical language.

1.5B Open
5 providers