AI Models Directory | Inference Hub

BGE-M3

BAAI

Embedding

Most popular open-source multilingual embedding model. Supports dense, sparse, and multi-vector retrieval.

Open

1 provider

DeepSeek R1

DeepSeek

LLM

Reasoning-focused model with chain-of-thought capabilities rivaling o1.

671B MoE 128k ctx Open

8 providers

DeepSeek R1 0528

DeepSeek

LLM

Updated R1 with improved reasoning accuracy and reduced hallucination.

671B MoE 128k ctx Open

8 providers

DeepSeek V3

DeepSeek

LLM

Open-weight 671B MoE model with strong coding and reasoning at low cost.

671B MoE 128k ctx Open

10 providers

DeepSeek V3.1

DeepSeek

LLM

Updated DeepSeek V3 with improved coding and reasoning performance.

671B MoE 128k ctx Open

7 providers

DeepSeek V3.2

DeepSeek

LLM

Latest DeepSeek V3 with improved reasoning and coding. 671B MoE (37B active), MIT licensed, 164K context.

671B MoE (37B active) 164k ctx Open

10 providers

Flux 1 Dev

Black Forest Labs

Image Gen

Open-weight development model for high-quality image generation. 12B parameters.

Open

5 providers

Flux 1 Schnell

Black Forest Labs

Image Gen

Fastest Flux 1 model, tuned for low-latency generation. 12B parameters, Apache 2.0 licensed.

Open

5 providers

Flux 2 Dev

Black Forest Labs

Image Gen

Open-weight Flux 2 development model.

Open

4 providers

Flux 2 Klein

Black Forest Labs

Image Gen

Ultra-fast Flux 2 model generating images in under 0.5 seconds. Available in 4B and 9B.

Open

4 providers

Flux Kontext Dev

Black Forest Labs

Image Gen

Open-weight context-aware image editing model.

Open

4 providers

GLM 4.5

Zhipu AI

LLM

Strong reasoning and coding from an MoE architecture with 106B total and 12B active parameters.

106B MoE (12B active) 128k ctx Open

5 providers

GLM 4.6

Zhipu AI

LLM

Open-source frontier model with 355B parameters. MIT licensed.

355B 128k ctx Open

5 providers

GLM 5

Zhipu AI

LLM

Frontier 744B model trained on Huawei Ascend chips. Open source with strong agentic capabilities.

744B 128k ctx Open

11 providers

GPT OSS 120B

OpenAI

LLM

Open-weight 117B MoE model (5.1B active) with reasoning close to o4-mini. Apache 2.0 licensed, runs on a single 80GB GPU.

117B MoE (5.1B active) 131k ctx Open

7 providers

Gemma 3 12B

Google

LLM

Mid-size open-weight Gemma model with vision support.

12B 128k ctx Open

4 providers

Gemma 3 27B

Google

LLM

Largest Gemma 3 model with strong reasoning and instruction following.

27B 128k ctx Open

5 providers

Gemma 3 4B

Google

LLM

Compact open-weight model for edge and mobile deployment.

4B 32k ctx Open

3 providers

Gemma 4 12B

Google

LLM

Latest Gemma generation optimized for reasoning and agentic workflows.

12B 128k ctx Open

No providers yet

Gemma 4 27B

Google

LLM

Most capable open Gemma model with best intelligence-per-parameter.

27B 128k ctx Open

4 providers

HiDream I1

HiDream

Image Gen

Open-source 17B parameter image model with sparse DiT architecture. MIT licensed.

Open

2 providers

HunyuanVideo 1.5

Tencent

Video Gen

Open-source 8.3B parameter video model with state-of-the-art visual quality on consumer GPUs.

Open

3 providers

Jina Embeddings V3

Jina AI

Embedding

Multilingual text embedding model with Matryoshka representation learning.

Open

No providers yet

Jina Embeddings V4

Jina AI

Embedding

Multimodal embedding model supporting text, images, and PDFs. Built on Qwen2.5-VL-3B with LoRA adapters.

3.8B Open

No providers yet

Kimi K2

Moonshot AI

LLM

State-of-the-art 1T MoE model with 32B active parameters. Strong coding and agentic capabilities.

1T MoE (32B active) 128k ctx Open

9 providers

Kimi K2.5

Moonshot AI

LLM

Open-weight multimodal model with agent swarm mode supporting up to 100 parallel sub-agents.

128k ctx Open

12 providers

Kolors

Kuaishou

Image Gen

Open-source bilingual text-to-image model trained on billions of pairs. Apache 2.0 licensed.

Open

2 providers

Llama 3.3 70B

Llama 4 Maverick

Llama 4 Scout

Ministral 3 8B

Mistral AI

LLM

Edge-optimized model with vision support. Apache 2.0 licensed.

8B 128k ctx Open

3 providers

Mistral Large 3

Mistral AI

LLM

Mistral's most capable model. 675B MoE with 41B active parameters.

675B MoE (41B active) 128k ctx Open

3 providers

Mistral Small 4

Mistral AI

LLM

Unified model combining fast instruct, deep reasoning, and multimodal chat. 119B params.

119B 256k ctx Open

4 providers

Qwen 3 235B

Alibaba

LLM

Largest Qwen 3 model with hybrid thinking modes for flexible reasoning control.

235B MoE 128k ctx Open

10 providers

Qwen 3 32B

Alibaba

LLM

Mid-size Qwen 3 with strong coding and math capabilities. Open weight.

32B 128k ctx Open

6 providers

Qwen 3 8B

Alibaba

LLM

Compact Qwen 3 for edge and single-GPU deployment. Open weight.

8B 128k ctx Open

6 providers

Qwen 3 TTS

Alibaba

TTS

Qwen 3 text-to-speech model with voice cloning support.

Open

2 providers

Qwen 3.5 122B

Alibaba

LLM

Large Qwen 3.5 MoE model with 122B total and 10B active parameters.

122B MoE (10B active) 128k ctx Open

3 providers

Qwen 3.5 35B

Alibaba

LLM

Mid-size Qwen 3.5 MoE model with 35B total and 3B active parameters.

35B MoE (3B active) 128k ctx Open

3 providers

Qwen 3.5 397B

Alibaba

LLM

Largest Qwen 3.5 MoE model with 397B total and 17B active parameters.

397B MoE (17B active) 128k ctx Open

3 providers

Qwen 3.5 72B

Alibaba

LLM

Native multimodal Qwen with text, image, and video processing.

72B 128k ctx Open

2 providers

Qwen 3.5 9B

Alibaba

LLM

Compact Qwen 3.5 for single-GPU deployment.

9B 128k ctx Open

2 providers

Qwen3 Embedding 0.6B

Alibaba

Embedding

Compact Qwen3 embedding for edge and low-resource deployment. Apache 2.0.

0.6B Open

2 providers

Qwen3 Embedding 4B

Alibaba

Embedding

Mid-size Qwen3 embedding balancing performance and efficiency. Apache 2.0.

4B Open

1 provider

Qwen3 Embedding 8B

Alibaba

Embedding

#1 on MTEB multilingual leaderboard. Best open-source embedding model. Apache 2.0.

8B Open

1 provider

SDXL 1.0

Stability AI

Image Gen

Stable Diffusion XL: a widely adopted open-weight image generation model.

Open

2 providers

Stable Diffusion 3.5 Large

Stability AI

Image Gen

Stability AI's largest SD3 model with best quality. Open weight.

Open

1 provider

Stable Diffusion 3.5 Large Turbo

Stability AI

Image Gen

Distilled SD3 Large for faster generation with minimal quality loss. Open weight.

Open

No providers yet

Stable Diffusion 3.5 Medium

Stability AI

Image Gen

Balanced SD3 model for quality and speed. Open weight.

Open

2 providers

Voxtral TTS

Mistral AI

TTS

Open-weight 4B TTS model. 9 languages, ~90 ms time to first audio (TTFA), voice cloning from a 3-second reference. CC BY-NC 4.0 licensed.

4B Open

No providers yet

Wan 2.2

Alibaba

Video Gen

Top open-source video model with MoE architecture. Trained on 1.5B videos and 10B images.

Open

6 providers

Wan 2.5

Alibaba

Video Gen

Previous-generation Wan video model. 720p generation is 30% faster than in Wan 2.2.

Open

5 providers

Wan 2.6

Alibaba

Video Gen

Updated Wan video model with improved quality and speed.

Open

6 providers

Wan 2.7 Video

Alibaba

Video Gen

Latest Alibaba Wan video model with editing, extending, and reference-to-video capabilities.

Open

3 providers

Whisper Large V3

OpenAI

STT

Open-weight speech recognition supporting 50+ languages. Handles accents, noise, and technical language.

1.5B Open

5 providers