CL

Cloudflare Workers AI

Serverless

Edge AI inference across 200+ cities worldwide. Serverless, pay-per-use with OpenAI-compatible API.

Features

OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode

Compliance

SOC 2
HIPAA
GDPR

Pricing Model

per token
Free tier: Available

Details

Models 15
API Base https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run
Audio / MusicEmbeddingsImage GenerationLLM Inference

Model Catalog (15)

Model Type Input $/1M Output $/1M Context Speed Status
BGE-M3
BAAI
embedding $0.012
Flux 1 Schnell
Black Forest Labs
image gen $0.0030/img
Flux 2 Klein
Black Forest Labs
image gen $0.015/img
GLM 4.7
Zhipu AI
llm $0.060 $0.400
Gemma 3 12B
Google · 12B
llm $0.345 $0.556
Kimi K2.5
Moonshot AI · 1T MoE (32B active)
llm $0.600 $3.00
Kimi K2.6
Moonshot AI · 1T MoE (32B active)
llm $0.950 $4.00 262k
Kimi K2.7 Code
Moonshot AI · 1T MoE (32B active)
code $0.950 $4.00 262k
Leonardo Phoenix
Leonardo AI
image gen $0.0060/img
Llama 3.1 8B Instruct
Meta
llm $0.045 $0.384
Llama 3.3 70B
Meta · 70B
llm $0.293 $2.25
Llama 4 Scout
Meta · 109B (17B active)
llm $0.270 $0.850
Qwen 3 8B
Alibaba · 8B
llm $0.051 $0.335
Qwen3 Embedding 0.6B
Alibaba · 0.6B
embedding $0.012
Whisper Large V3
OpenAI · 1.5B
speech to_text $0.0000/sec