CL

Cloudflare Workers AI

Serverless

Edge AI inference across 200+ cities worldwide. Serverless, pay-per-use with OpenAI-compatible API.

Features

OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode

Compliance

SOC 2
HIPAA
GDPR

Pricing Model

per token
Free tier: Available

Details

Models 12
API Base https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run
Audio / MusicEmbeddingsImage GenerationLLM Inference

Model Catalog (12)

Model Type Input $/1M Output $/1M Context Speed Status
BGE-M3
BAAI
embedding $0.012
Flux 1 Schnell
Black Forest Labs
image gen $0.0030/img
Flux 2 Klein
Black Forest Labs
image gen $0.015/img
GLM 4.7
Zhipu AI
llm $0.060 $0.400
Gemma 3 12B
Google · 12B
llm $0.345 $0.556
Kimi K2.5
Moonshot AI
llm $0.600 $3.00
Leonardo Phoenix
Leonardo AI
image gen $0.0060/img
Llama 3.3 70B
Meta · 70B
llm $0.293 $2.25
Llama 4 Scout
Meta · 109B (17B active)
llm $0.270 $0.850
Qwen 3 8B
Alibaba · 8B
llm $0.051 $0.335
Qwen3 Embedding 0.6B
Alibaba · 0.6B
embedding $0.012
Whisper Large V3
OpenAI · 1.5B
speech to_text $0.0005/req