Edge AI inference across 200+ cities worldwide. Serverless and pay-per-use, with an OpenAI-compatible API.
Features
OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode
Compliance
SOC 2
HIPAA
GDPR
Pricing Model
Per token
Free tier: Available
Details
Models 12
API Base
https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run
Audio / Music, Embeddings, Image Generation, LLM Inference
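
As a rough illustration of how the API base listed above might be used, the sketch below assembles (but does not send) a POST request. It assumes the model identifier is appended to the base path, that authentication uses a bearer token, and that the body is JSON. The function name `build_run_request`, the placeholder credentials, the model ID `example/model-id`, and the `{"prompt": ...}` payload shape are all hypothetical; consult the provider's documentation for real model identifiers and request schemas.

```python
import json
import urllib.request

# API base as listed on this page; {account_id} is filled in per account.
API_BASE = "https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run"


def build_run_request(account_id: str, api_token: str, model_id: str,
                      payload: dict) -> urllib.request.Request:
    """Build a POST request for a single model run without sending it.

    Assumptions (not confirmed by this listing): the model ID is appended
    to the base path, auth is a bearer token, and the body is JSON.
    """
    url = API_BASE.format(account_id=account_id) + "/" + model_id
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values only; nothing is sent over the network here.
    req = build_run_request("ACCOUNT_ID", "API_TOKEN", "example/model-id",
                            {"prompt": "Hello"})
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would actually send the request; building it separately keeps the URL and headers easy to inspect before anything leaves the machine.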
Model Catalog (12)
| Model | Provider | Params | Type | Input $/1M tokens | Output $/1M tokens | Context | Speed | Status |
|---|---|---|---|---|---|---|---|---|
| BGE-M3 | BAAI | — | embedding | $0.012 | — | — | — | |
| Flux 1 Schnell | Black Forest Labs | — | image gen | $0.0030/img | — | — | — | |
| Flux 2 Klein | Black Forest Labs | — | image gen | $0.015/img | — | — | — | |
| GLM 4.7 | Zhipu AI | — | llm | $0.060 | $0.400 | — | — | |
| Gemma 3 12B | Google | 12B | llm | $0.345 | $0.556 | — | — | |
| Kimi K2.5 | Moonshot AI | — | llm | $0.600 | $3.00 | — | — | |
| Leonardo Phoenix | Leonardo AI | — | image gen | $0.0060/img | — | — | — | |
| Llama 3.3 70B | Meta | 70B | llm | $0.293 | $2.25 | — | — | |
| Llama 4 Scout | Meta | 109B (17B active) | llm | $0.270 | $0.850 | — | — | |
| Qwen 3 8B | Alibaba | 8B | llm | $0.051 | $0.335 | — | — | |
| Qwen3 Embedding 0.6B | Alibaba | 0.6B | embedding | $0.012 | — | — | — | |
| Whisper Large V3 | OpenAI | 1.5B | speech to text | $0.0005/req | — | — | — | |
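
To make the per-token pricing concrete, here is a small sketch that estimates the cost of one LLM request from the input and output prices in the table. The prices are copied from three of the rows above; the function name `estimate_cost` and the token counts in the example are illustrative assumptions, and the sketch ignores any free-tier allowance.

```python
# USD per 1M tokens as (input, output), copied from the catalog table.
PRICES_PER_MILLION = {
    "Llama 3.3 70B": (0.293, 2.25),
    "Qwen 3 8B": (0.051, 0.335),
    "Kimi K2.5": (0.600, 3.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request.

    cost = (input_tokens * input_price + output_tokens * output_price) / 1e6
    """
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
    cost = estimate_cost("Llama 3.3 70B", 2_000, 500)
    print(f"${cost:.6f}")
```

For the hypothetical request shown, the arithmetic is (2,000 × 0.293 + 500 × 2.25) / 1,000,000, which comes to about $0.0017. Output tokens dominate the cost here because the output price is several times the input price.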