Providers
Compare inference API providers by features, pricing model, and supported models.
Google
Proprietary Featured
Official Gemini API via Google AI Studio and Vertex AI. Direct access to Gemini, Imagen, and Gemma models.
19 models
Streaming, Embeddings, Functions, Vision, Free Tier
Mistral AI
Proprietary Featured
Official Mistral API. Direct access to Mistral Large, Small, and Ministral models. EU data residency available.
3 models
OpenAI Compat, Streaming, Functions, Vision, Free Tier
OpenAI
Proprietary Featured
Official OpenAI API. Direct access to GPT, DALL-E, Whisper, and embedding models.
18 models
OpenAI Compat, Streaming, Embeddings, Functions, Vision, Free Tier
Together AI
Serverless Featured
Serverless and dedicated inference for open-source LLMs, image, video, and audio models. GPU clusters available.
9 models
OpenAI Compat, Streaming, Finetuning, Embeddings, Vision, Free Tier
AIMLAPI
Aggregator
Unified API for 400+ AI models across text, image, video, and audio. OpenAI-compatible with serverless inference.
25 models
OpenAI Compat, Streaming, Embeddings, Vision, Free Tier
Cloudflare Workers AI
Serverless
Edge AI inference across 200+ cities worldwide. Serverless and pay-per-use, with an OpenAI-compatible API.
12 models
OpenAI Compat, Streaming, Embeddings, Vision, Free Tier