AI Inference API Providers
Compare AI inference API providers side by side — pricing, supported models, features, and free tiers. Whether you need the cheapest LLM API, the fastest image generation endpoint, or a provider with OpenAI-compatible routing, find the right fit below.
Covering 7+ providers including OpenRouter, Together AI, Fireworks AI, fal.ai, Replicate, DeepInfra, Groq, and more. Filter by type, category, or browse the full directory.
Amazon Bedrock
Fully managed AWS service providing foundation models from Anthropic, Meta, Mistral, Cohere, and more. OpenAI-compatible API with enterprise-grade security and compliance.
Official Gemini API via Google AI Studio and Vertex AI. Direct access to Gemini, Imagen, and Gemma models.
Mistral AI
Official Mistral API. Direct access to Mistral Large, Small, and Ministral models. EU data residency available.
OpenAI
Official OpenAI API. Direct access to GPT, DALL-E, Whisper, and embedding models.
Together AI
Serverless and dedicated inference for open-source LLMs, image, video, and audio models. GPU clusters available.
AIMLAPI
Unified API for 400+ AI models across text, image, video, and audio. OpenAI-compatible with serverless inference.
Cloudflare Workers AI
Edge AI inference across 200+ cities worldwide. Serverless, pay-per-use with OpenAI-compatible API.