Directory

AI Inference API Providers

Compare AI inference API providers side by side — pricing, supported models, features, and free tiers. Whether you need the cheapest LLM API, the fastest image generation endpoint, or a provider with OpenAI-compatible routing, find the right fit below.

Covering 18+ providers including OpenRouter, Together AI, Fireworks AI, fal.ai, Replicate, DeepInfra, Groq, and more. Filter by type, category, or browse the full directory.

AM

Amazon Bedrock

Platform Featured

Fully managed AWS service providing foundation models from Anthropic, Meta, Mistral, Cohere, and more. OpenAI-compatible API with enterprise-grade security and compliance.

12 models
Up to $200 in AWS free tier credits for new accounts free
OpenAI Compat Streaming Finetuning Embeddings Functions Vision SOC2 HIPAA GDPR Free Tier
AN

Anthropic

Proprietary Featured

Official Claude API. Direct access to Claude Opus, Sonnet, and Haiku models.

6 models
Streaming Functions Vision
DE

DeepInfra

Serverless Featured

Serverless inference for open-source LLMs and generative models. Pay-per-token with fast cold starts.

13 models
OpenAI Compat Streaming Finetuning Embeddings Vision Free Tier
GO

Google

Proprietary Featured

Official Gemini API via Google AI Studio and Vertex AI. Direct access to Gemini, Imagen, and Gemma models.

19 models
Streaming Embeddings Functions Vision Free Tier
GR

Groq

Serverless Featured

Fastest LLM inference powered by custom LPU chips. OpenAI-compatible API with sub-second latency.

4 models
OpenAI Compat Streaming Functions Vision Free Tier
KI

KIE AI

Aggregator Featured

Affordable AI API aggregator offering 259+ models across chat, image, video, and music at discounted prices.

55 models
OpenAI Compat Streaming Vision
MI

Mistral AI

Proprietary Featured

Official Mistral API. Direct access to Mistral Large, Small, and Ministral models. EU data residency available.

3 models
OpenAI Compat Streaming Functions Vision Free Tier
MU

Muapi

Aggregator Featured

AI API aggregator with 315+ model endpoints across text, image, video, and audio at competitive prices.

75 models
OpenAI Compat Streaming Vision
NO

Novita AI

Aggregator Featured

Budget AI inference platform with broad model catalog across LLM, image, video, and audio. Very competitive per-token pricing.

41 models
OpenAI Compat Streaming Vision Free Tier
OP

OpenAI

Proprietary Featured

Official OpenAI API. Direct access to GPT, DALL-E, Whisper, and embedding models.

18 models
OpenAI Compat Streaming Embeddings Functions Vision Free Tier
OP

OpenRouter

Aggregator Featured

Unified API for 300+ LLMs from OpenAI, Anthropic, Google, Meta, and more. Routes to the best provider automatically.

58 models
OpenAI Compat Streaming Functions Vision Free Tier
RE

Replicate

Serverless Featured

Run and deploy machine learning models with a cloud API. Pay-per-use with serverless GPU infrastructure.

67 models
Streaming Finetuning Vision
SI

SiliconFlow

Serverless Featured

Fast and affordable AI inference platform. 2.3x faster speeds and 32% lower latency than major cloud platforms. Supports LLM, image, video, and audio models.

20 models
OpenAI Compat Streaming Vision Free Tier
TO

Together AI

Serverless Featured

Serverless and dedicated inference for open-source LLMs, image, video, and audio models. GPU clusters available.

9 models
OpenAI Compat Streaming Finetuning Embeddings Vision Free Tier
AI

AIMLAPI

Aggregator

Unified API for 400+ AI models across text, image, video, and audio. OpenAI-compatible with serverless inference.

25 models
OpenAI Compat Streaming Embeddings Vision Free Tier
AT

Atlas Cloud

Aggregator

Full-modal AI inference platform with 300+ models. Smart routing to cheapest servers with transparent pay-as-you-go pricing.

10 models
OpenAI Compat Streaming Vision Free Tier
CL

Cloudflare Workers AI

Serverless

Edge AI inference across 200+ cities worldwide. Serverless, pay-per-use with OpenAI-compatible API.

12 models
OpenAI Compat Streaming Embeddings Vision Free Tier
WA

WaveSpeed AI

Serverless

Fast AI inference platform specializing in image and video generation with serverless GPU infrastructure.

16 models
OpenAI Compat Streaming Vision