DE
DeepInfra
Serverless Featured
Serverless inference for open-source LLMs and generative models. Pay-per-token with fast cold starts.
Features
OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode
Compliance
SOC 2
HIPAA
GDPR
Pricing Model
per token Free tier: Available
Details
Models 19
API Base
https://api.deepinfra.com/v1/openai Audio / MusicImage GenerationLLM Inference