Fast serverless inference for open-source models with per-token pricing, fine-tuning, and on-demand deployments.
Features
OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode
Compliance
SOC 2
HIPAA
GDPR
Pricing Model
per token Free tier: $1 in free credits
Details
Models 24
API Base
https://api.fireworks.ai/inference/v1 Audio / MusicImage GenerationLLM Inference