Fast serverless and dedicated AI inference. Korean provider with competitive pricing on open-source models and prompt caching support.
Features
OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode
Compliance
SOC 2
HIPAA
GDPR
Pricing Model
per token Free tier: Up to 50K inference credit for new users
Details
Models 7
API Base
https://api.friendli.ai/v1 Audio / MusicLLM Inference
Model Catalog (7)
| Model | Type | Input $/1M | Output $/1M | Context | Speed | Status |
|---|---|---|---|---|---|---|
| DeepSeek V3.2 DeepSeek · 671B MoE (37B active) | llm | $0.500 | $1.50 | — | — | |
| GLM 4.7 Zhipu AI | llm | $0.600 | $2.20 | — | — | |
| GLM 5 Zhipu AI · 744B | llm | $1.00 | $3.20 | — | — | |
| GLM 5.1 Z.ai · 744B MoE (40B active) | llm | $1.40 | $4.40 | — | — | |
| Kimi K2.5 Moonshot AI | llm | $0.600 | $3.00 | — | — | |
| Llama 3.3 70B Meta · 70B | llm | $0.600 | $0.600 | — | — | |
| Qwen 3 235B Alibaba · 235B MoE | llm | $0.200 | $0.800 | — | — |