Distributed AI inference network with serverless, dedicated, and batch endpoints. No rate limits, no contracts, up to 30x cheaper than legacy cloud.
Features
OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode
Compliance
SOC 2
HIPAA
GDPR
Pricing Model
per token Free tier: Free credits on signup
Details
Models 13
API Base
https://api.parasail.io/v1 LLM Inference
Model Catalog (13)
| Model | Type | Input $/1M | Output $/1M | Context | Speed | Status |
|---|---|---|---|---|---|---|
| DeepSeek V3 DeepSeek · 671B MoE | llm | $0.280 | $0.450 | — | — | |
| DeepSeek V3.2 DeepSeek · 671B MoE (37B active) | llm | $0.280 | $0.450 | — | — | |
| GLM 4.7 Zhipu AI | llm | $0.450 | $2.10 | — | — | |
| GLM 5 Zhipu AI · 744B | llm | $1.00 | $3.20 | — | — | |
| GLM 5.1 Z.ai · 744B MoE (40B active) | llm | $1.40 | $4.40 | — | — | |
| GPT OSS 120B OpenAI · 117B MoE (5.1B active) | llm | $0.100 | $0.750 | — | — | |
| Gemma 3 27B Google · 27B | llm | $0.080 | $0.450 | — | — | |
| Gemma 4 27B Google · 27B | llm | $0.130 | $0.400 | — | — | |
| Kimi K2.5 Moonshot AI | llm | $0.600 | $2.80 | — | — | |
| Llama 3.3 70B Meta · 70B | llm | $0.220 | $0.500 | — | — | |
| Llama 4 Maverick Meta · 400B (17B active) | llm | $0.350 | $1.00 | — | — | |
| Mistral Small 4 Mistral AI · 119B | llm | $0.090 | $0.600 | — | — | |
| Qwen 3 235B Alibaba · 235B MoE | llm | $0.100 | $0.600 | — | — |