Parasail

Serverless

Distributed AI inference network with serverless, dedicated, and batch endpoints. No rate limits, no contracts, up to 30x cheaper than legacy cloud.

OpenAI Compatible

Streaming

Batching

Fine-tuning

Embeddings

Vision

Audio

Function Calling

JSON Mode

SOC 2

HIPAA

GDPR

per token

Free tier: Free credits on signup

Models 22

API Base https://api.parasail.io/v1

LLM Inference

Model Catalog (22)

Model	Type	Input $/1M	Output $/1M	Context	Speed
DeepSeek V3 DeepSeek · 671B MoE	llm	$0.280	$0.450	—	—
DeepSeek V3.2 DeepSeek · 671B MoE (37B active)	llm	$0.280	$0.450	—	—
DeepSeek V4 Flash DeepSeek	llm	$0.140	$0.280	—	—
DeepSeek V4 Pro DeepSeek	llm	$1.74	$3.48	—	—
GLM 4.7 Zhipu AI	llm	$0.450	$2.10	—	—
GLM 5 Zhipu AI · 744B	llm	$1.00	$3.20	—	—
GLM 5.1 Z.ai · 744B MoE (40B active)	llm	$1.40	$4.40	—	—
GLM 5.2 Z.ai	llm	$1.40	$4.40	—	—
GPT OSS 120B OpenAI · 117B MoE (5.1B active)	llm	$0.100	$0.750	—	—
Gemma 3 27B Google · 27B	llm	$0.080	$0.450	—	—
Gemma 4 27B Google · 27B	llm	$0.130	$0.400	—	—
Kimi K2.5 Moonshot AI · 1T MoE (32B active)	llm	$0.600	$2.80	—	—
Kimi K2.6 Moonshot AI · 1T MoE (32B active)	llm	$0.750	$3.50	—	—
Kimi K2.7 Code Moonshot AI · 1T MoE (32B active)	code	$0.750	$3.50	262k	—
Llama 3.3 70B Meta · 70B	llm	$0.220	$0.500	—	—
Llama 4 Maverick Meta · 400B (17B active)	llm	$0.350	$1.00	—	—
MiniMax M3 MiniMax	llm	$0.300	$1.20	—	—
Mistral Small 4 Mistral AI · 119B	llm	$0.090	$0.300	—	—
Qwen 3 235B Alibaba · 235B MoE	llm	$0.140	$0.800	—	—
Qwen3 Coder Next Alibaba	code	$0.120	$0.800	—	—
Qwen3.5 397B A17B Alibaba	llm	$0.500	$3.60	—	—
gpt-oss-20b OpenAI	llm	$0.040	$0.200	—	—