PA

Parasail

Serverless

Distributed AI inference network with serverless, dedicated, and batch endpoints. No rate limits, no contracts, up to 30x cheaper than legacy cloud.

Features

OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode

Compliance

SOC 2
HIPAA
GDPR

Pricing Model

per token
Free tier: Free credits on signup

Details

Models 13
API Base https://api.parasail.io/v1
LLM Inference

Model Catalog (13)

Model Type Input $/1M Output $/1M Context Speed Status
DeepSeek V3
DeepSeek · 671B MoE
llm $0.280 $0.450
DeepSeek V3.2
DeepSeek · 671B MoE (37B active)
llm $0.280 $0.450
GLM 4.7
Zhipu AI
llm $0.450 $2.10
GLM 5
Zhipu AI · 744B
llm $1.00 $3.20
GLM 5.1
Z.ai · 744B MoE (40B active)
llm $1.40 $4.40
GPT OSS 120B
OpenAI · 117B MoE (5.1B active)
llm $0.100 $0.750
Gemma 3 27B
Google · 27B
llm $0.080 $0.450
Gemma 4 27B
Google · 27B
llm $0.130 $0.400
Kimi K2.5
Moonshot AI
llm $0.600 $2.80
Llama 3.3 70B
Meta · 70B
llm $0.220 $0.500
Llama 4 Maverick
Meta · 400B (17B active)
llm $0.350 $1.00
Mistral Small 4
Mistral AI · 119B
llm $0.090 $0.600
Qwen 3 235B
Alibaba · 235B MoE
llm $0.100 $0.600