Privacy-focused AI inference with no logging. Supports open-source and proprietary models including Claude, GPT, Grok, and open-weight LLMs.
Features
OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode
Compliance
SOC 2
HIPAA
GDPR
Pricing Model
per tokenDetails
Models 15
API Base
https://api.venice.ai/api/v1 LLM Inference
Model Catalog (15)
| Model | Type | Input $/1M | Output $/1M | Context | Speed | Status |
|---|---|---|---|---|---|---|
| Claude 4.6 Opus Anthropic | llm | $6.00 | $30.00 | — | — | |
| Claude 4.6 Sonnet Anthropic | llm | $3.60 | $18.00 | — | — | |
| DeepSeek V3.2 DeepSeek · 671B MoE (37B active) | llm | $0.330 | $0.480 | — | — | |
| GLM 4.6 Zhipu AI · 355B | llm | $0.850 | $2.75 | — | — | |
| GLM 4.7 Zhipu AI | llm | $0.550 | $2.65 | — | — | |
| GLM 5 Zhipu AI · 744B | llm | $1.00 | $3.20 | — | — | |
| GPT OSS 120B OpenAI · 117B MoE (5.1B active) | llm | $0.070 | $0.300 | — | — | |
| GPT-4o OpenAI | llm | $3.13 | $12.50 | — | — | |
| GPT-4o Mini OpenAI | llm | $0.190 | $0.750 | — | — | |
| Gemma 3 27B Google · 27B | llm | $0.120 | $0.200 | — | — | |
| Grok 3 Mini xAI | llm | $0.250 | $0.630 | — | — | |
| Kimi K2.5 Moonshot AI | llm | $0.560 | $3.50 | — | — | |
| Llama 3.3 70B Meta · 70B | llm | $0.700 | $2.80 | — | — | |
| Mistral Small 4 Mistral AI · 119B | llm | $0.090 | $0.250 | — | — | |
| Qwen 3 235B Alibaba · 235B MoE | llm | $0.150 | $0.750 | — | — |