VE

Venice AI

Serverless

Privacy-focused AI inference with no logging. Supports open-source and proprietary models including Claude, GPT, Grok, and open-weight LLMs.

Features

OpenAI Compatible
Streaming
Batching
Fine-tuning
Embeddings
Vision
Audio
Function Calling
JSON Mode

Compliance

SOC 2
HIPAA
GDPR

Pricing Model

per token

Details

Models 15
API Base https://api.venice.ai/api/v1
LLM Inference

Model Catalog (15)

Model Type Input $/1M Output $/1M Context Speed Status
Claude 4.6 Opus
Anthropic
llm $6.00 $30.00
Claude 4.6 Sonnet
Anthropic
llm $3.60 $18.00
DeepSeek V3.2
DeepSeek · 671B MoE (37B active)
llm $0.330 $0.480
GLM 4.6
Zhipu AI · 355B
llm $0.850 $2.75
GLM 4.7
Zhipu AI
llm $0.550 $2.65
GLM 5
Zhipu AI · 744B
llm $1.00 $3.20
GPT OSS 120B
OpenAI · 117B MoE (5.1B active)
llm $0.070 $0.300
GPT-4o
OpenAI
llm $3.13 $12.50
GPT-4o Mini
OpenAI
llm $0.190 $0.750
Gemma 3 27B
Google · 27B
llm $0.120 $0.200
Grok 3 Mini
xAI
llm $0.250 $0.630
Kimi K2.5
Moonshot AI
llm $0.560 $3.50
Llama 3.3 70B
Meta · 70B
llm $0.700 $2.80
Mistral Small 4
Mistral AI · 119B
llm $0.090 $0.250
Qwen 3 235B
Alibaba · 235B MoE
llm $0.150 $0.750