Pro Users

Pro subscribers automatically receive a one-time $10 API credit upon upgrading to Pro – double the credit amount compared to competitors. This credit provides capacity for testing and small applications, with seamless pathways to scale via VVV staking or direct USD payments for larger implementations.

Paid access to the Venice API can be obtained in two ways:

1

Purchased API Credits

Users can purchase API credits via the API Dashboard.

2

Stake VVV

Users can stake VVV which in return, provides you proportional access to Venice’s compute pool in “Venice Compute Units” or VCUs. The more you stake, the higher your VCU allocation, and they renew daily. You also earn staking rewards while staked. Visit the Token Dashboard to stake VVV and to see how much VCU you control.

Model Pricing

Chat Models

Chat models are priced per million tokens, with separate pricing for input and output tokens. While the price is per million tokens, you will only be charged for the tokens you use. You can estimate the token count of a chat request using this calculator.

ModelInput Tokens (per M.)Input Tokens (per M.)Output Tokens (per M.)Output Tokens (per M.)
Venice Small (Qwen 3 4B)
Llama 3.2 3B
BGE 3 Embeddings
1.5 VCU$0.156 VCU$0.60
Venice Medium (Mistral Small 3.1 24B)
Venice Uncensored
Qwen 2.5 Coder 32B
Qwen 2.5 QWQ 32B
5 VCU$0.5020 VCU$2.00
Llama 3.3 70B
Dolphin 72B
Qwen 2.5 VL 72B
7 VCU$0.7028 VCU$2.80
Venice Large (Qwen 3 235B)
Llama 3.1 405B
15 VCU$1.5060 VCU$6.00
DeepSeek R1 671B
35 VCU$3.50140 VCU$14.00

Image Models

Venice Image models are currently priced at the following rates:

ModelVCU PricingUSD Pricing
Generation0.1 VCU$0.01 USD
Upscale / Enhance (2x)0.2 VCU$0.02 USD
Upscale / Enhance (4x)0.8 VCU$0.08 USD

Audio Models

All Venice Audio models are currently priced at the following rates:

ModelInput Characters (per M.)Input Characters (per M.)
All35 VCU$3.50 USD