These rate limits apply to users who have purchased USD credits or staked VVV to gain VCU.

We will continue to monitor usage. As we add compute capacity to the network, we will review these limits. If you are consistently hitting rate limits, please contact support@venice.ai or post in the #API channel in Discord for assistance and we can work with you to raise your limits.

ModelModel IDReq / MinReq / DayTokens / Min
Llama 3.2 3Bllama-3.2-3b500288,0001,000,000
Deepseek Coder V2deepseek-coder-v2-lite7554,000750,000
Qwen 2.5 Coder 32Bqwen-2.5-coder-32b7554,000750,000
Qwen 2.5 QWQ 32Bqwen-2.5-qwq-32b7554,000750,000
Dolphin 72Bdolphin-2.9.2-qwen2-72b5036,000750,000
Llama 3.3 70Bllama-3.3-70b5036,000750,000
Mistral Small 3.1 24Bmistral-31-24b5036,000750,000
Qwen 2.5 VL 72Bqwen-2.5-vl5036,000750,000
Llama 4 Maverick 17B (402B Total Params)llama-4-maverick-17b5036,000750,000
Llama 3.1 405Bllama-3.1-405b2015,000750,000
Deepseek R1 671Bdeepseek-r1-671b1510,000200,000

ModelModel IDReq / MinReq / Day
Fluxflux-dev / flux-dev-uncensored2014,400
All othersAll2028,800

ModelModel IDReq / MinReq / Day
All Audio ModelsAll6086,400

Rate Limit and Consumption Headers

You can monitor your API utilization and remaining requests by evaluating the following headers:

HeaderDescription
x-ratelimit-limit-requests
The number of requests you’ve made in the current evaluation period.
x-ratelimit-remaining-requests
The remaining requests you can make in the current evaluation period.
x-ratelimit-reset-requests
The unix time stamp when the rate limit will reset.
x-ratelimit-limit-tokens
The number of total (prompt + completion) tokens used within a 1 minute sliding window.
x-ratelimit-remaining-tokens
The remaining number of total tokens that can be used during the evaluation period.
x-ratelimit-reset-tokens
The duration of time in seconds until the token rate limit resets.
x-venice-balance-vcu
The user’s VCU balance before the request has been processed.
x-venice-balance-usd
The user’s USD balance before the request has been processed.