Skip to main content

Pro Users

Pro subscribers receive a one-time $10 API credit when upgrading to Pro. Use it to test and build small apps. You can scale your usage by buying credits, buying Diem, or staking VVV. Choose how you pay for API usage:
1

Buy API Credits

Pay in USD via the API Dashboard. Credits are applied to usage automatically.
2

Buy Diem (1 Diem = $1/day)

Purchase Diem directly. Each Diem grants $1 of compute per day at the same rates as USD.
3

Stake to Earn Diem (1 Diem = $1/day)

Stake tokens to receive daily Diem allocations (each Diem grants $1 of compute per day). Manage staking and Diem at the Token Dashboard.

Model Pricing

All prices are in USD. Diem users pay the same rates (1 Diem = $1 of compute per day).

Chat Models

Prices per 1M tokens, with separate pricing for input and output tokens. You will only be charged for the tokens you use. You can estimate the token count of a chat request using this calculator.
ModelModel IDInputOutputCapabilities
Venice Smallqwen3-4b$0.05$0.15Function Calling, Reasoning
Llama 3.2 3Bllama-3.2-3b$0.15$0.60Function Calling
Venice Uncensoredvenice-uncensored$0.20$0.90Uncensored
Venice Medium (3.1)mistral-31-24b$0.50$2.00Function Calling, Vision
Llama 3.3 70Bllama-3.3-70b$0.70$2.80Function Calling
Venice Largeqwen3-235b$0.90$4.50Function Calling, Reasoning

Beta Chat Models

ModelModel IDInputOutputCapabilities
Qwen 3 Next 80B (beta)qwen3-next-80b$0.35$1.90Function Calling
Qwen 3 Coder 480B (beta)qwen3-coder-480b-a35b-instruct$0.75$3.00Function Calling
Hermes 3 Llama 3.1 405B (beta)hermes-3-llama-3.1-405b$1.10$3.00High Intelligence
GLM 4.6 (beta)zai-org-glm-4.6$0.85$2.75Function Calling
Beta models are experimental and not recommended for production use. These models may be changed, removed, or replaced at any time without notice. Learn more about beta models

Web Search and Scraping

Web Search and Web Scraping features run on dedicated compute infrastructure designed for large-scale crawling and real-time content extraction. These features are usage-based and charged per API call when enabled:
FeatureVenice ModelsOther ModelsParameters
Web Search$10 / 1K calls$25 / 1K callsenable_web_search: true
Web Scraping$10 / 1K calls$25 / 1K callsenable_web_scraping: true
Venice Models: venice-uncensored, qwen3-4b, mistral-31-24b, qwen3-235b
Web Scraping automatically detects up to 3 URLs per message, scrapes and converts content into structured markdown, and adds the extracted text into model context. These charges apply in addition to standard model token pricing.

Embedding Models

Prices per 1M tokens:
ModelModel IDInputOutput
BGE-M3text-embedding-bge-m3$0.15$0.60

Image Models

Image models are priced per generation:
ModelPrice
Generation$0.01
Upscale / Enhance (2x)$0.02
Upscale / Enhance (4x)$0.08
Edit (aka Inpaint)$0.04

Audio Models

Prices per 1M characters:
ModelModel IDPrice
Kokoro TTStts-kokoro$3.50