Text Models

Model NameModel IDPrice (in/out)Context LimitCapabilitiesTraits
Venice Uncensored 1.1venice-uncensored$0.50 / $2.0032,768
Venice Reasoningqwen-2.5-qwq-32b$0.50 / $2.0032,768Reasoning
Venice Smallqwen3-4b$0.15 / $0.6040,960Function Calling, Reasoning
Venice Medium (3.2 beta)mistral-32-24b$0.50 / $2.00131,072Function Calling, Vision
Venice Medium (3.1)mistral-31-24b$0.50 / $2.00131,072Function Calling, Visiondefault_vision
Venice Large 1.1qwen3-235b$1.50 / $6.00131,072Function Calling, Reasoning
Llama 3.2 3Bllama-3.2-3b$0.15 / $0.60131,072Function Callingfastest
Llama 3.3 70Bllama-3.3-70b$0.70 / $2.8065,536Function Callingdefault
Llama 3.1 405B (D)llama-3.1-405b$1.50 / $6.0065,536most_intelligent
Dolphin 72B (D)dolphin-2.9.2-qwen2-72b$0.70 / $2.8032,768most_uncensored
Qwen 2.5 VL 72B (D)qwen-2.5-vl$0.70 / $2.8032,768Vision
Qwen 2.5 Coder 32B (D)qwen-2.5-coder-32b$0.50 / $2.0032,768default_code
DeepSeek R1 671B (D)deepseek-r1-671b$3.50 / $14.00131,072Reasoningdefault_reasoning
DeepSeek Coder V2 Litedeepseek-coder-v2-lite$0.50 / $2.00131,072
Pricing is per 1M tokens (input / output). Models with reasoning capabilities support advanced reasoning via thinking mode. qwen3-235b Venice Large 1.1 - Most powerful flagship model
mistral-31-24b Venice Medium (3.1) - Vision + function calling
qwen3-4b Venice Small - Fast, affordable for most tasks
llama-3.3-70b Llama 3.3 70B - Balanced high-performance model

Text Model Categories

Reasoning Models qwen3-235b Venice Large 1.1 - Advanced reasoning capabilities
qwen3-4b Venice Small - Efficient reasoning model
Vision-Capable Models mistral-31-24b Venice Medium (3.1) - Vision-capable model Cost-Optimized Models qwen3-4b Venice Small - Best balance of speed and cost
llama-3.2-3b Llama 3.2 3B - Fastest for simple tasks
Uncensored Models venice-uncensored Venice Uncensored 1.1 - No content filtering High-Intelligence Models llama-3.3-70b Llama 3.3 70B - Balanced high-intelligence
qwen3-235b Venice Large 1.1 - Most powerful flagship model

Image Models

Model NameModel IDPriceModel SourceTraits
Venice SD35venice-sd35$0.01Stable Diffusion 3.5 Largedefault, eliza-default
HiDreamhidream$0.01HiDream I1 Dev
Qwen Imageqwen-image$0.01Qwen Image
FLUX Standard (D)flux-dev$0.01FLUX.1 Devhighest_quality
FLUX Custom (D)flux-dev-uncensored$0.01FLUX.1 Dev
Lustify SDXLlustify-sdxl$0.01Lustify SDXL
Pony Realism (D)pony-realism$0.01Pony Realismmost_uncensored
Stable Diffusion 3.5 (D)stable-diffusion-3.5$0.01Stable Diffusion 3.5 Large
Anime (WAI)wai-Illustrious$0.01WAI-Illustrious
qwen-image Qwen Image - Highest quality image generation
venice-sd35 Venice SD35 - Default choice with Eliza integration
lustify-sdxl Lustify SDXL - Uncensored image generation
hidream HiDream - Production-ready generation

Image Model Categories

High-Quality Models qwen-image Qwen Image - Highest quality output
hidream HiDream - Production-ready generation
Default Models venice-sd35 Venice SD35 - Default choice, Eliza-optimized Uncensored Models lustify-sdxl Lustify SDXL - Adult content generation
wai-Illustrious Anime (WAI) - Best for anime/wai NSFW capable

Audio Models

Text-to-Speech Models

tts-kokoro Kokoro TTS - 60+ multilingual voices for natural speech
Model NameModel IDPriceVoices AvailableModel Source
Kokoro Text to Speechtts-kokoro$3.50 per 1M chars60+ voicesKokoro-82M
The tts-kokoro model supports a wide range of multilingual and stylistic voices (including af_nova, am_liam, bf_emma, zf_xiaobei, and jm_kumo). Voice is selected using the voice parameter in the request payload.

Embedding Models

text-embedding-bge-m3 BGE-M3 - Versatile embedding model for text similarity
Model NameModel IDPriceModel Source
BGE-M3text-embedding-bge-m3$0.15 / $0.60 per 1K tokensKimChen/bge-m3-GGUF

Image Processing Models

upscaler Image Upscaler - Enhance image resolution up to 4x
flux-kontext-dev Flux Kontext DEV - Multimodal image editing model

Image Upscaler

Model NameModel IDPriceUpscale Options
Upscalerupscaler$0.012x ($0.02), 4x ($0.08)

Image Editing (Inpaint)

Model NameModel IDPriceModel SourceTraits
Flux Kontext DEVflux-kontext-dev$0.04Flux Kontextspecialized_editing

Model Features

  • Vision: Ability to process and understand images
  • Reasoning: Advanced logical reasoning capabilities
  • Function Calling: Support for calling external functions and tools
  • Traits: Special characteristics or optimizations (e.g., fastest, most_intelligent, most_uncensored)

Usage Notes

  • Input pricing refers to tokens sent to the model
  • Output pricing refers to tokens generated by the model
  • Context limits define the maximum number of tokens the model can process in a single request
  • (D) Scheduled for deprecation. For timelines and migration guidance, see the Deprecation Tracker.