Setup
Chat Models
UseChatOpenAI with Venice’s base URL:
Streaming
Embeddings
Chains
Simple Chain with Prompt Template
Sequential Chain
RAG Pipeline
Build a retrieval-augmented generation pipeline with Venice:Function Calling with Agents
Structured Output
Web Search Integration
Use Venice’s built-in web search viavenice_parameters:
Recommended Models for LangChain
| Use Case | Model | Why |
|---|---|---|
| General chains | venice-uncensored | Fast, cheap, uncensored |
| Complex reasoning | zai-org-glm-4.7 | Best private flagship model |
| Function calling | zai-org-glm-4.7 | Reliable tool use |
| Vision + text | qwen3-vl-235b-a22b | Advanced vision understanding |
| Code generation | qwen3-coder-480b-a35b-instruct | Optimized for code |
| Embeddings (RAG) | text-embedding-bge-m3 | Private embeddings |
| Budget / high-volume | qwen3-4b | $0.05/1M input |
View All Models
Browse all Venice models with pricing and capabilities