> ## Documentation Index
> Fetch the complete documentation index at: https://docs.venice.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Venice API

> Venice API documentation: private, unrestricted access to OpenAI-compatible chat, image, audio, and video models behind one API key.

The API for private, unrestricted access to intelligence.

OpenAI-compatible chat, image, audio, and video behind one API key.

Get an API key → Get started

```bash curl
curl https://api.venice.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "zai-org-glm-5-1",
    "messages": [{"role": "user", "content": "Build without permission."}]
  }'
```

```ts TypeScript
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.VENICE_API_KEY,
  baseURL: "https://api.venice.ai/api/v1",
});

const res = await client.chat.completions.create({
  model: "zai-org-glm-5-1",
  messages: [{ role: "user", content: "Build without permission." }],
});
```

```python Python
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["VENICE_API_KEY"],
    base_url="https://api.venice.ai/api/v1",
)

res = client.chat.completions.create(
    model="zai-org-glm-5-1",
    messages=[{"role": "user", "content": "Build without permission."}],
)
```

Endpoints

One API for every modality

Chat, image, audio, video, and embeddings behind one API key.

Chat Completions

OpenAI-compatible chat with reasoning, tool use, and streaming across 100+ text models.

Streaming, Tools, Vision

See reference →

Image Generation

Text-to-image, image-to-image, upscaling, and background removal across photorealistic, stylized, and uncensored models.

Text-to-image, Image-to-image, Upscale

See reference →

Audio

Text-to-speech with 50+ multilingual voices, plus speech-to-text transcription for any audio file.

TTS, Transcription, 50+ voices

See reference →

Video

Text-to-video, image-to-video, and reference-to-video through a sync or async job queue.

Text-to-video, Image-to-video, Reference-to-video

See reference →

Plus embeddings, file inputs, MCP tools, and wallet payments. View all endpoints →
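The Chat Completions card above lists streaming. As a minimal sketch, assuming the endpoint follows the usual OpenAI-compatible streaming convention (`"stream": true` in the request, server-sent `data:` lines in the response, and a final `data: [DONE]`), a stream can be read with only the Python standard library. The helper names `parse_sse_line` and `stream_chat` are illustrative and not part of any Venice SDK.

```python
import json
import os
import urllib.request
from typing import Iterator, Optional

API_URL = "https://api.venice.ai/api/v1/chat/completions"


def parse_sse_line(raw: bytes) -> Optional[str]:
    """Return the text delta carried by one SSE line, or None if it has none."""
    line = raw.decode("utf-8").strip()
    if not line.startswith("data:"):
        return None  # blank separator lines, comments, other SSE fields
    data = line[len("data:"):].strip()
    if data == "[DONE]":
        return None  # assumed end-of-stream marker (OpenAI convention)
    chunk = json.loads(data)
    choices = chunk.get("choices") or []
    if not choices:
        return None
    return (choices[0].get("delta") or {}).get("content")


def stream_chat(prompt: str, model: str = "zai-org-glm-5-1") -> Iterator[str]:
    """Yield text fragments as they arrive from the chat completions endpoint."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }).encode("utf-8")
    request = urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {os.environ['VENICE_API_KEY']}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request) as response:
        for raw in response:  # the response object iterates line by line
            text = parse_sse_line(raw)
            if text:
                yield text


# Only call the API when a key is configured.
if __name__ == "__main__" and os.environ.get("VENICE_API_KEY"):
    for piece in stream_chat("Build without permission."):
        print(piece, end="", flush=True)
    print()
```

Keeping the line parser separate from the network call makes it easy to check against recorded stream output without spending credits.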

Agents

Built for AI agents

Private inference, MCP tools, and wallet-funded workflows for messaging, coding, and onchain agents.

Agent apps

Connect Venice to WhatsApp, Telegram, Discord, and more through OpenClaw, Hermes, and NanoClaw.

See integrations →

Coding agents

Use Claude Code, Cursor, and Codex CLI with Venice models for private coding workflows.

See integrations →

MCP + Skills

Expose chat, image, video, audio, and embeddings as MCP tools or runtime skills.

See integrations →

Explore the AI Agents hub →

Models

Popular models

A few of the most-used models on Venice. Use the ID as your `model` parameter.

Kimi K2.6 (Moonshot AI)

Open-weights frontier reasoning. Strong long-context and tool use at a fraction of frontier prices.

256K context, \$0.85 / \$4.66 per 1M, Private

`kimi-k2-6`

Claude Opus 4.7 (Anthropic)

Best-in-class for coding, planning, and long-horizon agents that need to stay coherent.

1M context, \$6.00 / \$30.00 per 1M, Anonymized

`claude-opus-4-7`

GPT-5.5 (OpenAI)

Frontier general intelligence with 1M context. Strong default for chat, RAG, and multi-step reasoning.

1M context, \$6.25 / \$37.50 per 1M, Anonymized

`openai-gpt-55`

250+ models across text, image, audio, and video. Browse the catalog →
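To compare the listed prices for a given workload, the arithmetic can be sketched as below. This assumes the two figures on each card are the input and output prices in USD per 1M tokens, which the cards do not state explicitly; `estimate_cost` is an illustrative helper, not a Venice API.

```python
from decimal import Decimal

# Prices copied from the model cards above.
# Assumption: (input price, output price) in USD per 1M tokens.
PRICES = {
    "kimi-k2-6": (Decimal("0.85"), Decimal("4.66")),
    "claude-opus-4-7": (Decimal("6.00"), Decimal("30.00")),
    "openai-gpt-55": (Decimal("6.25"), Decimal("37.50")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


# Example: a 200K-token prompt with a 10K-token answer on each model.
for model_id in PRICES:
    print(model_id, estimate_cost(model_id, 200_000, 10_000))
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.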

Tools

Built-in tools for chat models

Turn on web search, attach files, or query a blockchain with `venice_parameters` or a Venice-native endpoint.
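The examples in this section pass options either in a `venice_parameters` object or as a suffix on the model ID (`model:key=value&key=value`). As a rough sketch of the suffix form, the hypothetical helper below builds such a model ID from a dictionary. It assumes every parameter can be written as a `key=value` pair and that booleans are spelled in lowercase, as in the `enable_web_citations=true` example; check the API reference before relying on it for other parameters.

```python
from urllib.parse import urlencode


def with_model_suffix(model: str, params: dict) -> str:
    """Append parameters to a model ID as `model:key=value&key=value`."""
    if not params:
        return model
    # Booleans become lowercase strings, matching `enable_web_citations=true`.
    normalized = {
        key: str(value).lower() if isinstance(value, bool) else str(value)
        for key, value in params.items()
    }
    return f"{model}:{urlencode(normalized)}"


model_id = with_model_suffix(
    "zai-org-glm-5-1",
    {"enable_web_search": "on", "enable_web_citations": True},
)
print(model_id)
# prints zai-org-glm-5-1:enable_web_search=on&enable_web_citations=true
```

The suffix form is convenient when a client library only lets you set the `model` string and offers no way to add extra body fields.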

Add real-time web search with citations to any text model via `enable_web_search`.

```bash Curl
curl https://api.venice.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "zai-org-glm-5-1",
    "messages": [{"role": "user", "content": "What are the latest developments in AI?"}],
    "venice_parameters": {
      "enable_web_search": "auto"
    }
  }'
```

```ts TypeScript
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.VENICE_API_KEY!,
  baseURL: "https://api.venice.ai/api/v1",
});

const completion = await client.chat.completions.create({
  model: "zai-org-glm-5-1",
  messages: [{ role: "user", content: "What are the latest developments in AI?" }],
  // @ts-expect-error - Venice-specific parameter
  venice_parameters: {
    enable_web_search: "auto",
  },
});

console.log(completion.choices[0].message.content);
```

```python Python
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["VENICE_API_KEY"],
    base_url="https://api.venice.ai/api/v1",
)

response = client.chat.completions.create(
    model="zai-org-glm-5-1",
    messages=[{"role": "user", "content": "What are the latest developments in AI?"}],
    extra_body={
        "venice_parameters": {
            "enable_web_search": "auto",
        }
    },
)

print(response.choices[0].message.content)
```

```bash Model Suffix
# Alternative: append parameters directly to the model ID
curl https://api.venice.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "zai-org-glm-5-1:enable_web_search=on&enable_web_citations=true",
    "messages": [{"role": "user", "content": "What are the latest developments in AI?"}]
  }'
```

Set `enable_web_scraping: true` and the model will fetch and read any URLs in the user message before answering.
```bash Curl
curl https://api.venice.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai-gpt-55",
    "messages": [
      {"role": "user", "content": "Summarize this post in five bullets: https://venice.ai/blog/how-to-use-venice-api"}
    ],
    "venice_parameters": {
      "enable_web_scraping": true
    }
  }'
```

```ts TypeScript
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.VENICE_API_KEY!,
  baseURL: "https://api.venice.ai/api/v1",
});

const response = await client.chat.completions.create({
  model: "openai-gpt-55",
  messages: [
    {
      role: "user",
      content: "Summarize this post in five bullets: https://venice.ai/blog/how-to-use-venice-api",
    },
  ],
  // @ts-expect-error - Venice-specific parameter
  venice_parameters: {
    enable_web_scraping: true,
  },
});

console.log(response.choices[0].message.content);
```

```python Python
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["VENICE_API_KEY"],
    base_url="https://api.venice.ai/api/v1",
)

response = client.chat.completions.create(
    model="openai-gpt-55",
    messages=[
        {
            "role": "user",
            "content": "Summarize this post in five bullets: https://venice.ai/blog/how-to-use-venice-api",
        }
    ],
    extra_body={
        "venice_parameters": {
            "enable_web_scraping": True,
        }
    },
)

print(response.choices[0].message.content)
```

Attach PDFs, Office docs, code, and text files (up to 25MB) directly to a chat request. See the [File Inputs guide](/guides/features/file-inputs) for the full format list.
```bash Curl
# Encode a local file as a base64 data URL, then send it inline
FILE_B64=$(base64 q3-report.pdf | tr -d '\n')

curl https://api.venice.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d "{
    \"model\": \"openai-gpt-55\",
    \"messages\": [
      {
        \"role\": \"user\",
        \"content\": [
          {\"type\": \"text\", \"text\": \"Summarize this report in five bullets and list the main risks.\"},
          {\"type\": \"file\", \"file\": {\"filename\": \"q3-report.pdf\", \"file_data\": \"data:application/pdf;base64,${FILE_B64}\"}}
        ]
      }
    ]
  }"
```

```ts TypeScript
import OpenAI from "openai";
import { readFile } from "node:fs/promises";

const client = new OpenAI({
  apiKey: process.env.VENICE_API_KEY!,
  baseURL: "https://api.venice.ai/api/v1",
});

const pdf = await readFile("q3-report.pdf");
const fileData = `data:application/pdf;base64,${pdf.toString("base64")}`;

const response = await client.chat.completions.create({
  model: "openai-gpt-55",
  messages: [
    {
      role: "user",
      content: [
        {
          type: "text",
          text: "Summarize this report in five bullets and list the main risks.",
        },
        // @ts-expect-error - Venice file input block
        { type: "file", file: { filename: "q3-report.pdf", file_data: fileData } },
      ],
    },
  ],
});

console.log(response.choices[0].message.content);
```

```python Python
import base64
import os
from pathlib import Path
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["VENICE_API_KEY"],
    base_url="https://api.venice.ai/api/v1",
)

path = Path("q3-report.pdf")
file_data = "data:application/pdf;base64," + base64.b64encode(path.read_bytes()).decode("utf-8")

response = client.chat.completions.create(
    model="openai-gpt-55",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize this report in five bullets and list the main risks."},
                {"type": "file", "file": {"filename": "q3-report.pdf", "file_data": file_data}},
            ],
        }
    ],
)

print(response.choices[0].message.content)
```

Proxy JSON-RPC 2.0 calls across 11 supported chains with your Venice key or an x402 wallet. See the [Crypto RPC reference](/api-reference/endpoint/crypto/rpc) for chains, methods, and credit tiers.
```bash Curl
curl https://api.venice.ai/api/v1/crypto/rpc/ethereum-mainnet \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "method": "eth_blockNumber",
    "params": [],
    "id": 1
  }'
```

```ts TypeScript
const response = await fetch(
  "https://api.venice.ai/api/v1/crypto/rpc/base-mainnet",
  {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.VENICE_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify([
      { jsonrpc: "2.0", method: "eth_chainId", params: [], id: 1 },
      { jsonrpc: "2.0", method: "eth_blockNumber", params: [], id: 2 },
    ]),
  }
);

const results = await response.json();
console.log(results);
```

```python Python
import os
import requests

response = requests.post(
    "https://api.venice.ai/api/v1/crypto/rpc/ethereum-mainnet",
    headers={
        "Authorization": f"Bearer {os.environ['VENICE_API_KEY']}",
        "Content-Type": "application/json",
    },
    json={
        "jsonrpc": "2.0",
        "method": "eth_getBalance",
        "params": ["0xd8dA6BF26964aF9D7eEd9e03E53415D37aA96045", "latest"],
        "id": 1,
    },
)

print(response.json())
```
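Ethereum-style JSON-RPC methods such as `eth_blockNumber`, `eth_chainId`, and `eth_getBalance` return hex-encoded quantities, and JSON-RPC 2.0 allows batch responses to arrive in any order. The sketch below shows one way to post-process the responses from the calls above. The helper names and the sample response are made up for illustration; it assumes the proxy returns standard JSON-RPC 2.0 objects unchanged.

```python
from decimal import Decimal

WEI_PER_ETHER = 10**18


def hex_to_int(quantity: str) -> int:
    """Decode a hex-encoded JSON-RPC quantity such as "0x10"."""
    return int(quantity, 16)


def wei_to_ether(wei: int) -> Decimal:
    """Convert a wei balance (as returned by eth_getBalance) to ether."""
    return Decimal(wei) / Decimal(WEI_PER_ETHER)


def results_by_id(responses: list) -> dict:
    """Index batch responses by request id, raising on the first RPC error."""
    indexed = {}
    for item in responses:
        if "error" in item:
            raise RuntimeError(f"RPC call {item.get('id')} failed: {item['error']}")
        indexed[item["id"]] = item["result"]
    return indexed


# Made-up batch response for the eth_chainId (id 1) and eth_blockNumber (id 2)
# requests, deliberately listed out of order.
sample_batch = [
    {"jsonrpc": "2.0", "id": 2, "result": "0x10"},
    {"jsonrpc": "2.0", "id": 1, "result": "0x2105"},
]

results = results_by_id(sample_batch)
chain_id = hex_to_int(results[1])
block_number = hex_to_int(results[2])
print(chain_id, block_number)  # prints 8453 16

# A made-up balance of 1.5 * 10**18 wei converts to 1.5 ether.
print(wei_to_ether(hex_to_int("0x14d1120d7b160000")))
```

Matching on `id` rather than list position keeps the code correct even when the server reorders a batch. `Decimal` uses 28 significant digits by default, which is enough for typical balances; raise the context precision for very large values.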

Pricing

Top up, stake, or pay per request

Fund an account with credits, stake DIEM for a daily allowance, or skip the account entirely with USDC on Base.

Credits: USD or Crypto

Pay as you go in USD or crypto. Credits never expire and work across every endpoint.

Buy Credits

DIEM: Daily allowance

Stake DIEM or VVV once and earn a fixed inference allowance every day, with no per-call charges.

Learn about DIEM

x402: USDC on Base

Pay per request from any Base wallet in USDC. No account or API key, built for agents.

Read x402 Guide

Questions or feedback? Join us on [Discord](https://discord.gg/askvenice).