Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.venice.ai/llms.txt

Use this file to discover all available pages before exploring further.

The Venice MCP Server is the official Model Context Protocol server for Venice. It exposes the full Venice API (chat, image, video, audio, music, embeddings, web augment, and characters) as 31 tools that any MCP-compatible agent can call.

GitHub: veniceai/venice-mcp-server

Published as @veniceai/mcp-server on npm. MIT licensed.

31 Tools

Every Venice modality in one config block

Any MCP Host

Claude Desktop, Cursor, ChatGPT, LM Studio, Continue, and more

Wallet Auth (Optional)

Bring an API key, or pay per call with a SIWE-signed wallet via x402

Quick start

1

Get a Venice API key

Generate one from venice.ai/settings/api. See the API key guide for step-by-step instructions.
2

Add Venice to your MCP host config

Drop this into your MCP host’s config file:
{
  "mcpServers": {
    "venice": {
      "command": "npx",
      "args": ["-y", "@veniceai/[email protected]"],
      "env": { "VENICE_API_KEY": "<your-venice-api-key>" }
    }
  }
}
Common config paths:
HostPath
Claude Desktop (macOS)~/Library/Application Support/Claude/claude_desktop_config.json
Claude Desktop (Windows)%APPDATA%\Claude\claude_desktop_config.json
Cursor~/.cursor/mcp.json
LM Studiomcp.json (from the app’s MCP settings)
3

Restart your MCP host

Your agent now has chat, image, video, music, TTS, ASR, and 25 more Venice tools available.
Most MCP hosts only pass environment variables that are explicitly listed in the env block. System-level env vars are not inherited. If you see 402 errors with an API key set, double-check that VENICE_API_KEY is inside env in your config.

What you get

31 tools spanning every Venice modality, 3 resources (venice://models, venice://styles, venice://voices), and 3 prompt templates.

Chat & embeddings

ToolDescription
venice_chatOpenAI-compatible chat completion against Venice’s full LLM catalog.
venice_responsesOpenAI-compatible Responses API with single- or multi-turn tool support.
venice_embeddingsCompute embeddings for text input.
venice_chat_with_characterChat with a Venice character by slug.

Image

ToolDescription
venice_image_generateGenerate an image (Flux 2, Lustify SDXL, Anime/WAI, Qwen Image, GPT Image, Nano Banana Pro, and more).
venice_image_editEdit an image with a prompt.
venice_image_multi_editEdit multiple images together with one prompt.
venice_image_upscaleUpscale an image up to 4×.
venice_image_remove_bgRemove an image background.
venice_image_stylesList image style presets.

Video

ToolDescription
venice_video_generateQueue a video generation (Sora 2, Veo 3.1, Kling, Wan, LTX 2, Seedance, Runway Gen-4, and more).
venice_video_statusCheck status of a queued video job.
venice_video_completeMark a completed video as downloaded; deletes server-side media.
venice_video_transcriptionsTranscribe a YouTube video URL.
venice_video_quoteGet a price quote before queuing.

Audio (TTS / ASR)

ToolDescription
venice_ttsText-to-speech with cloned voices and emotion tags.
venice_asrTranscribe audio from a URL.
venice_voice_cloneList built-in voices or clone a voice from a sample.
venice_audio_quoteGet a price quote for music generation.

Music

ToolDescription
venice_music_generateQueue music generation (ace-step-15, elevenlabs-music, minimax-music-v2/v25/v26, stable-audio-25, mmaudio-v2, elevenlabs-sound-effects-v2).
venice_music_statusCheck status of a queued music job.
venice_music_completeMark a completed music job as downloaded.

Web augment, catalog, and crypto

ToolDescription
venice_web_searchSearch the web (Firecrawl-backed).
venice_web_scrapeScrape one URL into markdown.
venice_text_parserExtract text from PDF/DOCX/EPUB/PPTX/XLSX.
venice_list_modelsList the live model catalog with prices.
venice_list_charactersList public Venice characters.
venice_crypto_rpcProxy JSON-RPC calls to Base, Ethereum, Polygon, Arbitrum, or Optimism.

x402 wallet helpers

Only relevant if you authenticate with a wallet via x402 instead of an API key.
ToolDescription
venice_x402_balanceCheck prepaid x402 credit balance for a wallet.
venice_x402_top_up_infoFetch top-up requirements (network, USDC token, receiver, min amount).
venice_x402_transactionsList recent x402 top-up and debit transactions.

Configuration

The server is configured entirely through environment variables.
Env varDefaultNotes
VENICE_API_KEY(none)Your Venice API key. The simplest setup.
VENICE_DEFAULT_CHAT_MODELvenice-uncensored
VENICE_DEFAULT_IMAGE_MODELflux-2-pro
VENICE_DEFAULT_TTS_MODELtts-kokoro
VENICE_DEFAULT_ASR_MODELopenai/whisper-large-v3
VENICE_DISABLE_NSFW0Set to 1 to remove NSFW capability notes from tool descriptions.
VENICE_HTTP_TIMEOUT_MS60000
VENICE_SIWX_TOKEN(none)x402 wallet-mode auth token. See x402 below.
If both VENICE_API_KEY and VENICE_SIWX_TOKEN are set, the API key wins.

x402 wallet mode

Venice supports authenticating with a SIWE-signed wallet token backed by prepaid USDC credit on Base mainnet, in addition to the normal API key flow. No email, phone, or KYC required: your wallet is the only identity.
{
  "mcpServers": {
    "venice": {
      "command": "npx",
      "args": ["-y", "@veniceai/[email protected]"],
      "env": { "VENICE_SIWX_TOKEN": "<base64 SIWE payload>" }
    }
  }
}
The MCP server forwards VENICE_SIWX_TOKEN as the X-Sign-In-With-X header on every Venice API call. The server never sees your private key. SIWE signing and USDC top-up authorizations happen in your own wallet.
FlowWhat happens
One-time setupSign a SIWE message in your wallet → produces a SIWX token (base64 JSON).
Top upPOST /api/v1/x402/top-up returns 402 + payment requirements. Sign a USDC EIP-3009 transfer in your wallet, resubmit, and Venice credits your balance.
Every inference callMCP server sends X-Sign-In-With-X: <SIWX>; Venice debits your prepaid balance.
Minimum top-up is $5 USD. Minimum balance to call inference is $0.10. Once topped up, calls are sub-100ms because settlement happens off-chain on a fast credit account.
Wallets linked to a Venice account with DIEM staked consume from the staking balance instead of USDC credits, so no top-up is needed.

Self-hosting (Streamable HTTP)

For team or workspace deployments, run the MCP server over HTTP instead of stdio:
docker run -p 3333:3333 \
  -e VENICE_API_KEY=<your-venice-api-key> \
  -e VENICE_MCP_AUTH_TOKEN=<choose-a-long-random-token> \
  ghcr.io/veniceai/venice-mcp-server:latest
The server is now available at http://localhost:3333/mcp. HTTP clients must send Authorization: Bearer <VENICE_MCP_AUTH_TOKEN>.
/mcp is a credential-backed tool execution endpoint: callers can spend the configured Venice API key or x402 balance. When HTTP mode binds outside loopback, startup fails unless VENICE_MCP_AUTH_TOKEN is set. For production, pin the npm package version explicitly instead of relying on latest.

Resources

GitHub

Source code, issues, and releases

npm

@veniceai/mcp-server

Venice Skills

Companion skills that teach agents how to use these tools

MCP Spec

Learn more about the Model Context Protocol