Venice API Docs: /models

GET

models

curl --request GET \
  --url https://api.venice.ai/api/v1/models

{
  "data": [
    {
      "created": 1727966436,
      "id": "llama-3.2-3b",
      "model_spec": {
        "availableContextTokens": 131072,
        "capabilities": {
          "optimizedForCode": false,
          "quantization": "fp16",
          "supportsFunctionCalling": true,
          "supportsReasoning": false,
          "supportsResponseSchema": true,
          "supportsVision": false,
          "supportsWebSearch": true,
          "supportsLogProbs": true
        },
        "constraints": {
          "temperature": {
            "default": 0.8
          },
          "top_p": {
            "default": 0.9
          }
        },
        "modelSource": "https://huggingface.co/meta-llama/Llama-3.2-3B",
        "offline": false,
        "pricing": {
          "input": {
            "usd": 0.15,
            "vcu": 1.5
          },
          "output": {
            "usd": 0.6,
            "vcu": 6
          }
        },
        "traits": [
          "fastest"
        ]
      },
      "object": "model",
      "owned_by": "venice.ai",
      "type": "text"
    }
  ],
  "object": "list",
  "type": "text"
}

Postman Collection

For additional examples, please see this Postman Collection.

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Query Parameters

type

Filter models by type. Use "all" to get all model types.

Available options:

embedding,

image,

text,

tts,

upscale

Example:

"text"

Response

200

application/json

The response is of type object.

Embeddings Compatibility Mapping

curl --request GET \
  --url https://api.venice.ai/api/v1/models

{
  "data": [
    {
      "created": 1727966436,
      "id": "llama-3.2-3b",
      "model_spec": {
        "availableContextTokens": 131072,
        "capabilities": {
          "optimizedForCode": false,
          "quantization": "fp16",
          "supportsFunctionCalling": true,
          "supportsReasoning": false,
          "supportsResponseSchema": true,
          "supportsVision": false,
          "supportsWebSearch": true,
          "supportsLogProbs": true
        },
        "constraints": {
          "temperature": {
            "default": 0.8
          },
          "top_p": {
            "default": 0.9
          }
        },
        "modelSource": "https://huggingface.co/meta-llama/Llama-3.2-3B",
        "offline": false,
        "pricing": {
          "input": {
            "usd": 0.15,
            "vcu": 1.5
          },
          "output": {
            "usd": 0.6,
            "vcu": 6
          }
        },
        "traits": [
          "fastest"
        ]
      },
      "object": "model",
      "owned_by": "venice.ai",
      "type": "text"
    }
  ],
  "object": "list",
  "type": "text"
}

Venice APIs

List Models

Postman Collection

Authorizations

Query Parameters

Response

Venice APIs

​Postman Collection

Authorizations

Query Parameters

Response

Postman Collection