API Key Rate Limits
Rate Limits and Balances
Venice APIs
- API Spec
- Rate Limits
- Error Codes
- Chat
- Images
- Audio
- Models
- API Keys
- API Key Rate Limits
- Characters
API Key Rate Limits
Rate Limits and Balances
Return details about user balances and rate limits.
GET
/
api_keys
/
rate_limits
curl --request GET \
--url https://api.venice.ai/api/v1/api_keys/rate_limits \
--header 'Authorization: Bearer <token>'
{
"data": {
"apiTier": {
"id": "paid",
"isCharged": true
},
"accessPermitted": true,
"keyExpiration": "2025-06-01T00:00:00.000Z",
"balances": {
"VCU": 100.023,
"USD": 50.23
},
"rateLimits": [
{
"apiModelId": "llama-3.3-70b",
"rateLimits": [
{
"amount": 100,
"type": "RPM"
}
]
}
]
}
}
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Response
200
application/json
OK
Does the API key have access to consume the inference APIs?
Example:
true
The timestamp the API key expires. If null, the key never expires.
Example:
"2025-06-01T00:00:00.000Z"
The ID of the API model.
Example:
"llama-3.3-70b"
curl --request GET \
--url https://api.venice.ai/api/v1/api_keys/rate_limits \
--header 'Authorization: Bearer <token>'
{
"data": {
"apiTier": {
"id": "paid",
"isCharged": true
},
"accessPermitted": true,
"keyExpiration": "2025-06-01T00:00:00.000Z",
"balances": {
"VCU": 100.023,
"USD": 50.23
},
"rateLimits": [
{
"apiModelId": "llama-3.3-70b",
"rateLimits": [
{
"amount": 100,
"type": "RPM"
}
]
}
]
}
}