Extracts text from a document file. Supports PDF, DOCX, PPTX, XLSX, and plain text formats. Upload a file via multipart/form-data.
Privacy: Text parsing runs entirely in-memory on Venice’s infrastructure with zero data retention. Documents are processed and immediately discarded — no content is stored or logged.
Authentication: This endpoint accepts either a Bearer API key or an X-Sign-In-With-X header for x402 wallet-based authentication. When using x402, a 402 Payment Required response indicates insufficient balance and includes top-up instructions.
Upload a document file via multipart/form-data using theDocumentation Index
Fetch the complete documentation index at: https://docs.venice.ai/llms.txt
Use this file to discover all available pages before exploring further.
file field. Supported formats include PDF, DOCX, XLSX, and plain text files (up to 25MB).
Set response_format to json (default) for structured output with extracted text and token count, or text for the raw extracted text.
Privacy: Text parsing runs entirely in-memory on Venice’s infrastructure with zero data retention. Your documents are processed and immediately discarded — no content is stored or logged.
Pricing: $0.01 per request.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
The document file to parse. Supported formats: PDF, DOCX, PPTX, XLSX, and plain text files. Maximum size: 25MB.
The format of the response output. "json" returns structured JSON with text and token count, "text" returns only the extracted text.
json, text