文本解析器
Extracts text from a document file. Supports PDF, DOCX, PPTX, XLSX, and plain text formats. Upload a file via multipart/form-data.
Privacy: Text parsing runs entirely in-memory on Venice’s infrastructure with zero data retention. Documents are processed and immediately discarded — no content is stored or logged.
Authentication: This endpoint accepts either a Bearer API key or a SIGN-IN-WITH-X header for x402 wallet-based authentication. The legacy X-Sign-In-With-X header is also accepted during migration. When using x402, a 402 Payment Required response indicates insufficient balance and includes top-up instructions.
file 字段上传文档文件。支持的格式包括 PDF、DOCX、XLSX 和纯文本文件(最大 25MB)。
将 response_format 设置为 json(默认)可获取包含已提取文本和 token 数量的结构化输出,设置为 text 则返回原始提取的文本。
隐私: 文本解析完全在 Venice 基础设施的内存中运行,零数据保留。您的文档在处理后立即被丢弃 —— 不存储或记录任何内容。
定价: 每次请求 $0.01。
示例(cURL)
授权
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
请求体
The document file to parse. Supported formats: PDF, DOCX, PPTX, XLSX, and plain text files. Maximum size: 25MB.
The format of the response output. "json" returns structured JSON with text and token count, "text" returns only the extracted text.
json, text