Chat Completions
Run text inference based on the supplied parameters.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
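As a minimal sketch of the header described above, the helper below builds the Bearer header from a token. The function name build_auth_headers and the Content-Type entry are assumptions made for illustration; this reference only specifies the Bearer <token> form.

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return HTTP headers carrying the auth token as a Bearer credential."""
    token = token.strip()
    if not token:
        raise ValueError("auth token must not be empty")
    return {
        # Form given in this reference: "Bearer <token>"
        "Authorization": f"Bearer {token}",
        # Assumption: the request body is sent as JSON
        "Content-Type": "application/json",
    }
```

The returned dictionary can be passed as the headers argument of most HTTP clients.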
Body
ID of the model to use, or the model trait to select the model from.
A list of messages comprising the conversation so far.
Sampling temperature to use. Higher values make the output more random; lower values make it more focused.
0 < x < 2
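An alternative to sampling with temperature, called nucleus sampling, in which only the most likely tokens whose cumulative probability reaches this value are considered.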
0 < x < 1
Whether to stream back partial progress as server-sent events.
Maximum number of tokens to generate.
An upper bound for the number of tokens that can be generated for a completion.
A list of tools the model may call.
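The body parameters above might be assembled along the following lines. This is a sketch under stated assumptions: the field names (model, messages, temperature, top_p, stream, max_tokens, tools) and the role/content message shape are not given in this reference and are borrowed from common chat-completion conventions. Only the two numeric ranges are taken from the text above, and only one of the two token limits is modeled.

```python
from typing import Any, Optional


def build_chat_body(
    model: str,
    messages: list[dict[str, Any]],
    temperature: Optional[float] = None,
    top_p: Optional[float] = None,
    stream: bool = False,
    max_tokens: Optional[int] = None,
    tools: Optional[list[dict[str, Any]]] = None,
) -> dict[str, Any]:
    """Build a request body; all field names here are assumed, not documented."""
    if not messages:
        raise ValueError("messages must contain at least one entry")
    # Ranges as listed in this reference: 0 < x < 2 and 0 < x < 1.
    if temperature is not None and not (0 < temperature < 2):
        raise ValueError("temperature must satisfy 0 < x < 2")
    if top_p is not None and not (0 < top_p < 1):
        raise ValueError("top_p must satisfy 0 < x < 1")
    if max_tokens is not None and max_tokens <= 0:
        raise ValueError("max_tokens must be positive")

    body: dict[str, Any] = {"model": model, "messages": messages, "stream": stream}
    optional = {
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
        "tools": tools,
    }
    # Send only the optional fields the caller actually set.
    body.update({k: v for k, v in optional.items() if v is not None})
    return body
```

Leaving unset optional fields out of the body lets the server apply its own defaults, whatever those are.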
Response
Unique identifier for the chat completion.
The object type.
chat.completion
Unix timestamp of when the completion was created.
The model used for completion.
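To illustrate how the response fields above could be read, the sketch below parses a JSON response into a small record. The key names id, object, created, and model are assumptions, since this reference lists only the descriptions. The chat.completion value and the Unix-timestamp format come from the text above, and treating the timestamp as seconds is a further assumption.

```python
import json
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class ChatCompletionMeta:
    id: str            # unique identifier for the chat completion
    object: str        # object type, expected to be "chat.completion"
    created: datetime  # creation time, converted from a Unix timestamp
    model: str         # model used for the completion


def parse_completion_meta(raw: str) -> ChatCompletionMeta:
    """Parse the metadata fields of a response; key names are assumed."""
    data = json.loads(raw)
    if data["object"] != "chat.completion":
        raise ValueError(f"unexpected object type: {data['object']!r}")
    return ChatCompletionMeta(
        id=data["id"],
        object=data["object"],
        # Assumption: the timestamp is in seconds since the Unix epoch.
        created=datetime.fromtimestamp(data["created"], tz=timezone.utc),
        model=data["model"],
    )
```

Converting the timestamp to a timezone-aware datetime avoids ambiguity when the value is later logged or compared.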