Path parameters
- task_type
string Required The type of inference task that the model performs.
Values are
sparse_embedding
,text_embedding
,rerank
,completion
, orchat_completion
. - inference_id
string Required The unique identifier for the inference endpoint.
Query parameters
- timeout
string The amount of time to wait for the inference request to complete.
Values are
-1
or0
.
Body
- query
string The query input, which is required only for the
rerank
task. It is not required for other tasks. input
string | array[string] Required The text on which you want to perform the inference task. It can be a single string or an array.
Inference endpoints for the
completion
task type currently only support a single string as input.- task_settings
object
POST /_inference/{task_type}/{inference_id}
curl \
--request POST 'http://api.example.com/_inference/{task_type}/{inference_id}' \
--header "Authorization: $API_KEY" \
--header "Content-Type: application/json" \
--data '{"query":"string","input":"string","task_settings":{}}'