Path parameters

  • inference_idstring Required

    The inference Id

Responses

  • 200 application/json
    Hide response attribute Show response attribute object
    • endpointsarray[object] Required
      Hide endpoints attributes Show endpoints attributes object
      • Hide chunking_settings attributes Show chunking_settings attributes object
        • The maximum size of a chunk in words. This value cannot be higher than 300 or lower than 20 (for sentence strategy) or 10 (for word strategy).

        • overlapnumber

          The number of overlapping words for chunks. It is applicable only to a word chunking strategy. This value cannot be higher than half the max_chunk_size value.

        • The number of overlapping sentences for chunks. It is applicable only for a sentence chunking strategy. It can be either 1 or 0.

        • strategystring

          The chunking strategy: sentence or word.

      • servicestring Required

        The service type

      • service_settingsobject Required
      • inference_idstring Required

        The inference Id

      • task_typestring Required

        Values are sparse_embedding, text_embedding, rerank, completion, or chat_completion.

GET /_inference/{inference_id}
curl \
 --request GET 'http://api.example.com/_inference/{inference_id}' \
 --header "Authorization: $API_KEY"