Responses
- 200 application/json
object
- endpoints
array[object] Required
- chunking_settings
object
- max_chunk_size
number The maximum size of a chunk in words. This value cannot be higher than 300 or lower than 20 (for the sentence strategy) or 10 (for the word strategy).
- overlap
number The number of overlapping words for chunks. It is applicable only to the word chunking strategy. This value cannot be higher than half the max_chunk_size value.
- sentence_overlap
number The number of overlapping sentences for chunks. It is applicable only to the sentence chunking strategy. It can be either 1 or 0.
- strategy
string The chunking strategy: sentence or word.
- service
string Required The service type
- service_settings
object Required
- task_settings
object
- inference_id
string Required The inference ID
- task_type
string Required Values are sparse_embedding, text_embedding, rerank, completion, or chat_completion.
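The limits on chunking_settings described above can be checked mechanically. The following is a minimal sketch, not part of the API: check_chunking is a hypothetical helper, and it assumes the values are plain non-negative integers (the schema only says "number"). Its third argument is read as overlap for the word strategy and as sentence_overlap for the sentence strategy.

```shell
# check_chunking STRATEGY MAX_CHUNK_SIZE OVERLAP_VALUE
# Hypothetical helper: returns 0 when the values respect the documented
# limits, otherwise prints a reason to stderr and returns 1.
check_chunking() {
  local strategy="$1" max_chunk_size="$2" overlap_value="$3"
  local v min_size

  # Assumption of this sketch: only non-negative integers without
  # leading zeros are accepted.
  for v in "$max_chunk_size" "$overlap_value"; do
    case "$v" in
      ''|*[!0-9]*|0?*)
        echo "not a plain non-negative integer: '$v'" >&2
        return 1
        ;;
    esac
  done

  case "$strategy" in
    sentence)
      min_size=20
      # sentence_overlap can be either 1 or 0.
      if [ "$overlap_value" -gt 1 ]; then
        echo "sentence_overlap must be 0 or 1" >&2
        return 1
      fi
      ;;
    word)
      min_size=10
      # overlap cannot be higher than half of max_chunk_size.
      if [ "$((overlap_value * 2))" -gt "$max_chunk_size" ]; then
        echo "overlap must not exceed half of max_chunk_size" >&2
        return 1
      fi
      ;;
    *)
      echo "strategy must be 'sentence' or 'word'" >&2
      return 1
      ;;
  esac

  # max_chunk_size: at most 300, at least 20 (sentence) or 10 (word).
  if [ "$max_chunk_size" -gt 300 ] || [ "$max_chunk_size" -lt "$min_size" ]; then
    echo "max_chunk_size must be between $min_size and 300" >&2
    return 1
  fi
  return 0
}
```

For example, `check_chunking word 100 50` succeeds, while `check_chunking word 100 51` fails because the overlap exceeds half of max_chunk_size.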
GET /_inference
curl \
--request GET 'http://api.example.com/_inference' \
--header "Authorization: $API_KEY"
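To read only the required fields from the response, the JSON can be piped through a filter. The sketch below assumes jq is installed; list_endpoints is a hypothetical helper name, and the field names are the ones listed in the response attributes above.

```shell
# Hypothetical helper: reads a GET /_inference response on stdin and prints
# one tab-separated line per endpoint: inference_id, task_type, service.
# Assumes jq is available on the PATH.
list_endpoints() {
  jq -r '.endpoints[] | [.inference_id, .task_type, .service] | @tsv'
}

# Usage (the same request as above, with --silent to suppress curl's
# progress output):
#   curl --silent --request GET 'http://api.example.com/_inference' \
#     --header "Authorization: $API_KEY" | list_endpoints
```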