POST
/
v5
/
inference
curl --request POST \
  --url https://api.egp.scale.com/v5/inference \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --data '{
  "model": "<string>",
  "args": {},
  "inference_configuration": {
    "timeout_seconds": 123,
    "num_retries": 123
  }
}'
{
  "object": "generic_inference",
  "response": "<any>"
}

Authorizations

x-api-key
string
header
required

Body

application/json
model
string
required

model specified as vendor/name (ex. openai/gpt-5)

args
object

Arguments passed into model

inference_configuration
object

Vendor specific configuration

Response

200
application/json
Successful Response
response
any
required
object
enum<string>
default:
generic_inference
Available options:
generic_inference