GET
/
v5
/
models
curl --request GET \
  --url https://api.egp.scale.com/v5/models \
  --header 'x-api-key: <api-key>'
{
  "object": "list",
  "items": [
    {
      "name": "<string>",
      "model_type": "generic",
      "model_vendor": "openai",
      "vendor_configuration": {
        "model_image": {
          "registry": "<string>",
          "repository": "<string>",
          "tag": "<string>",
          "predict_route": "/predict",
          "command": [
            "<string>"
          ],
          "streaming_predict_route": "<string>",
          "streaming_command": [
            "<string>"
          ],
          "request_schema": {},
          "response_schema": {},
          "env_vars": {},
          "readiness_delay": 120,
          "healthcheck_route": "/readyz"
        },
        "model_infra": {
          "endpoint_type": "sync",
          "cpus": "<string>",
          "gpus": 0,
          "gpu_type": "nvidia-tesla-t4",
          "memory": "8Gi",
          "storage": "16Gi",
          "min_workers": 0,
          "max_workers": 1,
          "per_worker": 10,
          "high_priority": false,
          "labels": {},
          "public_inference": false
        }
      },
      "status": "failed",
      "model_metadata": {},
      "id": "<string>",
      "created_at": "2023-11-07T05:31:56Z",
      "created_by_user_id": "<string>",
      "object": "model"
    }
  ],
  "limit": 100,
  "total": 123,
  "has_more": true
}

Authorizations

x-api-key
string
header
required

Query Parameters

name
string | null
model_vendor
enum<string> | null
Available options:
openai,
cohere,
vertex_ai,
anthropic,
azure,
gemini,
launch,
llmengine,
model_zoo,
bedrock
starting_after
string | null
ending_before
string | null
limit
integer
default:
100
Required range: 1 < x < 1000

Response

200
application/json
Successful Response
items
object[]
required
total
integer
required

The total of items that match the query. This is greater than or equal to the number of items returned.

has_more
boolean
required

Whether there are more items left to be fetched.

object
enum<string>
default:
list
Available options:
list
limit
integer
default:
100

The maximum number of items to return.