curl --request GET \
--url https://api.egp.scale.com/v5/models \
--header 'x-api-key: <api-key>'{
"items": [
{
"id": "<string>",
"name": "<string>",
"model_type": "generic",
"model_vendor": "openai",
"status": "failed",
"created_at": "2023-11-07T05:31:56Z",
"created_by_user_id": "<string>",
"created_by_identity_type": "user",
"object": "model",
"vendor_configuration": {
"model_image": {
"registry": "<string>",
"repository": "<string>",
"tag": "<string>",
"command": [
"<string>"
],
"predict_route": "/predict",
"streaming_predict_route": "<string>",
"streaming_command": [
"<string>"
],
"request_schema": {},
"response_schema": {},
"env_vars": {},
"readiness_delay": 120,
"healthcheck_route": "/readyz"
},
"model_infra": {
"endpoint_type": "sync",
"cpus": 3,
"gpus": 0,
"gpu_type": "nvidia-tesla-t4",
"memory": "8Gi",
"storage": "16Gi",
"min_workers": 0,
"max_workers": 1,
"per_worker": 10,
"high_priority": false,
"labels": {},
"public_inference": false
}
},
"model_metadata": {},
"model_availability": "unknown"
}
],
"total": 123,
"has_more": true,
"object": "list",
"limit": 100
}curl --request GET \
--url https://api.egp.scale.com/v5/models \
--header 'x-api-key: <api-key>'{
"items": [
{
"id": "<string>",
"name": "<string>",
"model_type": "generic",
"model_vendor": "openai",
"status": "failed",
"created_at": "2023-11-07T05:31:56Z",
"created_by_user_id": "<string>",
"created_by_identity_type": "user",
"object": "model",
"vendor_configuration": {
"model_image": {
"registry": "<string>",
"repository": "<string>",
"tag": "<string>",
"command": [
"<string>"
],
"predict_route": "/predict",
"streaming_predict_route": "<string>",
"streaming_command": [
"<string>"
],
"request_schema": {},
"response_schema": {},
"env_vars": {},
"readiness_delay": 120,
"healthcheck_route": "/readyz"
},
"model_infra": {
"endpoint_type": "sync",
"cpus": 3,
"gpus": 0,
"gpu_type": "nvidia-tesla-t4",
"memory": "8Gi",
"storage": "16Gi",
"min_workers": 0,
"max_workers": 1,
"per_worker": 10,
"high_priority": false,
"labels": {},
"public_inference": false
}
},
"model_metadata": {},
"model_availability": "unknown"
}
],
"total": 123,
"has_more": true,
"object": "list",
"limit": 100
}openai, cohere, vertex_ai, anthropic, azure, gemini, launch, llmengine, model_zoo, bedrock, xai, fireworks_ai 1 <= x <= 1000asc, desc Successful Response
Show child attributes
generic, completion, chat_completion openai, cohere, vertex_ai, anthropic, azure, gemini, launch, llmengine, model_zoo, bedrock, xai, fireworks_ai failed, ready, deploying user, service_account "model"Show child attributes
Show child attributes
Show child attributes
async, sync, streaming nvidia-tesla-t4, nvidia-ampere-a10, nvidia-ampere-a100, nvidia-ampere-a100e, nvidia-hopper-h100, nvidia-hopper-h100-1g20gb, nvidia-hopper-h100-3g40gb unknown, available, unavailable The total of items that match the query. This is greater than or equal to the number of items returned.
Whether there are more items left to be fetched.
"list"The maximum number of items to return.