curl --request GET \
  --url https://api.egp.scale.com/v5/models/{model_id} \
  --header 'x-api-key: <api-key>'

{
  "id": "<string>",
  "name": "<string>",
  "model_type": "generic",
  "model_vendor": "openai",
  "status": "failed",
  "created_at": "2023-11-07T05:31:56Z",
  "created_by_user_id": "<string>",
  "created_by_identity_type": "user",
  "object": "model",
  "vendor_configuration": {
    "model_image": {
      "registry": "<string>",
      "repository": "<string>",
      "tag": "<string>",
      "command": [
        "<string>"
      ],
      "predict_route": "/predict",
      "streaming_predict_route": "<string>",
      "streaming_command": [
        "<string>"
      ],
      "request_schema": {},
      "response_schema": {},
      "env_vars": {},
      "readiness_delay": 120,
      "healthcheck_route": "/readyz"
    },
    "model_infra": {
      "endpoint_type": "sync",
      "cpus": 3,
      "gpus": 0,
      "gpu_type": "nvidia-tesla-t4",
      "memory": "8Gi",
      "storage": "16Gi",
      "min_workers": 0,
      "max_workers": 1,
      "per_worker": 10,
      "high_priority": false,
      "labels": {},
      "public_inference": false
    }
  },
  "model_metadata": {},
  "model_availability": "unknown"
}

Successful Response
model_type: generic, completion, chat_completion
model_vendor: openai, cohere, vertex_ai, anthropic, azure, gemini, launch, llmengine, model_zoo, bedrock, xai, fireworks_ai
status: failed, ready, deploying
created_by_identity_type: user, service_account
object: "model"
vendor_configuration.model_infra.endpoint_type: async, sync, streaming
vendor_configuration.model_infra.gpu_type: nvidia-tesla-t4, nvidia-ampere-a10, nvidia-ampere-a100, nvidia-ampere-a100e, nvidia-hopper-h100, nvidia-hopper-h100-1g20gb, nvidia-hopper-h100-3g40gb
model_availability: unknown, available, unavailable
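
The request and the value lists above can be exercised from Python with the standard library alone. The following is a minimal sketch, not an official client: the helper names (`build_request`, `fetch_model`, `unexpected_enum_fields`) are hypothetical, and the pairing of each value list with a top-level response field is an assumption inferred from the example response.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.egp.scale.com/v5"

# Allowed values copied from the lists above. Which field each list belongs
# to is an assumption inferred from the example response body.
ALLOWED = {
    "model_type": {"generic", "completion", "chat_completion"},
    "model_vendor": {
        "openai", "cohere", "vertex_ai", "anthropic", "azure", "gemini",
        "launch", "llmengine", "model_zoo", "bedrock", "xai", "fireworks_ai",
    },
    "status": {"failed", "ready", "deploying"},
    "created_by_identity_type": {"user", "service_account"},
    "model_availability": {"unknown", "available", "unavailable"},
}


def build_request(model_id: str, api_key: str) -> urllib.request.Request:
    """Build the GET request for one model. Nothing is sent here."""
    safe_id = urllib.parse.quote(model_id, safe="")
    return urllib.request.Request(
        f"{BASE_URL}/models/{safe_id}",
        headers={"x-api-key": api_key},
        method="GET",
    )


def fetch_model(model_id: str, api_key: str) -> dict:
    """Send the request and decode the JSON body (raises on HTTP errors)."""
    request = build_request(model_id, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def unexpected_enum_fields(model: dict) -> list:
    """Return the names of top-level fields whose value is not in ALLOWED."""
    return sorted(
        field
        for field, allowed in ALLOWED.items()
        if field in model and model[field] not in allowed
    )
```

A caller would typically run `model = fetch_model("<model_id>", "<api-key>")` and then inspect `model["status"]` and `unexpected_enum_fields(model)`. Keeping request construction separate from sending makes the URL and header easy to check without network access.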