Lists all model templates accessible to the user.

This API can be used to list model templates. If a user has access to multiple accounts, all model templates from all accounts the user is associated with will be returned.

Request example:

import SGPClient from 'sgp';

const client = new SGPClient({
  apiKey: 'My API Key',
});

// Automatically fetches more pages as needed.
for await (const modelTemplate of client.modelTemplates.list()) {
  console.log(modelTemplate.id);
}

Response example:

{
"items": [
{
"name": "<string>",
"endpoint_type": "SYNC",
"model_type": "COMPLETION",
"vendor_configuration": {
"bundle_config": {
"registry": "<string>",
"image": "<string>",
"tag": "<string>",
"command": [
"<string>"
],
"env": {},
"streaming_command": [
"<string>"
],
"readiness_initial_delay_seconds": 120,
"healthcheck_route": "/readyz",
"predict_route": "/predict",
"streaming_predict_route": "/generate_streaming"
},
"vendor": "LAUNCH",
"endpoint_config": {
"cpus": 3,
"memory": "8Gi",
"storage": "16Gi",
"gpus": 0,
"gpu_type": "nvidia-tesla-t4",
"min_workers": 0,
"max_workers": 1,
"per_worker": 10,
"endpoint_type": "ASYNC",
"high_priority": false
},
"fine_tuning_job_bundle_config": {
"registry": "<string>",
"image": "<string>",
"tag": "<string>",
"command": [
"<string>"
],
"env": {},
"mount_location": "/workspace/launch_specific/config.json",
"training_dataset_schema_type": "GENERATION",
"resources": {
"cpus": 3,
"memory": "8Gi",
"storage": "16Gi",
"gpus": 0,
"gpu_type": "nvidia-tesla-t4"
}
}
},
"id": "<string>",
"created_at": "2023-11-07T05:31:56Z",
"account_id": "<string>",
"created_by_user_id": "<string>",
"created_by_identity_type": "user",
"model_creation_parameters_schema": {
"parameters": [
{
"name": "<string>",
"type": "<string>",
"description": "<string>",
"required": true
}
]
},
"model_request_parameters_schema": {
"parameters": [
{
"name": "<string>",
"type": "<string>",
"description": "<string>",
"required": true
}
]
},
"endpoint_protocol": "SGP"
}
],
"total_item_count": 123,
"current_page": 123,
"items_per_page": 123
}
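For explicit page-by-page control instead of the auto-paginating iterator, a single page can be requested directly. The following is only a sketch: the page and limit parameter names and the items accessor are inferred from the query parameters and response shape documented below, not confirmed SDK surface.

import SGPClient from 'sgp';

const client = new SGPClient({
  apiKey: 'My API Key',
});

// Assumed parameter names: `page` (page number, starts at 1) and `limit`
// (items per page) mirror the query parameters documented below.
const firstPage = await client.modelTemplates.list({ page: 1, limit: 100 });

// Each page is assumed to expose the response body's `items` array.
for (const modelTemplate of firstPage.items) {
  console.log(modelTemplate.id);
}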
Query parameters:

Page number for pagination to be returned by the given endpoint. Starts at page 1.
Constraint: x >= 1

Maximum number of artifacts to be returned by the given endpoint. Defaults to 100 and cannot be greater than 10k.
Constraint: 1 <= x <= 10000

Successful Response

items: The data returned for the current page.
items[].endpoint_type: An enum representing the different types of model endpoint types supported.
Attributes: SYNC denotes that the model endpoint type is sync; ASYNC denotes async; STREAMING denotes streaming; BATCH denotes batch.
Allowed values: SYNC, ASYNC, STREAMING, BATCH

items[].model_type: An enum representing the different types of models supported.
Attributes: COMPLETION denotes that the model type is completion; CHAT_COMPLETION denotes chat completion; AGENT denotes agent; EMBEDDING denotes embedding; RERANKING denotes reranking; GENERIC denotes generic; BUNDLE is not to be used directly and exists to surface a model bundle in the UI (TODO: explicitly type a bundle).
Allowed values: COMPLETION, CHAT_COMPLETION, AGENT, EMBEDDING, RERANKING, GENERIC, BUNDLE

items[].vendor_configuration: Configuration for launching a model using the Launch service, an internal, self-hosted service developed by Scale that deploys models on Kubernetes.
Attributes: vendor (the vendor of the model template), bundle_config (the bundle configuration of the model template), endpoint_config (the endpoint configuration of the model template).

items[].vendor_configuration.vendor: Allowed value: "LAUNCH"
items[].vendor_configuration.endpoint_config.gpu_type: Allowed values: nvidia-tesla-t4, nvidia-ampere-a10, nvidia-ampere-a100, nvidia-ampere-a100e, nvidia-hopper-h100, nvidia-hopper-h100-1g20gb, nvidia-hopper-h100-3g40gb

items[].vendor_configuration.endpoint_config.per_worker: The maximum number of concurrent requests that an individual worker can service. Launch automatically scales the number of workers for the endpoint so that each worker is processing per_worker requests, subject to the limits defined by min_workers and max_workers:
- If the average number of concurrent requests per worker is lower than per_worker, then the number of workers will be reduced.
- Otherwise, if the average number of concurrent requests per worker is higher than per_worker, then the number of workers will be increased to meet the elevated traffic.
Here is our recommendation for computing per_worker (a worked example follows below):
1. Compute min_workers and max_workers per your minimum and maximum throughput requirements.
2. Determine a value for the maximum number of concurrent requests in the workload. Divide this number by max_workers. Doing this ensures that the number of workers will "climb" to max_workers.
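A worked instance of that recommendation, with purely illustrative numbers (none of these values are defaults):

// Worked example for the per_worker recommendation above.
const minWorkers = 1;              // step 1: minimum throughput requirement
const maxWorkers = 5;              // step 1: maximum throughput requirement
const maxConcurrentRequests = 50;  // step 2: peak concurrency in the workload

// per_worker = peak concurrency / max_workers, so sustained peak traffic
// drives the autoscaler all the way up to max_workers.
const perWorker = Math.ceil(maxConcurrentRequests / maxWorkers); // 10

console.log({ minWorkers, maxWorkers, perWorker });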
items[].vendor_configuration.endpoint_config.endpoint_type: An enum representing the different types of model endpoint types supported.
Attributes: SYNC denotes that the model endpoint type is sync; ASYNC denotes async; STREAMING denotes streaming; BATCH denotes batch.
Allowed values: SYNC, ASYNC, STREAMING, BATCH
items[].vendor_configuration.fine_tuning_job_bundle_config.mount_location: The filesystem location where the fine-tuning job's configuration will be available when it is started.

items[].vendor_configuration.fine_tuning_job_bundle_config.training_dataset_schema_type: Optionally sets the required training and validation dataset schema.
Allowed values: GENERATION, RERANKING_QUESTIONS

items[].vendor_configuration.fine_tuning_job_bundle_config.resources.gpu_type: Allowed values: nvidia-tesla-t4, nvidia-ampere-a10, nvidia-ampere-a100, nvidia-ampere-a100e, nvidia-hopper-h100, nvidia-hopper-h100-1g20gb, nvidia-hopper-h100-3g40gb

items[].id: The unique identifier of the entity.
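As a concrete illustration of mount_location, a fine-tuning job entrypoint could read its configuration from that path at startup. This minimal sketch assumes a Node.js entrypoint, which is purely illustrative; the path matches the default shown in the sample response.

import { readFileSync } from 'node:fs';

// Inside the job container, the configuration is mounted at mount_location.
// This path is the default from the sample response; real jobs should use
// whatever mount_location their template specifies.
const mountLocation = '/workspace/launch_specific/config.json';
const jobConfig = JSON.parse(readFileSync(mountLocation, 'utf8'));
console.log(jobConfig);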
items[].created_at: The date and time when the entity was created, in ISO format.

items[].account_id: The ID of the account that owns the given entity.

items[].created_by_user_id: The user who originally created the entity.

items[].created_by_identity_type: The type of identity that created the entity.
Allowed values: user, service_account

items[].model_creation_parameters_schema: The field names and types of available parameter fields which may be specified during model creation.
items[].model_creation_parameters_schema.parameters[].type: Allowed value: "str"

items[].model_request_parameters_schema: The field names and types of available parameter fields which may be specified in a model execution API's model_request_parameters field.
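For illustration, a parameters schema declaring one required string parameter might look like the sketch below; the parameter name and description are invented for the example, and "str" is the only type value shown above.

// Hypothetical model_creation_parameters_schema value.
const creationSchema = {
  parameters: [
    {
      name: 'system_prompt',  // assumed example name
      type: 'str',
      description: 'Prompt prepended to every request.',
      required: true,
    },
  ],
};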
items[].endpoint_protocol: The name of the calling convention expected by the Launch model endpoint.
Allowed values: SGP, COHERE, VLLM

total_item_count: The total number of items matching the query.

current_page: The current page number.

items_per_page: The number of items per page.