POST /v4/model-templates

Authorizations

x-api-key
string
header
required

Headers

x-selected-account-id
string | null
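
The two headers above can be assembled as in the sketch below. `build_headers` is a hypothetical helper, not part of any official client. Omitting `x-selected-account-id` when it is unset, and sending `Content-Type: application/json`, are assumptions based on the `string | null` type and the JSON body described on this page.

```python
from typing import Dict, Optional


def build_headers(api_key: str, selected_account_id: Optional[str] = None) -> Dict[str, str]:
    """Assemble headers for POST /v4/model-templates (hypothetical helper)."""
    if not api_key:
        # x-api-key is marked as required on this page.
        raise ValueError("x-api-key is required")
    headers = {"x-api-key": api_key, "Content-Type": "application/json"}
    # Assumption: a null x-selected-account-id is expressed by leaving the header out.
    if selected_account_id is not None:
        headers["x-selected-account-id"] = selected_account_id
    return headers
```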

Body

application/json
name
string
required
endpoint_type
enum<string>
required

An enum representing the supported model endpoint types.

Attributes:
SYNC: Denotes that the model endpoint type is sync.
ASYNC: Denotes that the model endpoint type is async.
STREAMING: Denotes that the model endpoint type is streaming.
BATCH: Denotes that the model endpoint type is batch.

Available options:
SYNC,
ASYNC,
STREAMING,
BATCH
model_type
enum<string>
required

An enum representing the supported model types.

Attributes:
COMPLETION: Denotes that the model type is completion.
CHAT_COMPLETION: Denotes that the model type is chat completion.
AGENT: Denotes that the model type is agent.
EMBEDDING: Denotes that the model type is embedding.
RERANKING: Denotes that the model type is reranking.
GENERIC: Denotes that the model type is generic.

Available options:
COMPLETION,
CHAT_COMPLETION,
AGENT,
EMBEDDING,
RERANKING,
GENERIC
vendor_configuration
object
required

Configuration for launching a model using the Launch service, an internal, self-hosted service developed by Scale that deploys models on Kubernetes.

Attributes:
vendor: The vendor of the model template.
bundle_config: The bundle configuration of the model template.
endpoint_config: The endpoint configuration of the model template.

account_id
string
required

The ID of the account that owns the given entity.

model_creation_parameters_schema
object

The field names and types of the parameter fields that may be specified during model creation.

model_request_parameters_schema
object

The field names and types of the parameter fields that may be specified in a model execution API's model_request_parameters field.

endpoint_protocol
enum<string>

The name of the calling convention expected by the Launch model endpoint.

Available options:
SGP,
COHERE,
VLLM
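
The body fields above can be put together as in the following sketch, which uses only the Python standard library and builds the request without sending it. The base URL `https://api.example.com`, the helper names `build_body` and `build_request`, and every field value (including the contents of `vendor_configuration`) are hypothetical placeholders. Only the path, the field names, and the enum values come from this page.

```python
import json
import urllib.request

# Allowed enum values, copied from the field descriptions above.
ENDPOINT_TYPES = {"SYNC", "ASYNC", "STREAMING", "BATCH"}
MODEL_TYPES = {"COMPLETION", "CHAT_COMPLETION", "AGENT", "EMBEDDING", "RERANKING", "GENERIC"}
ENDPOINT_PROTOCOLS = {"SGP", "COHERE", "VLLM"}
OPTIONAL_FIELDS = {
    "model_creation_parameters_schema",
    "model_request_parameters_schema",
    "endpoint_protocol",
}


def build_body(name, endpoint_type, model_type, vendor_configuration, account_id, **optional):
    """Build the JSON body, rejecting values outside the documented enums."""
    if endpoint_type not in ENDPOINT_TYPES:
        raise ValueError(f"unknown endpoint_type: {endpoint_type!r}")
    if model_type not in MODEL_TYPES:
        raise ValueError(f"unknown model_type: {model_type!r}")
    unknown = set(optional) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    protocol = optional.get("endpoint_protocol")
    if protocol is not None and protocol not in ENDPOINT_PROTOCOLS:
        raise ValueError(f"unknown endpoint_protocol: {protocol!r}")
    body = {
        "name": name,
        "endpoint_type": endpoint_type,
        "model_type": model_type,
        "vendor_configuration": vendor_configuration,
        "account_id": account_id,
    }
    # Optional fields are included only when a value was supplied.
    body.update({k: v for k, v in optional.items() if v is not None})
    return body


def build_request(base_url, api_key, body):
    """Wrap the body in a POST request object; nothing is sent here."""
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v4/model-templates",
        data=json.dumps(body).encode("utf-8"),
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical values throughout; the vendor_configuration contents are placeholders.
body = build_body(
    name="my-template",
    endpoint_type="SYNC",
    model_type="CHAT_COMPLETION",
    vendor_configuration={"vendor": "<vendor>", "bundle_config": {}, "endpoint_config": {}},
    account_id="account-123",
    endpoint_protocol="VLLM",
)
request = build_request("https://api.example.com", "my-api-key", body)
```

Passing `request` to `urllib.request.urlopen` would send it. Validating the enums on the client side is a design choice of this sketch, so that a typo such as `CHAT` fails before any network call is made.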

Response

200 - application/json
name
string
required
endpoint_type
enum<string>
required

An enum representing the supported model endpoint types.

Attributes:
SYNC: Denotes that the model endpoint type is sync.
ASYNC: Denotes that the model endpoint type is async.
STREAMING: Denotes that the model endpoint type is streaming.
BATCH: Denotes that the model endpoint type is batch.

Available options:
SYNC,
ASYNC,
STREAMING,
BATCH
model_type
enum<string>
required

An enum representing the supported model types.

Attributes:
COMPLETION: Denotes that the model type is completion.
CHAT_COMPLETION: Denotes that the model type is chat completion.
AGENT: Denotes that the model type is agent.
EMBEDDING: Denotes that the model type is embedding.
RERANKING: Denotes that the model type is reranking.
GENERIC: Denotes that the model type is generic.

Available options:
COMPLETION,
CHAT_COMPLETION,
AGENT,
EMBEDDING,
RERANKING,
GENERIC
vendor_configuration
object
required

Configuration for launching a model using the Launch service, an internal, self-hosted service developed by Scale that deploys models on Kubernetes.

Attributes:
vendor: The vendor of the model template.
bundle_config: The bundle configuration of the model template.
endpoint_config: The endpoint configuration of the model template.

id
string
required

The unique identifier of the entity.

created_at
string
required

The date and time when the entity was created in ISO format.

account_id
string
required

The ID of the account that owns the given entity.

created_by_user_id
string
required

The user who originally created the entity.

model_creation_parameters_schema
object

The field names and types of the parameter fields that may be specified during model creation.

model_request_parameters_schema
object

The field names and types of the parameter fields that may be specified in a model execution API's model_request_parameters field.

endpoint_protocol
enum<string>

The name of the calling convention expected by the Launch model endpoint.

Available options:
SGP,
COHERE,
VLLM
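
A decoded 200 response can be checked against the required fields listed above, as in this minimal sketch. `ModelTemplate` and `parse_model_template` are hypothetical names, and the sample payload is invented for illustration. Only the field names and their required or optional status come from this page.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict, Optional

# Fields marked "required" in the 200 response above.
REQUIRED_RESPONSE_FIELDS = (
    "name", "endpoint_type", "model_type", "vendor_configuration",
    "id", "created_at", "account_id", "created_by_user_id",
)


@dataclass
class ModelTemplate:
    name: str
    endpoint_type: str
    model_type: str
    vendor_configuration: Dict[str, Any]
    id: str
    created_at: str  # ISO-format timestamp, kept as the raw string
    account_id: str
    created_by_user_id: str
    model_creation_parameters_schema: Optional[Dict[str, Any]] = None
    model_request_parameters_schema: Optional[Dict[str, Any]] = None
    endpoint_protocol: Optional[str] = None


def parse_model_template(payload: Dict[str, Any]) -> ModelTemplate:
    """Validate required keys, then map the payload onto the dataclass."""
    missing = [name for name in REQUIRED_RESPONSE_FIELDS if name not in payload]
    if missing:
        raise ValueError(f"response is missing required fields: {missing}")
    known = {f.name for f in fields(ModelTemplate)}
    # Keys not described on this page are ignored rather than rejected.
    return ModelTemplate(**{k: v for k, v in payload.items() if k in known})


# Invented sample payload, for illustration only.
sample = {
    "name": "my-template",
    "endpoint_type": "SYNC",
    "model_type": "CHAT_COMPLETION",
    "vendor_configuration": {"vendor": "<vendor>", "bundle_config": {}, "endpoint_config": {}},
    "id": "template-123",
    "created_at": "2024-01-01T12:00:00",
    "account_id": "account-123",
    "created_by_user_id": "user-123",
}
template = parse_model_template(sample)
```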