JavaScript

import SGPClient from 'sgp';

const client = new SGPClient({
  apiKey: 'My API Key',
});

const chatCompletionsResponse = await client.chatCompletions.create({
  messages: [{ content: 'string' }],
  model: 'gpt-oss-120b',
});

console.log(chatCompletionsResponse.chat_completion);

{
  "chat_completion": {
    "message": {
      "role": "user",
      "content": "<string>"
    },
    "finish_reason": "<string>"
  },
  "token_usage": {
    "prompt": 123,
    "completion": 123,
    "total": 123
  }
}

Chat Completions

Create Chat Completion

Description

Given a list of messages representing a conversation history, runs LLM inference to produce the next message.

Details

Like completions, chat completions involve an LLM’s response to input. However, chat completions take a conversation history as input, instead of a single prompt, which enables the LLM to create responses that take past context into account.

Messages

The primary input to the LLM is a list of messages represented by the messages array, which forms the conversation. The messages array must contain at least one message object. Each message object is attributed to a specific entity through its role. The available roles are:

user: Represents the human querying the model. - assistant: Represents the model responding to user. - system: Represents a non-user entity that provides information to guide the behavior of the assistant.

When the role of a message is set to user, assistant, or system, the message must also contain a content field which is a string representing the actual text of the message itself. Semantically, when the role is user, content contains the user’s query. When the role is assistant, content is the model’s response to the user. When the role is system, content represents the instruction for the assistant.

Instructions

You may provide instructions to the assistant by supplying by supplying instructions in the HTTP request body or by specifying a message with role set to system in the messages array. By convention, the system message should be the first message in the array. Do not specify both an instruction and a system message in the messages array.

POST

chat-completions

JavaScript

import SGPClient from 'sgp';

const client = new SGPClient({
  apiKey: 'My API Key',
});

const chatCompletionsResponse = await client.chatCompletions.create({
  messages: [{ content: 'string' }],
  model: 'gpt-oss-120b',
});

console.log(chatCompletionsResponse.chat_completion);

{
  "chat_completion": {
    "message": {
      "role": "user",
      "content": "<string>"
    },
    "finish_reason": "<string>"
  },
  "token_usage": {
    "prompt": 123,
    "completion": 123,
    "total": 123
  }
}

Authorizations

x-api-key

string

header

required

Body

application/json

Response

200

application/json

Successful Response

The response is of type object.

Create Completion Execute Model Deployment

Knowledge Bases

Chunks

Agents

Completions

Chat Completions

Models

Users

Accounts

Organizations

Question Sets

Evaluations

Evaluation Configs

Evaluation Datasets

Studio Projects

Application Specs

Questions

Knowledge Base Data Sources

Model Templates V3 (Beta)

Model server

API Reference

Fine Tuning Jobs V3 (Beta)

Training Datasets V3 (Beta)

package deployments

Beta

Applications

ChatThreads

Interactions

MonitoringDashboard

Chat Themes

account groups

account