Skip to main content
POST
/
v5
/
agent-evaluations
Create Agent Evaluation
curl --request POST \
  --url https://api.egp.scale.com/v5/agent-evaluations \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --data '
{
  "account_id": "acct_123",
  "data_plane_dataset_id": "dp_dataset_123",
  "description": "Metadata-only control-plane request",
  "metadata": {
    "suite": "golden"
  },
  "name": "Agent judge eval",
  "projection_policy_id": "policy_eval_projection_v1",
  "tags": [
    "two-plane",
    "mvp"
  ],
  "tasks": [
    {
      "alias": "mock_judge",
      "config": {
        "judge_adapter": "mock"
      },
      "type": "auto_evaluation.agent"
    }
  ],
  "tenant_id": "tenant_123"
}
'
{
  "id": "<string>",
  "run_id": "<string>",
  "name": "<string>",
  "progress": {
    "workflows": [
      {
        "name": "<string>",
        "status": "pending"
      }
    ],
    "total_items": 0,
    "completed_items": 0
  },
  "tasks": [
    {
      "alias": "<string>",
      "type": "<string>",
      "config": {},
      "depends_on": [
        "<string>"
      ]
    }
  ],
  "account_id": "<string>",
  "tenant_id": "<string>",
  "data_plane_dataset_id": "<string>",
  "projection_policy_id": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "updated_at": "2023-11-07T05:31:56Z",
  "description": "<string>",
  "metadata": {},
  "tags": [
    "<string>"
  ],
  "data_plane_version": "<string>",
  "projection_freshness_seconds": 123,
  "error_count": 0
}

Authorizations

x-api-key
string
header
required

Headers

x-selected-account-id
string | null

Body

application/json
name
string
required
Minimum string length: 1
data_plane_dataset_id
string
required
Minimum string length: 1
tasks
AgentEvaluationTask · object[]
required
Minimum array length: 1
projection_policy_id
string
required
Minimum string length: 1
description
string
tags
string[]
metadata
Metadata · object
account_id
string
default:local-account
Minimum string length: 1
tenant_id
string
default:local-tenant
Minimum string length: 1

Response

Successful Response

id
string
required
run_id
string
required
name
string
required
status
enum<string>
required
Available options:
pending,
running,
completed,
failed
progress
AgentEvaluationProgress · object
required
tasks
AgentEvaluationTask · object[]
required
account_id
string
required
tenant_id
string
required
data_plane_dataset_id
string
required
projection_policy_id
string
required
created_at
string<date-time>
required
updated_at
string<date-time>
required
description
string
metadata
Metadata · object
tags
string[]
data_plane_version
string
projection_freshness_seconds
integer
error_count
integer
default:0
Required range: x >= 0