1. Instantiate Client
Follow the instructions in the Quickstart Guide to set up the SGP Client.
2. Create autogenerated dataset
For autogenerated evaluation datasets, a generation job is created to produce the test cases.
Test cases are generated from the knowledge base you provide, and must be approved before the dataset can be published.
Once published, an evaluation dataset can be used for application variant runs and report card generation.
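The lifecycle described above can be pictured as a small state machine. The sketch below is only an illustration of that ordering: the `DatasetState` enum, the `ALLOWED` table, and the `advance` function are hypothetical names made up for this example, and none of them are part of the SGP client.

```python
from enum import Enum


class DatasetState(Enum):
    """Hypothetical lifecycle states for an autogenerated dataset."""
    DRAFT = "draft"                    # dataset created, no test cases yet
    GENERATING = "generating"          # generation job is running
    PENDING_REVIEW = "pending_review"  # test cases exist and await review
    PUBLISHED = "published"            # dataset can be used in evaluations


# Allowed forward transitions in this simplified model (an assumption,
# not a description of how the platform tracks state internally).
ALLOWED = {
    DatasetState.DRAFT: {DatasetState.GENERATING},
    DatasetState.GENERATING: {DatasetState.PENDING_REVIEW},
    DatasetState.PENDING_REVIEW: {DatasetState.PUBLISHED},
    DatasetState.PUBLISHED: set(),
}


def advance(current: DatasetState, target: DatasetState) -> DatasetState:
    """Return target if the transition is allowed, otherwise raise ValueError."""
    if target not in ALLOWED[current]:
        raise ValueError(f"cannot go from {current.value} to {target.value}")
    return target
```

In this model, a call such as `advance(DatasetState.DRAFT, DatasetState.PUBLISHED)` raises `ValueError`, which mirrors the rule that a dataset cannot be published before its test cases have been generated and reviewed.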
3. Start generation job
Start the generation job. The job generates test cases from the chunks of data stored in the specified knowledge base.
In this example, we used a knowledge base with Legend of Zelda playthrough guides.
4. Approve auto-generated test cases
Before publishing the dataset, review the auto-generated test cases and approve or decline each one. Publishing is blocked until
all test cases are reviewed.
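The review gate can be sketched as a simple check over the review decisions. This is a local illustration only: `GeneratedTestCase`, `unreviewed`, and `can_publish` are hypothetical names, the questions are placeholders, and the real approval calls should be taken from the SGP client documentation.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class GeneratedTestCase:
    """Hypothetical stand-in for one auto-generated test case."""
    question: str
    approved: Optional[bool] = None  # None = not reviewed, True/False = decision


def unreviewed(cases: List[GeneratedTestCase]) -> List[GeneratedTestCase]:
    """Return the test cases that have been neither approved nor declined."""
    return [case for case in cases if case.approved is None]


def can_publish(cases: List[GeneratedTestCase]) -> bool:
    """Publishing is allowed only when no test case is left unreviewed."""
    return not unreviewed(cases)


# Placeholder data, made up for this example.
cases = [
    GeneratedTestCase("placeholder question 1", approved=True),
    GeneratedTestCase("placeholder question 2", approved=False),
    GeneratedTestCase("placeholder question 3"),
]
```

With the placeholder data above, `can_publish(cases)` is `False` because the third test case has no decision yet. Declining a test case counts as reviewing it, so setting `cases[2].approved = False` is enough to unblock publishing in this model.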
5. Publish the dataset
Publishing the dataset makes it available for use in evaluations.