> ## Documentation Index
> Fetch the complete documentation index at: https://docs.gp.scale.com/llms.txt
> Use this file to discover all available pages before exploring further.

# LLM as a Judge

> Leverage LLM-based tasks to produce evaluation results.

## Create Evaluation

### Navigate to the Evaluate tab and click "Create evaluation".

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/create-evaluation.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=ac97aa300f96b6d6a673417f7e4d70b8" alt="create-evaluation" width="3366" height="1700" data-path="images/nextgen-evaluation/create-evaluation.png" />

### Add Evaluation Details

Add in Evaluation name, description (optional), tags (optional), and select a dataset.

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/evaluation-details.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=4c36a9cf9f0b7ddb332a11d7bfb39159" alt="create-evaluation" width="3374" height="1712" data-path="images/nextgen-evaluation/evaluation-details.png" />

### Add an LLM Judge

An LLM Judge prompts an LLM to answer questions to evaluate your dataset. You can either select an existing judge or you can configure a new one.

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/llm-judge-selection.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=f46490b49b95e2c8c97426a0631c96dc" alt="create-evaluation" width="2182" height="470" data-path="images/nextgen-evaluation/llm-judge-selection.png" />

### Configure LLM Judge

An LLM Judge prompts an LLM to answer questions to evaluate your dataset.

<ul>
  <li> **Alias (Optional)** - The name of the column the results of this judge will show up on your evaluation results. </li>
  <li> **Model** - The model that the LLM Judge uses to evaluate the dataset. </li>
  <li> **System Prompt (Optional)** - A system prompt for an LLM judge is a set of instructions that defines the role and evaluation criteria for a large language model when it's used to assess, score, or make decisions about AI outputs, human responses, or other content. </li>
  <li> **Rubric** - A rubric for an LLM judge is a structured scoring framework that defines specific evaluation criteria, performance levels, and descriptors to ensure consistent and objective assessment of AI outputs or responses. </li>
  <li> **Response Options** - Constraints to what your Model can output. </li>
</ul>

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/configure-judge.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=f6b323d3dd5c55dbe6a03477f6404d4e" alt="create-evaluation" width="1362" height="1656" data-path="images/nextgen-evaluation/configure-judge.png" />

### Create Evaluation

Select the rows on the dataset you want to run the evaluation, and click **Create Evaluation**.

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/create-evaluation-final.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=4b1458e1bc9b09725015af1be9759540" alt="create-evaluation" width="3368" height="1704" data-path="images/nextgen-evaluation/create-evaluation-final.png" />

## View Evaluation Results

If you navigate back to the **Evaluation** tab, you should be able to see the results of the evaluation.

### Data

The data page will have a column with the results of the LLM Judge

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/evaluation-results-data.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=0a98f45202c0edf413d84ba5d80b64a5" alt="create-evaluation" width="3416" height="1710" data-path="images/nextgen-evaluation/evaluation-results-data.png" />

### Overview

The overview page will have a graph with the visual representation of the evaluation result.

<img src="https://mintcdn.com/scalegp/2sSrpizRElJqluR6/images/nextgen-evaluation/evaluation-results-overview.png?fit=max&auto=format&n=2sSrpizRElJqluR6&q=85&s=dfb64e90448200ce77e680c384ec6202" alt="create-evaluation" width="3412" height="1702" data-path="images/nextgen-evaluation/evaluation-results-overview.png" />
