> ## Documentation Index
> Fetch the complete documentation index at: https://docs.honeyhive.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Human Evaluators

> Create human evaluator fields for manual review and annotation of AI outputs

Human evaluators enable domain experts to manually assess AI outputs. Unlike Python or LLM evaluators that run automatically, human evaluators create annotation fields that team members fill in during review.

**When to use human evaluators:**

* Quality assessment requiring domain expertise
* Edge cases that automated evaluators can't handle
* Building ground truth datasets
* Compliance and safety reviews

## Creating a Human Evaluator

1. Navigate to the [**Evaluators**](https://app.us.honeyhive.ai/metrics) tab
2. Click **Add Evaluator** and select **Human Evaluator**

<Frame>
  <img src="https://mintcdn.com/honeyhiveai/76aQ7hOOlqHYtp6r/images/product-categorical.png?fit=max&auto=format&n=76aQ7hOOlqHYtp6r&q=85&s=2c3ca0e45f140da8452cf35820b316ed" alt="Human evaluator configuration showing name, description, return type, categories, and passing range fields" width="3024" height="1568" data-path="images/product-categorical.png" />
</Frame>

## Configuration

### Name and Description

The **Description** field defines evaluation criteria for annotators. Keep it concise and actionable:

```
Rate the response quality:
- Accuracy: Is the information factually correct?
- Completeness: Does it fully address the question?
- Clarity: Is it easy to understand?
```

<Tip>Write criteria as questions annotators can answer while reviewing. Avoid lengthy rubrics - link to external guidelines if needed.</Tip>

### Return Type

| Return Type     | Use Case                                   | UI Component   |
| --------------- | ------------------------------------------ | -------------- |
| **Numeric**     | Ratings on a scale (1-5 stars)             | Star rating    |
| **Binary**      | Yes/no assessments                         | Thumbs up/down |
| **Categorical** | Predefined categories with optional scores | Dropdown menu  |
| **Notes**       | Free-form text feedback                    | Text input     |

### Rating Scale

For **Numeric** and **Categorical** types, set the scale range (e.g., 1-5). For Categorical, assign numeric values to each category to enable aggregation.

### Passing Range

Define which scores indicate acceptable quality. Results outside this range are flagged for review.

## Using Human Evaluators

Once created, human evaluators appear in:

* **Trace detail view** - Annotate individual traces
* **Review Mode** - Batch review with side-by-side input/output
* **Annotation Queues** - Organized workflows for systematic review

<Frame type="glass">
  <img src="https://mintcdn.com/honeyhiveai/81DpusKRfAED9ab1/images/AnnotationQueues.png?fit=max&auto=format&n=81DpusKRfAED9ab1&q=85&s=69951687bdb69d9874f5fe4c6256aed4" alt="Annotation queue interface showing queued events, trace details, and human evaluator fields including Rating, Output Quality, and Comments" width="3024" height="1562" data-path="images/AnnotationQueues.png" />
</Frame>

## Related

<CardGroup cols={2}>
  <Card title="Annotation Queues" icon="list-check" href="/v2/evaluation/annotation-queues">
    Set up automated workflows for human review
  </Card>

  <Card title="Evaluators Introduction" icon="flask" href="/v2/evaluators/introduction">
    Overview of all evaluator types
  </Card>
</CardGroup>
