# HoneyHive Overview

> Getting started with HoneyHive

<Frame>
  <img className="block dark:hidden" src="https://mintcdn.com/honeyhiveai/QpXbEVVa7Wd8Boo0/images/intro-banner.png?fit=max&auto=format&n=QpXbEVVa7Wd8Boo0&q=85&s=d56638208eed08e1ae8a8791c773413f" alt="HoneyHive: the observability layer for enterprise agents" width="2400" height="1260" data-path="images/intro-banner.png" />

  <img className="hidden dark:block" src="https://mintcdn.com/honeyhiveai/QpXbEVVa7Wd8Boo0/images/intro-banner-dark.png?fit=max&auto=format&n=QpXbEVVa7Wd8Boo0&q=85&s=2d39267eb8951d2d42b611a4cb3a3759" alt="HoneyHive: the observability layer for enterprise agents" width="2400" height="1260" data-path="images/intro-banner-dark.png" />
</Frame>

HoneyHive is an **AI observability and evaluation platform** for tracing, evaluating, monitoring, and improving AI agents, from development through production.

<CardGroup cols={2}>
  <Card title="Start Tracing" icon="rocket" href="/v2/introduction/tracing-quickstart" color="#00A67E">
    Instrument your first agent and capture traces in 5 minutes.
  </Card>

  <Card title="Run Your First Evaluation" icon="flask" href="/v2/introduction/experiments-quickstart" color="#7C3AED">
    Set up an experiment and evaluate your agent programmatically.
  </Card>
</CardGroup>

***

## The Workflow

HoneyHive follows an **Evaluation-Driven Development (EDD)** workflow, similar to test-driven development (TDD) in software engineering, in which evaluation guides every stage of agent development.

<Steps>
  <Step title="Production: Observe and Evaluate">
    Instrument your application with distributed tracing to capture every interaction. Collect traces, user feedback, and quality metrics from production. Run **online evals** to surface edge cases at scale, and set up alerts to catch failures or metric drift.

    <Tabs>
      <Tab title="Traces">
        Inspect every LLM call, tool invocation, and chain step in a structured execution log.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/sFOpWw98R-jnkhpC/images/product-traces.png?fit=max&auto=format&n=sFOpWw98R-jnkhpC&q=85&s=99d66a25e27f2188b49cc0185ec8b607" width="3024" height="1560" data-path="images/product-traces.png" />
        </Frame>
      </Tab>

      <Tab title="Agent Graphs">
        Visualize agentic workflows as interactive graphs showing how components connect and where execution flows.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/81DpusKRfAED9ab1/images/NewGraphView.png?fit=max&auto=format&n=81DpusKRfAED9ab1&q=85&s=2ca201663a04093a65acf7b5fe0ff0f6" width="3024" height="1564" data-path="images/NewGraphView.png" />
        </Frame>
      </Tab>

      <Tab title="Trajectories">
        Plot long agent sessions as bubbles sized by duration, cost, metrics, feedback, or metadata values to spot loops, stuck steps, and outliers.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/sFOpWw98R-jnkhpC/images/trajectory-view.png?fit=max&auto=format&n=sFOpWw98R-jnkhpC&q=85&s=562a952c38c289a2ac3d789e74d6f168" width="3024" height="1556" data-path="images/trajectory-view.png" />
        </Frame>
      </Tab>

      <Tab title="Threads">
        Follow a session across multiple sub-agents in a single chronological thread, including internal messages and context propagation.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/81DpusKRfAED9ab1/images/ThreadView.png?fit=max&auto=format&n=81DpusKRfAED9ab1&q=85&s=4b17866633b3fcad1098e57b3a813341" width="3024" height="1562" data-path="images/ThreadView.png" />
        </Frame>
      </Tab>

      <Tab title="Timeline View">
        Identify latency bottlenecks with a chronological breakdown of every operation in a trace.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/sFOpWw98R-jnkhpC/images/timeline-view.png?fit=max&auto=format&n=sFOpWw98R-jnkhpC&q=85&s=4afe8f25efea30423253c21c6946baf7" width="3024" height="1556" data-path="images/timeline-view.png" />
        </Frame>
      </Tab>

      <Tab title="Dashboard">
        Track cost, latency, and success rates with customizable charts and filters.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-dashboard.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=a96b23c318597e7650d96ca6151e1892" width="3024" height="1566" data-path="images/product-dashboard.png" />
        </Frame>
      </Tab>

      <Tab title="Alerts">
        Get notified when quality drops or errors spike so you can respond before users are affected.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-alerts.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=75d3ba2822f5d3f747ab9d03243943ac" width="3024" height="1564" data-path="images/product-alerts.png" />
        </Frame>
      </Tab>
    </Tabs>
  </Step>

  <Step title="Testing: Curate Datasets & Run Experiments">
    Turn failing production traces into curated test datasets. Run experiments to measure the impact of your changes, track regressions over time, and gate releases in CI.

    <Tabs>
      <Tab title="Experiments">
        Compare prompts, models, or configurations side-by-side to see which changes improve performance.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/81DpusKRfAED9ab1/images/NewExperiments.png?fit=max&auto=format&n=81DpusKRfAED9ab1&q=85&s=7cf07761ca0454aeeb67d45cb58dda45" width="3024" height="1560" data-path="images/NewExperiments.png" />
        </Frame>
      </Tab>

      <Tab title="Datasets">
        Build test sets from production failures and edge cases to cover real-world scenarios.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-datapoint.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=76b7381517d0c5be5c634f0e03a3e194" width="3024" height="1564" data-path="images/product-datapoint.png" />
        </Frame>
      </Tab>

      <Tab title="Regression Tests">
        Verify that new changes don't break existing behavior by running evaluations on every update.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-regressions.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=23932c9a312f4408b289b87b36c0d8ae" width="3024" height="1566" data-path="images/product-regressions.png" />
        </Frame>
      </Tab>

      <Tab title="LLM Evaluators">
        Use AI to assess response quality, accuracy, and safety at scale without manual review.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-llm.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=01cc471bc2e45ffa063e5ab7a8fe26b9" width="3024" height="1568" data-path="images/product-llm.png" />
        </Frame>
      </Tab>

      <Tab title="Code Evaluators">
        Write custom Python evaluation logic for domain-specific metrics that LLMs can't judge reliably.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-code.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=ba616b85f884ec63d70847e3d62fc0b5" width="3024" height="1562" data-path="images/product-code.png" />
        </Frame>
      </Tab>

      <Tab title="Annotation Queues">
        Collect expert judgments on agent outputs to build ground truth labels and improve automated evaluators.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/81DpusKRfAED9ab1/images/AnnotationQueues.png?fit=max&auto=format&n=81DpusKRfAED9ab1&q=85&s=69951687bdb69d9874f5fe4c6256aed4" width="3024" height="1562" data-path="images/AnnotationQueues.png" />
        </Frame>
      </Tab>
    </Tabs>
  </Step>

  <Step title="Development: Iterate on Prompts">
    Use evaluation results to guide changes. Iterate on prompts, test new models, and optimize your application based on what the data shows. Validate changes against curated datasets before deploying.

    <Tabs>
      <Tab title="Playground">
        Test prompt variations and model configurations with instant feedback before committing to code.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-playground.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=7b8d091a5c4e34b26be775f11270a9da" width="3024" height="1562" data-path="images/product-playground.png" />
        </Frame>
      </Tab>

      <Tab title="Prompt Management">
        Version and deploy prompts centrally so your team can iterate without code changes or redeployments.

        <Frame>
          <img src="https://mintcdn.com/honeyhiveai/EWG3R5yYrwNnHjQ7/images/product-prompts.png?fit=max&auto=format&n=EWG3R5yYrwNnHjQ7&q=85&s=f69662f3ea6bc1c742e416c698695d24" width="3024" height="1562" data-path="images/product-prompts.png" />
        </Frame>
      </Tab>
    </Tabs>
  </Step>

  <Step title="Repeat: Continuous Improvement">
    Deploy improvements and continue the cycle. Each iteration builds on production data, creating a **flywheel of improvement** that makes your AI systems more reliable over time.

    <Frame type="glass" caption="The Agent Development Lifecycle (ADLC)">
      <img src="https://cdn.prod.website-files.com/65483e3a00a324531a626c4e/673b754f7eb65e18df13f812_image%20(363).png" />
    </Frame>
  </Step>
</Steps>
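
As a rough illustration of the testing step above, the sketch below curates a dataset from low-scoring traces, scores two agent versions with a simple code evaluator, and applies a regression gate. It is plain Python and does not use the HoneyHive SDK; the trace fields (`question`, `required_keywords`, `score`), the `Datapoint` class, and every function name are hypothetical and chosen only for this example.

```python
from __future__ import annotations

from dataclasses import dataclass
from statistics import mean


@dataclass
class Datapoint:
    """One curated test case: an input plus keywords a good answer must mention."""
    question: str
    required_keywords: list[str]


def curate_dataset(traces: list[dict], max_score: float = 0.5) -> list[Datapoint]:
    """Keep only low-scoring production traces as future test cases."""
    return [
        Datapoint(t["question"], t["required_keywords"])
        for t in traces
        if t["score"] <= max_score
    ]


def keyword_coverage(output: str, datapoint: Datapoint) -> float:
    """Code evaluator: fraction of required keywords present in the output."""
    if not datapoint.required_keywords:
        return 1.0
    text = output.lower()
    hits = sum(1 for kw in datapoint.required_keywords if kw.lower() in text)
    return hits / len(datapoint.required_keywords)


def run_experiment(agent, dataset: list[Datapoint]) -> float:
    """Run the agent on every datapoint and return the mean evaluator score."""
    return mean(keyword_coverage(agent(dp.question), dp) for dp in dataset)


def passes_gate(baseline: float, candidate: float, tolerance: float = 0.02) -> bool:
    """Regression gate: the candidate may not fall more than `tolerance` below baseline."""
    return candidate >= baseline - tolerance


# Hypothetical production traces, each already scored by an online evaluator.
traces = [
    {"question": "How do I reset my password?",
     "required_keywords": ["settings", "reset link"], "score": 0.2},
    {"question": "What plans do you offer?",
     "required_keywords": ["free", "enterprise"], "score": 0.9},
    {"question": "Can I export my data?",
     "required_keywords": ["export", "csv"], "score": 0.4},
]


def old_agent(question: str) -> str:
    """Stand-in for the currently deployed agent."""
    return "Please contact support."


def new_agent(question: str) -> str:
    """Stand-in for the agent after a prompt change."""
    return "Open Settings, request a reset link, or export a CSV."


dataset = curate_dataset(traces)
baseline = run_experiment(old_agent, dataset)
candidate = run_experiment(new_agent, dataset)
print(f"baseline={baseline:.2f} candidate={candidate:.2f} "
      f"gate_passed={passes_gate(baseline, candidate)}")
```

In CI, a failing gate would typically end the job with a non-zero exit status so the release is blocked. The `0.02` tolerance and the `0.5` curation threshold are arbitrary placeholders to tune per metric.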

***

## Platform Capabilities

Core features across the development lifecycle:

<CardGroup cols={2}>
  <Card title="Tracing" icon="chart-network" href="/v2/tracing/introduction">
    Capture and visualize every step of your AI application with distributed tracing.
  </Card>

  <Card title="Experiments & Datasets" icon="flask" href="/v2/evaluation/introduction">
    Test changes with offline experiments and curated datasets before deploying.
  </Card>

  <Card title="Monitoring & Alerting" icon="bell" href="/v2/monitoring/overview">
    Track metrics with dashboards and get alerts when quality degrades.
  </Card>

  <Card title="Online Evaluations" icon="gauge-high" href="/v2/evaluators/introduction">
    Run automated evals on production traces to catch issues early.
  </Card>

  <Card title="Annotation Queues" icon="pen-to-square" href="/v2/evaluators/human">
    Collect expert feedback and turn it into labeled datasets.
  </Card>

  <Card title="Prompt Management" icon="layer-group" href="/v2/prompts/overview">
    Version and manage prompts across the UI and code.
  </Card>
</CardGroup>

***

## Open Standards, Open Ecosystem

HoneyHive is built on **OpenTelemetry**, so it works across models, frameworks, and runtimes with no vendor lock-in.

<Frame>
  <img src="https://mintcdn.com/honeyhiveai/sFOpWw98R-jnkhpC/images/integrations.png?fit=max&auto=format&n=sFOpWw98R-jnkhpC&q=85&s=7d63a86931091aaa180ebf7f45cf0076" alt="HoneyHive Ecosystem" width="2872" height="1218" data-path="images/integrations.png" />
</Frame>

<CardGroup cols={2}>
  <Card title="Model Agnostic" icon="brain" iconType="duotone">
    Works with OpenAI, Anthropic, Bedrock, open-source models, and more.
  </Card>

  <Card title="Framework Agnostic" icon="cube" iconType="duotone">
    Native support for LangChain, CrewAI, Google ADK, AWS Strands, and more.
  </Card>

  <Card title="Runtime Agnostic" icon="microchip" iconType="duotone">
    Trace any runtime: Lambdas, Kubernetes, Bedrock AgentCore, and more.
  </Card>

  <Card title="Bring Your Own Instrumentor" icon="badge-check" iconType="duotone">
    HoneyHive supports the official OpenTelemetry GenAI semantic conventions, as well as the OpenLLMetry and OpenInference conventions.
  </Card>
</CardGroup>

***

## Hosting Options

<CardGroup cols={3}>
  <Card title="Multi-Tenant SaaS" icon="cloud" href="/v2/setup/managed">
    Fully managed. Get started in minutes.
  </Card>

  <Card title="Dedicated Cloud" icon="lock" href="/v2/setup/dedicated">
    Single-tenant environment managed by our team.
  </Card>

  <Card title="Self-Hosted" icon="server" href="/v2/setup/self-hosted">
    Deploy in your VPC for full control and compliance.
  </Card>
</CardGroup>

***

## Additional Resources

<CardGroup cols={2}>
  <Card title="API Reference" icon="code" href="/v2/api-reference/session/start-a-new-session">
    REST API documentation for custom integrations.
  </Card>

  <Card title="SDK Documentation" icon="book" href="/v2/sdk-reference/overview">
    Python SDK guides for advanced use cases.
  </Card>

  <Card title="Invite Your Team" icon="users" href="/v2/workspace/inviting-teammates">
    Add teammates and configure role-based access control.
  </Card>

  <Card title="Integrations" icon="plug" href="/v2/integrations/google-adk">
    Connect with OpenAI, Anthropic, LangChain, and more.
  </Card>
</CardGroup>
