An overview of HoneyHive datasets and their role in the AI application lifecycle.
user_query
, expected_response
, customer_segment
).
source_log_id
, timestamp
, user_segment
).dataset_id
.
dataset_id
to the evaluate
function.
inputs
, ground_truths
, etc., and pass it via the dataset
parameter to evaluate
.
EXT-
followed by a hash of the content, e.g., EXT-dc089d82c986a22921e0e773
).
Support for External (Custom‑ID) In‑Code Datasets
You can now log an in‑code dataset with your own IDs and names by adding optional id
and name
at the top level, and optional id
on each datapoint.
These IDs will appear in the UI prefixed with EXT-
, offering full integration with experiment tracking while preserving your existing naming conventions.
id
and name
is entirely optional—omit them to let HoneyHive generate EXT-…
identifiers automatically.
Custom datapoint IDs help you trace individual rows in the UI or logs, while a custom dataset ID and name let you easily refer to that dataset across experiments.
evaluate
, provide either the dataset_id
(for managed datasets) or the dataset
parameter (for in-code datasets), but never both.