Upload datasets
How to upload a dataset in HoneyHive
We support uploading datasets to HoneyHive both through the UI and the SDK.
Upload a dataset through the UI
We currently support JSON
, JSONL
and CSV
file uploads in HoneyHive.
Here’s an example JSONL
file that you can upload:
Here’s an example CSV
file that you can upload:
In the below tutorial, we will use the JSON
file format.
Expected time: few minutes
Steps:
Create a file with your JSON data
We will use a file called AI_bot_queries.json
with the content as shown above.
Upload & view your dataset
Follow the steps after to upload & view your dataset:
Upload a dataset through the SDK
Both our TypeScript and Python SDKs have been designed to ingest completely custom JSON lists.
All you need to do is to define which fields in each row map to inputs, ground truth, conversation history. All other fields are placed in metadata.
Prerequisites
- You have already created a project in HoneyHive, as explained here.
- You have an API key for your project, as explained here.
Expected time: few minutes
Installation
To install our SDKs, run the following commands in the shell.
Authentication & Imports
To authenticate your SDK, you need to pass your API key.
Create the dataset object
Give your new dataset a name and pass the project name to which you want to associate the dataset.
Keep the generated dataset_id
handy for future reference.
Pass your data and provide a mapping
Now, using the dataset_id
, you can pass your data list and provide a mapping to the fields.
We’ll create unique datapoints for each entry in the JSON list. The datapoint_id
on those entries will be used for joining traces in experiment runs in the future.
metadata
of the datapoint.You have successfully uploaded your dataset to HoneyHive using the SDK.
You can now view your dataset in the HoneyHive UI.