Why use datasets?
- Datasets prerequisite for Dataset Runs, they serve as the data input of Dataset Runs
- Create test cases for your application with real production traces
- Collaboratively create and collect dataset items with your team
- Have a single source of truth for your test data
Get Started
1) Creating a dataset
Datasets have a name which is unique within a project.ABV UI
- Navigate to
Your Project>Datasets - Click on
+ New datasetto create a new dataset.
Python SDK
Install packageJS/TS SDK
.env
2) Create new dataset items
Dataset items can be added to a dataset by providing the input and optionally the expected output.ABV UI
- Add item - Add item manually via UI
- Import CSV - Import CSV file
- Add from trace - Add from the trace view
Python SDK
JS/TS SDK
Create synthetic datasets
Frequently, you want to create synthetic examples to test your application to bootstrap your dataset. LLMs are great at generating these by prompting for common questions/tasks.Create items from production data
A common workflow is to select production traces where the application did not perform as expected. Then you let an expert add the expected output to test new versions of your application on the same data.ABV UI
In the UI, use+ Add to dataseton any observation (span, event, generation) of a production trace.
Python SDK
JS/TS SDK
Edit/archive dataset items
You can edit or archive dataset items. Archiving items will remove them from future experiment runs.ABV UI
In the UI, you can edit the item by clicking on the item id. To archive or delete the item, click on the dots next to the item and selectArchive or Delete.
Python SDK
You can upsert items by providing theid of the item you want to update.
JS/TS SDK
You can upsert items by providing theid of the item you want to update.