This page describes the data model of datasets and dataset runs. For detailed reference please refer to the 

Datasets are a collection of inputs and, optionally, expected outputs that can be during Dataset runs.

Dataset runs are used to run a dataset through your LLM application and optionally apply evaluation methods to the results.

Most of the time, we recommend that DatasetRunItems reference TraceIDs directly. 

The reference to ObservationID exists for backwards compatibility with older SDK versions.

DataSetRuns can combine a few ABV objects:

 are created by looping through all or selected 

 passed into the LLM application as an Input a 

s to evaluate the output of the LLM application during the 

Learn about the ABV data model for Datasets and Dataset Runs. Understand how to use collections of inputs and expected outputs to evaluate and test your LLM applications effectively. See the full API reference for details.

Custom Scores

Dataset Runs Data Model

Trace

Datasets

ABV Developer Docs

Quickstart (Python SDK)

Quickstart (JS/TS SDK)

Observability & Tracing

Sessions

User Tracking

Environments

ABV Prompt Playground

Metadata

Trace IDs & Distributed Tracing

Log Levels

Comments on Objects

Masking of Sensitive LLM Data

Multi-Modality and Attachments

Event Queuing/Batching

Releases & Versioning

Sampling

Model Usage & Cost Tracking

Trace URLs

Query Data

API & Data Platform (cloned)

Metrics API

Metrics API (cloned with children)

Custom Dashboards

Basic Features

Prompt Management Overview

Get Started with Prompt Management

ABV Prompts Data Model

Prompt Version Control

Prompt Composability

Message Placeholders in Chat Prompts

A/B Testing of LLM Prompts

Caching of Prompts in Client SDKs

Prompt Config

Prompt Folders

Guaranteed Availability

Link Prompts to Traces

Prompt Management

Evaluation Overview

Scores Data Model

LLM-as-a-Judge

Human Annotation

Prompt Experiments

Remote Dataset Runs

Evaluations

Overview of ABV SDKs

Python SDK - Overview

Python SDK - Setup

Python SDK - Instrumentation

Python SDK - Evaluations

Python SDK - Advanced Usage

Python SDK - Troubleshooting

Python SDK

TypeScript SDK - Overview

TypeScript SDK - Setup

TypeScript SDK - Instrumentation

TypeScript SDK - Advanced Configuration

TypeScript SDK - Troubleshooting and FAQ

Cookbook: ABV JS/TS SDK

JS/TS SDK

SDKs

Metrics overview

Metrics

API & Data Platform overview

Export Data from UI

Export via Blob Storage Integration

Export for Fine-Tuning

Public API

Query Data via SDKs

API & Data Platform

Role-Based Access Controls in ABV

Audit Logs

Data Deletion

Data Retention

LLM Connections

SCIM & Organization-Key Scoped API Routes

Usage Alerts

Administration

Platform

API Reference

References

Python SDK (v3)

JS SDK

Security & Compliance Overview

More