Evaluations
Dataset Runs Data Model
7 min
this page describes the data model of datasets and dataset runs for detailed reference please refer to the api reference datasets datasets are a collection of inputs and, optionally, expected outputs that can be during dataset runs dataset s are a collection of datasetitem s dataset object attribute type required description name string yes name of the dataset description string no description of the dataset metadata object no additional metadata for the dataset datasetitem object attribute type required description datasetname string yes name of the dataset to add the item to input object no input data for the dataset item expectedoutput object no expected output data for the dataset item metadata object no additional metadata for the dataset item sourcetraceid string no id of the source trace to link this dataset item to sourceobservationid string no id of the source observation to link this dataset item to id string no unique identifier for the dataset item dataset items are upserted on their id id needs to be unique (project level) and cannot be reused across datasets status datasetstatus no status of the dataset item defaults to active for newly created items possible values active , archived datasetrun dataset runs are used to run a dataset through your llm application and optionally apply evaluation methods to the results datasetrun object attribute type required description datasetname string yes name of the dataset runname string yes name of the dataset run datasetrunitem object attribute type required description runname string yes name of the dataset run to add the item to rundescription string no description of the run if run exists, description will be updated metadata object no metadata of the dataset run, updates run if run already exists datasetitemid string yes id of the dataset item to link to this run observationid string no id of the observation to link to this run traceid string no id of the trace to link to this run traceid should always be provided for compatibility with older sdk versions it can also be inferred from the provided observationid most of the time, we recommend that datasetrunitems reference traceids directly the reference to observationid exists for backwards compatibility with older sdk versions end to end data relations datasetruns can combine a few abv objects datasetruns are created by looping through all or selected datasetitem s of a dataset with your llm application for each datasetitem passed into the llm application as an input a datasetrunitem & a trace are created optionally score s can be added to the trace s to evaluate the output of the llm application during the datasetrun