Skip to main content
Sets a label, description, and/or source URL for the dataset itself. The label is the display name of the dataset in the UI. The description provides context about the data. The info_source is a URL linking to the original data source. This is a UI configuration step that affects how the project is displayed in Graphext. It applies to the dataset referenced in its inputs. If your recipe produces multiple datasets (e.g. a filtered dataset that is then passed to create_project alongside the original), you need to add separate configure steps for each dataset you want to configure.

Usage

The following example shows how the step can be used in a recipe.

Examples

configure_dataset_metadata(ds, { "label": "Dataset label", "description": "Useful dataset description", "info_source": "https://www.example.com" })

Inputs & Outputs

The following are the inputs expected by the step and the outputs it produces. These are generally columns (ds.first_name), datasets (ds or ds[["first_name", "last_name"]]) or models (referenced by name e.g. "churn-clf").
dataset
dataset
required
Dataset to be configured.

Configuration

The following parameters can be used to configure the behaviour of the step by including them in a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output).

Parameters

info_source
string
Source URL.Values must match the following regex pattern:
^https?://
description
string
Description of the dataset.
label
string
Label of the dataset.