I.e., vertically concatenates two datasets, appending the rows of the second to the end of the first.

When the two datasets contain different columns, the join parameter controls whether only the common columns are kept (inner), or all columns (outer). In the latter case, rows will have missing values (NaNs), where a column only existed in one of the two datasets.

Usage

The following example shows how the step can be used in a recipe.

Inputs & Outputs

The following are the inputs expected by the step and the outputs it produces. These are generally columns (ds.first_name), datasets (ds or ds[["first_name", "last_name"]]) or models (referenced by name e.g. "churn-clf").

Configuration

The following parameters can be used to configure the behaviour of the step by including them in a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output).