Join two datasets on their row indexes or on values in specified columns.
ds_right
) to the first (ds_left
). If the two datasets contain columns
with identical names (other than those used to perform the join), configurable suffixes will be appended to their names
in the resulting dataset (see suffixes
parameter below).
The rows included in the result depend on the kind of join (see the how
parameter below). Depending on whether
it’s a left, right, inner, or outer-join, may include rows from either dataset or both.
The join performed is always an equi-join, meaning that rows
from the left are matched with rows from the right where their respective values in the join column (or indexes)
are identical (e.g. where the value of column id
on the left is equal to the value of column id
on the right).
Also see Wikipedia’s article on table joins to learn more about them.
Examples
ds.first_name
), datasets (ds
or ds[["first_name", "last_name"]]
) or models (referenced
by name e.g. "churn-clf"
).
Inputs
Outputs
step(..., {"param": "value", ...}) -> (output)
.
Parameters
left
and add whatever information you can from
dataset right
”. If this is the case, you’ll want a left
-join.Values must be one of the following:left
right
outer
inner
"_index_"
, null
, or simply omit this parameter.Options
"_index_"
, null
, or simply omit this parameter.Options