filter_values
Filter rows where column matches specified values exactly.
Usage
The following examples show how the step can be used in a recipe.
To create a new dataset keeping only those rows where values in the “salary” column are either “low” or “high”.
To create a new dataset keeping only those rows where values in the “salary” column are either “low” or “high”.
Or, using the exclude
parameter to drop rows where “salary” values are either “low” or “high”:
General syntax for using the step in a recipe. Shows the inputs and outputs the step is expected to receive and will produce respectively. For futher details see sections below.
Inputs & Outputs
The following are the inputs expected by the step and the outputs it produces. These are generally
columns (ds.first_name
), datasets (ds
or ds[["first_name", "last_name"]]
) or models (referenced
by name e.g. "churn-clf"
).
Configuration
The following parameters can be used to configure the behaviour of the step by including them in
a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output)
.
Name of column to be matched against the specified values
.
Only rows matching these values exactly will be included in the resulting dataset. May be a single value or a list of values to be matched.
if true
, only rows not matching the specified values
will be included in the resulting dataset.