filter_missing
Filter rows based on missing values in one or more columns.
By default keeps only those rows where values in selected columns are not missing (non-NaNs). Using the exclude
parameter, the row selection can be inverted, such that only rows with missing values in selected rows
will be returned.
Usage
The following example shows how the step can be used in a recipe.
To keep only those rows where neither “address” nor “name” is missing
Inputs & Outputs
The following are the inputs expected by the step and the outputs it produces. These are generally
columns (ds.first_name
), datasets (ds
or ds[["first_name", "last_name"]]
) or models (referenced
by name e.g. "churn-clf"
).
Configuration
The following parameters can be used to configure the behaviour of the step by including them in
a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output)
.
Was this page helpful?