Generate a boolean column based on a query string, marking rows that match the condition.
The following example shows how the step can be used in a recipe.
Examples
This example creates a new boolean column that flags rows where the ‘cats’ column contains the value ‘red’:
This example creates a new boolean column that flags rows where the ‘cats’ column contains the value ‘red’:
General syntax for using the step in a recipe. Shows the inputs and outputs the step is expected to receive and will produce respectively. For futher details see sections below.
The following are the inputs expected by the step and the outputs it produces. These are generally
columns (ds.first_name
), datasets (ds
or ds[["first_name", "last_name"]]
) or models (referenced
by name e.g. "churn-clf"
).
Inputs
The input dataset to evaluate against the query.
Outputs
A boolean column indicating whether each row in the dataset matches the query.
The following parameters can be used to configure the behaviour of the step by including them in
a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output)
.
Parameters
The graphext advanced query used to identify rows to flag.
Generate a boolean column based on a query string, marking rows that match the condition.
The following example shows how the step can be used in a recipe.
Examples
This example creates a new boolean column that flags rows where the ‘cats’ column contains the value ‘red’:
This example creates a new boolean column that flags rows where the ‘cats’ column contains the value ‘red’:
General syntax for using the step in a recipe. Shows the inputs and outputs the step is expected to receive and will produce respectively. For futher details see sections below.
The following are the inputs expected by the step and the outputs it produces. These are generally
columns (ds.first_name
), datasets (ds
or ds[["first_name", "last_name"]]
) or models (referenced
by name e.g. "churn-clf"
).
Inputs
The input dataset to evaluate against the query.
Outputs
A boolean column indicating whether each row in the dataset matches the query.
The following parameters can be used to configure the behaviour of the step by including them in
a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output)
.
Parameters
The graphext advanced query used to identify rows to flag.