Includes or excludes rows of the input datset based on the values of a selected text or list column. Depending on the configuration, if the column contains any or all of the specified values, the corresponding rows will be kept or dropped in the output dataset.

“Containment” here means texts in a text column containing one or more specified substrings (words), or lists in a list column containing one or more elements matching the specified values. See below for illustrative examples.

column
string
required

Name of column to be matched against the specified values.

values
[number, string, array[['number', 'string']]]
required

Values to be matched in each row to decide its inclusion or exclusion. May be a single value or a list of values to be matched.

exclude
boolean

If true, matching rows will be excluded from the output dataset. I.e., only rows not containing the specified values will be returned.

contains_all
boolean

Rows must contain all specified value to pass filter, rather than any.

case_sensitive
boolean
default: "true"

Text values must match case to pass filter.