trim_frequencies
Remove values whose frequencies (counts) are above/below a given threshold.
trim_frequencies(input: category|list[category], {
"param": value,
...
}) -> (output: column)
Affected categories are replaced with the missing value (NaN).
trim_frequencies(input: category|list[category], {
"param": value,
...
}) -> (output: column)
A categorical column to trim.
A categorical column with fewer categories than the input.
The number N indicating how many of the most common values to filter (in descending order).
Values must be in the following range:
0 ≤ n_most_common < inf
Values with a lower frequency (count) than this will be removed.
Values must be in the following range:
1 ≤ freq_min < inf
Values with a higher frequency (count) than this will be removed.
Values must be in the following range:
1 ≤ freq_max < inf
Was this page helpful?
trim_frequencies(input: category|list[category], {
"param": value,
...
}) -> (output: column)