clean_categories
Clean a given column of categories or lists of categories using OpenAI.
Usage
The following example shows how the step can be used in a recipe.
Example: reduce the number of categories to 10.
Inputs & Outputs
The following are the inputs expected by the step and the outputs it produces. These are generally columns (ds.first_name), datasets (ds or ds[["first_name", "last_name"]]) or models (referenced by name, e.g. "churn-clf").
Configuration
The following parameters can be used to configure the behaviour of the step by including them in a JSON object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output).
Associated integration.
Categories desired in the result column. If passed, instructions are ignored.
Further instructions to generate the desired set of categories. You can ask for things like ‘generate around 5 categories’.
Approximate number of categories to generate. Can be a float from 0 to 1 (interpreted as a fraction of the number of unique categories) or an integer greater than 1.
Model Configuration. Configuration for OpenAI’s model.
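The "approximate number of categories" parameter above accepts either a fraction or an absolute count. As a rough illustration of that rule, the following Python sketch turns such a value into a target count. The helper name resolve_target_count, the rounding, and the cap at the number of unique categories are all assumptions made for this example; the step's actual internal behaviour is not documented here.

```python
def resolve_target_count(n_categories, n_unique):
    """Hypothetical helper: interpret a 'number of categories' setting.

    - float in (0, 1]: fraction of the unique categories (rounding is assumed)
    - int greater than 1: absolute target, capped at n_unique (cap is assumed)
    """
    if isinstance(n_categories, bool):
        raise ValueError("n_categories must be a float or an integer")
    if isinstance(n_categories, float) and 0 < n_categories <= 1:
        # Fractional form: keep at least one category.
        return max(1, round(n_categories * n_unique))
    if isinstance(n_categories, int) and n_categories > 1:
        # Integer form: cannot ask for more categories than exist.
        return min(n_categories, n_unique)
    raise ValueError(
        "n_categories must be a float from 0 to 1 or an integer greater than 1"
    )


# A column with 40 unique categories:
quarter = resolve_target_count(0.25, 40)  # fraction of unique categories
ten = resolve_target_count(10, 40)        # absolute count
```

Under these assumptions, both calls ask for the same target: a quarter of 40 unique categories, or 10 categories directly.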