Usage
The following example shows how the step can be used in a recipe.Examples
Examples
The following configuration allows for smallish clusters and considers fewish data points as noise:
Inputs & Outputs
The following are the inputs expected by the step and the outputs it produces. These are generally columns (ds.first_name
), datasets (ds
or ds[["first_name", "last_name"]]
) or models (referenced
by name e.g. "churn-clf"
).
Inputs
Inputs
Outputs
Outputs
A column containing cluster tags.
Configuration
The following parameters can be used to configure the behaviour of the step by including them in a json object as the last “input” to the step, i.e.step(..., {"param": "value", ...}) -> (output)
.
Parameters
Parameters
Clustering algorithm to use.Values must be one of the following:
louvain
The higher this value the bigger the clusters.Values must be in the following range:
The larger the value, the more conservative the clustering.
Cluster with this number of nodes or less will be considered noise.Values must be in the following range:
The graphext advanced query syntax used to select rows.