This function calculates the percentile rank of each non-null value in a single-valued numeric or date column. Ranks are assigned according to each value's relative position in the sorted column, and results fall in the half-open interval [0, 1). Null values are preserved and do not affect the ranking. Multi-valued columns are not supported.
Usage
The following example shows how the step can be used in a recipe.
Examples
The following example calculates the percentile rank for a column of numerical values, producing a new numerical column where each value is ranked between 0 and 1.
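The ranking behaviour described on this page can be illustrated with a small Python sketch. This is not Graphext's implementation: the function name `percentile_rank` and the sample values are hypothetical, and the formula used here (the fraction of non-null values strictly below each value) is an assumption, chosen because it keeps results in [0, 1) and leaves nulls untouched. How the actual step handles ties is not stated in this documentation.

```python
from bisect import bisect_left
from typing import Optional, Sequence


def percentile_rank(values: Sequence[Optional[float]]) -> list[Optional[float]]:
    """Illustrative only: rank each non-null value by the fraction of
    non-null values strictly below it. Nulls (None) pass through unchanged."""
    # Nulls are excluded before sorting, so they cannot affect the ranking.
    non_null = sorted(v for v in values if v is not None)
    n = len(non_null)
    return [
        # bisect_left counts the sorted values strictly less than v,
        # so the largest value always ranks below 1.
        None if v is None else bisect_left(non_null, v) / n
        for v in values
    ]


# Hypothetical input column with one null and one tie.
ranks = percentile_rank([10.0, None, 30.0, 20.0, 30.0])
print(ranks)  # [0.0, None, 0.5, 0.25, 0.5]
```

Under this assumed definition tied values share the same rank, and the same logic applies to dates, since they sort the same way numbers do.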
Inputs & Outputs
The following are the inputs expected by the step and the outputs it produces. These are generally columns (ds.first_name), datasets (ds or ds[["first_name", "last_name"]]) or models (referenced by name, e.g. "churn-clf").
Inputs
A single-valued numeric or date column for percentile ranking.
Outputs
A new numerical column containing the percentile ranks for each value.
Configuration
The following parameters can be used to configure the behaviour of the step by including them in a JSON object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output).
Parameters
This step does not accept any parameters.