Given a distribution name with a scale and loc parameters, the step optionally applies another scaling to it either based on the standard deviation of the column or a proportionally to each point through the relative parameter in order to preserve the underlying structure of the data. Then the computation is carried as follows:

new value = original value + relative scaling factor * random sample from the distribution.

If this relative parameter is not given or is set to abs, then the relative scaling factor is 1.

relative
[number, string]
default: "abs"

Mode to use. Either set to “std” to use the standard deviation, or use a number to scale the sampling.

dist_name
string
default: "normal"

Distribution Function that noise is sampled from.

Values must be one of the following:

  • gumbel
  • laplace
  • logistic
  • normal
loc
number

Mean (“centre”) of the chosen distribution.

scale
number
default: "1.0"

Standard deviation (spread or “width”) of the distribution.

seed
[number, null]

The seed to use for the random distribution, if you wish to get reproducibility in your results.