Check out the
merge_similar_spellings
step for more information.
Parameters
- Column: the column to search and group terms in
- Strength threshold: a factor in the range to make the algorithm more or less sensitive. A value of 1 will merge all ocurrences, while a value closer to 0 will search for stronger correlation between the terms, thus being much more strict with the merging.