Check out the
merge_similar_semantics
step for more information.
Parameters
- Column: the column to search and group terms in
- Determine Language: specify the language of your terms. You can either set it manually, or select a column that holds the value for each row’s language.
- Strength Threshold: a factor in the range to make the algorithm more or less sensitive. A value of 1 will merge all ocurrences, while a value closer to 0 will search for stronger correlation between the terms, thus being much more strict with the merging.