Skip to content

Split string

fast step  text

Split a single column containing texts into two.

The values of a text column will be split in two at the first occurrence of a given pattern, returning two new text columns. For example, splitting a text column on the comma character (",") will produce two new columns: the first containing everything before the first comma encountered in each text, and the second containing all text encountered after the comma.

If the specified split pattern was not encountered in any of the input texts, the first output column will contain the original text, and the second column will contain missing values only (NaN).


The following are the step's expected inputs and outputs and their specific types.

Step signature
split_string(input: text|category, {
    "param": value
}) -> (output_left: text, output_right: text)

where the object {"param": value} is optional in most cases and if present may contain any of the parameters described in the corresponding section below.


E.g. to split on the first comma encountered starting from the left of each text:

Example call (in recipe editor)
split_string(ds.text, {"pattern": ","}) -> (ds.text_left, ds.text_right)


input: column:text|category

A text column to split.


output_left: column:text

A text column containing the part to the left of the given split pattern.

output_right: column:text

A text column containing the part to the right of the given split pattern.


pattern: string = " "

A pattern of characters indicating where to split each text. By default uses the whitespace " ".

right: boolean = False

Whether to search for the pattern starting from the right instead of starting from the left (default).