Split string

text

Split a single text column into two.

Each value of a text column is split in two at the first occurrence of a given pattern, producing two new text columns. For example, splitting a text column on the comma character (",") produces two new columns: the first contains everything before the first comma in each text, and the second contains everything after that comma.

For any input text in which the split pattern does not occur, the first output column contains the original text and the second output column contains a missing value (NaN).
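The behaviour described above can be sketched in plain Python for a single text value. This is only an illustration, not the step's actual implementation: the helper name `split_first` is hypothetical, the pattern is assumed to be a literal string rather than a regular expression, and `None` stands in for the missing value (NaN).

```python
from typing import Optional, Tuple


def split_first(text: str, pattern: str = ",") -> Tuple[str, Optional[str]]:
    """Split `text` at the first occurrence of `pattern` (hypothetical helper).

    Assumes `pattern` is a literal, non-empty string. Returns (left, right);
    `right` is None when the pattern is not found, mirroring the missing value.
    """
    left, sep, right = text.partition(pattern)
    if not sep:
        # Pattern not found: keep the original text on the left.
        return text, None
    return left, right


# Apply the helper to every value of a small example column.
column = ["Madrid, Spain", "Paris, France, Europe", "Berlin"]
left_column = [split_first(value)[0] for value in column]
right_column = [split_first(value)[1] for value in column]
```

Only the first comma is used as the split point, so any further commas stay in the right-hand part.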

Example

To split each text on the first comma encountered when searching from the left:

split_string(ds.text, {"pattern": ","}) -> (ds.text_left, ds.text_right)

Usage

The following are the step's expected inputs and outputs and their specific types.

split_string(input: text, {"param": value}) -> (output_left: text, output_right: text)

The object {"param": value} is optional in most cases. If present, it may contain any of the parameters described in the Parameters section below.

Inputs


input: column:text

A text column to split.

Outputs


output_left: column:text

A text column containing the part to the left of the given split pattern.


output_right: column:text

A text column containing the part to the right of the given split pattern.

Parameters


pattern: string = " "

A pattern of characters indicating where to split each text. Defaults to a single space (" ").


right: boolean = False

Whether to search for the pattern starting from the right of each text. By default (False), the search starts from the left.
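As a rough sketch of how the two parameters might interact, the following plain-Python function mimics the step for a single text value. It is not the step's real implementation: the name `split_string_sketch` is hypothetical, the pattern is assumed to be a literal string, `None` stands in for NaN, and keeping the original text in the left output when the pattern is missing is assumed to hold for `right=True` as well.

```python
from typing import Optional, Tuple


def split_string_sketch(
    text: str, pattern: str = " ", right: bool = False
) -> Tuple[str, Optional[str]]:
    """Illustrative stand-in for the step, applied to one text value.

    Assumptions: `pattern` is a literal, non-empty string, and a missing
    pattern leaves the original text in the left output for both directions.
    """
    if right:
        # Search from the right: split at the last occurrence of the pattern.
        left, sep, rest = text.rpartition(pattern)
    else:
        # Search from the left: split at the first occurrence of the pattern.
        left, sep, rest = text.partition(pattern)
    if not sep:
        return text, None
    return left, rest


from_left = split_string_sketch("2021-03-15", pattern="-")
from_right = split_string_sketch("2021-03-15", pattern="-", right=True)
```

With the same input, searching from the left isolates the first segment, while searching from the right isolates the last one.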