# Transform

| Step                                                                                                   | Fast | Description                                                                                                             |
| ------------------------------------------------------------------------------------------------------ | ---- | ----------------------------------------------------------------------------------------------------------------------- |
| [add\_noise](/api-docs/prepare/transform/add_noise)                                                    |      | Add noise to a column with numbers or lists of numbers                                                                  |
| [calculate](/api-docs/prepare/transform/calculate)                                                     |      | Evaluates a formula containing basic arithmetic over a dataset's columns                                                |
| [cast](/api-docs/prepare/transform/cast)                                                               | ⚡    | Interprets and changes a column's data to another (semantic) type                                                       |
| [concatenate](/api-docs/prepare/transform/concatenate)                                                 | ⚡    | Concatenate columns as text or lists with optional separator as well as pre- and postfix                                |
| [count\_unique](/api-docs/prepare/transform/count_unique)                                              | ⚡    | Counts the number of unique elements in each list/array of the input column                                             |
| [derive\_column](/api-docs/prepare/transform/derive_column)                                            | ⚡    | Derive a new column with a custom JS script                                                                             |
| [discretize\_on\_quantiles](/api-docs/prepare/transform/discretize_on_quantiles)                       | ⚡    | Discretize column into bins based on quantiles                                                                          |
| [discretize\_on\_values](/api-docs/prepare/transform/discretize_on_values)                             | ⚡    | Discretize column by binning its values using explicitly specified cut points                                           |
| [divide](/api-docs/prepare/transform/divide)                                                           | ⚡    | Divide two or more numeric columns in given order                                                                       |
| [equal](/api-docs/prepare/transform/equal)                                                             | ⚡    | Check the row-wise equality of all input columns                                                                        |
| [explode](/api-docs/prepare/transform/explode)                                                         | ⚡    | Explode (extract) items from column(s) of lists into separate rows                                                      |
| [extract\_date\_component](/api-docs/prepare/transform/extract_date_component)                         | ⚡    | Extract a component such as day, week, weekday etc. from a date column                                                  |
| [extract\_emoji](/api-docs/prepare/transform/extract_emoji)                                            |      | Parse texts and extract their emoji                                                                                     |
| [extract\_entities](/api-docs/prepare/transform/extract_entities)                                      |      | Parse texts and extract the entities mentioned (persons, organizations etc.)                                            |
| [extract\_hashtags](/api-docs/prepare/transform/extract_hashtags)                                      |      | Parse texts and extract any hashtags mentioned                                                                          |
| [extract\_json\_values](/api-docs/prepare/transform/extract_json_values)                               | ⚡    | Extract values from JSON columns using JsonPath                                                                         |
| [extract\_keywords](/api-docs/prepare/transform/extract_keywords)                                      |      | Parse and extract keywords from texts                                                                                   |
| [extract\_mentions](/api-docs/prepare/transform/extract_mentions)                                      |      | Parse texts and extract any mentions detected                                                                           |
| [extract\_ngrams](/api-docs/prepare/transform/extract_ngrams)                                          |      | Parse texts and extract their n-grams                                                                                   |
| [extract\_range](/api-docs/prepare/transform/extract_range)                                            | ⚡    | Create a copy of a column nullifying values outside a specified range                                                   |
| [extract\_regex](/api-docs/prepare/transform/extract_regex)                                            | ⚡    | Extract parts of texts detected using regular expressions                                                               |
| [extract\_text\_features](/api-docs/prepare/transform/extract_text_features)                           |      | Parse and process texts to extract multiple features at once                                                            |
| [extract\_url\_components](/api-docs/prepare/transform/extract_url_components)                         |      | Extract components from a URL                                                                                           |
| [is\_missing](/api-docs/prepare/transform/is_missing)                                                  | ⚡    | Check for missing values in a given column                                                                              |
| [label\_bios](/api-docs/prepare/transform/label_bios)                                                  |      | Categorize people into fields of occupation using their bios (biographies)                                              |
| [label\_categories](/api-docs/prepare/transform/label_categories)                                      | ⚡    | Relabel categories based on the top terms in each category                                                              |
| [label\_encode](/api-docs/prepare/transform/label_encode)                                              |      | Encode categories with values between 0 and N-1, where N is the number of unique categories                             |
| [label\_holidays](/api-docs/prepare/transform/label_holidays)                                          |      | Indicate if there are any holidays for given date, location pairs                                                       |
| [label\_political\_subtopics](/api-docs/prepare/transform/label_political_subtopics)                   |      | Categorize the political sub-topics of texts in Spanish                                                                 |
| [label\_political\_topics](/api-docs/prepare/transform/label_political_topics)                         |      | Categorize the political topics of texts in Spanish                                                                     |
| [label\_texts\_containing](/api-docs/prepare/transform/label_texts_containing)                         |      | Categorize texts containing specific keywords with custom labels                                                        |
| [label\_texts\_containing\_from\_query](/api-docs/prepare/transform/label_texts_containing_from_query) |      | Label texts given an elastic-like query string                                                                          |
| [length](/api-docs/prepare/transform/length)                                                           | ⚡    | Calculates the length of lists (number of elements) or texts/categories (number of characters)                          |
| [make\_constant](/api-docs/prepare/transform/make_constant)                                            | ⚡    | Creates a new constant column (with a single unique value) of the same length as the input column                       |
| [math\_func](/api-docs/prepare/transform/math_func)                                                    |      | Applies a mathematical function to the values of a (single) numeric column                                              |
| [merge\_similar\_semantics](/api-docs/prepare/transform/merge_similar_semantics)                       |      | Group categories with similar meanings                                                                                  |
| [merge\_similar\_spellings](/api-docs/prepare/transform/merge_similar_spellings)                       |      | Group categories with similar spellings                                                                                 |
| [multiply](/api-docs/prepare/transform/multiply)                                                       | ⚡    | Multiply two or more numeric columns                                                                                    |
| [normalize](/api-docs/prepare/transform/normalize)                                                     | ⚡    | Normalizes a numerical column by subtracting the mean and dividing by its standard deviation                            |
| [observed\_duration](/api-docs/prepare/transform/observed_duration)                                    | ⚡    | Calculate the duration between two dates and determine whether an event was observed before a specified observation da… |
| [order\_categories](/api-docs/prepare/transform/order_categories)                                      | ⚡    | (Re-)order the categories of a categorical column                                                                       |
| [pandas\_func](/api-docs/prepare/transform/pandas_func)                                                |      | Applies an arbitrary pandas supported function to the values of an input column                                         |
| [pct\_change](/api-docs/prepare/transform/pct_change)                                                  |      | Calculate percentage change between consecutive numbers in a numeric column                                             |
| [percentile\_rank](/api-docs/prepare/transform/percentile_rank)                                        | ⚡    | Convert the values in a numeric or date column into their percentile rank                                               |
| [query](/api-docs/prepare/transform/query)                                                             | ⚡    | Generate a boolean column based on a query string, marking rows that match the condition                                |
| [replace\_missing](/api-docs/prepare/transform/replace_missing)                                        | ⚡    | Replace missing values (NaNs) with either a specified constant value or the result of a given function                  |
| [replace\_regex](/api-docs/prepare/transform/replace_regex)                                            | ⚡    | Replace parts of text detected with a regular expression                                                                |
| [replace\_values](/api-docs/prepare/transform/replace_values)                                          | ⚡    | Replace specified values in a column with new ones                                                                      |
| [scale](/api-docs/prepare/transform/scale)                                                             | ⚡    | Scales the values of a numerical column to lie between a specified minimum and maximum                                  |
| [segment\_rows](/api-docs/prepare/transform/segment_rows)                                              | ⚡    | Create a segmentation using Graphext's advanced query syntax (similar to Elasticsearch)                                 |
| [slice](/api-docs/prepare/transform/slice)                                                             | ⚡    | Extract a range/slice of elements from a column of texts or lists                                                       |
| [split\_string](/api-docs/prepare/transform/split_string)                                              | ⚡    | Split a single column containing texts into two                                                                         |
| [subtract](/api-docs/prepare/transform/subtract)                                                       | ⚡    | Subtract two or more numeric columns                                                                                    |
| [sum](/api-docs/prepare/transform/sum)                                                                 | ⚡    | Calculate the row-wise sum of numeric columns                                                                           |
| [time\_interval](/api-docs/prepare/transform/time_interval)                                            | ⚡    | Calculates the duration of a time interval between two dates (datetimes/timestamps)                                     |
| [tokenize](/api-docs/prepare/transform/tokenize)                                                       |      | Parse texts and separate them into lists of tokens (words, lemmas, etc.)                                                |
| [trim\_frequencies](/api-docs/prepare/transform/trim_frequencies)                                      |      | Remove values whose frequencies (counts) are above/below a given threshold                                              |
| [unique](/api-docs/prepare/transform/unique)                                                           | ⚡    | Extracts the unique elements in each list/array                                                                         |
| [unpack\_list](/api-docs/prepare/transform/unpack_list)                                                |      | Unpack (extract) items from a column of lists into separate columns                                                     |
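
Several of the descriptions above state a formula in words. As a rough illustration only, the arithmetic behind `normalize`, `scale` and `label_encode` can be sketched in plain Python. This is not Graphext's implementation or step syntax: the function signatures, the use of the population standard deviation, and the first-appearance ordering of category codes are all assumptions made for the example.

```python
from statistics import mean, pstdev


def normalize(values):
    """Subtract the mean and divide by the standard deviation (z-score)."""
    mu = mean(values)
    sigma = pstdev(values)  # assumption: population (not sample) standard deviation
    # Note: a constant column has sigma == 0 and is not handled in this sketch.
    return [(v - mu) / sigma for v in values]


def scale(values, new_min=0.0, new_max=1.0):
    """Map values linearly so they lie between new_min and new_max."""
    lo, hi = min(values), max(values)
    span = hi - lo  # a constant column (span == 0) is not handled in this sketch
    return [new_min + (v - lo) * (new_max - new_min) / span for v in values]


def label_encode(categories):
    """Encode categories as integers between 0 and N-1 (N = unique categories)."""
    codes = {}
    for c in categories:
        # assumption: codes are assigned in order of first appearance
        codes.setdefault(c, len(codes))
    return [codes[c] for c in categories]


if __name__ == "__main__":
    print(normalize([1, 2, 3]))
    print(scale([2, 4, 6]))
    print(label_encode(["b", "a", "b", "c"]))
```

The linked page for each step remains the authority on its actual parameters and edge-case behaviour, such as how missing values or constant columns are treated.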
