extract_text_features

Essentially combines all of the following steps into one:

embed_text
extract_emoji
extract_entities
extract_hashtags
extract_keywords
extract_mentions
infer_sentiment
tokenize

Note that the step does not currently allow for detailed configuration of each of the extracted features. To do that, use any or all of the individual steps above.

Usage

The following shows how the step can be used in a recipe.

Examples

General syntax for using the step in a recipe. Shows the inputs and outputs the step is expected to receive and will produce respectively. For futher details see sections below.

extract_text_features(text: text, *lang: category, {
    "param": value,
    ...
}) -> (
	Sentiment: number,
	Embedding: list[number],
	Hashtags: list[category],
	Mentions: list[category],
	Keywords: list[category],
	Tokens: list[category],
	Emoji: list[category],
	People: list[category],
	Groups: list[category],
	Organizatons: list[category],
	GPEs: list[category],
	Locations: list[category],
	Products: list[category],
	Events: list[category],
	Money: list[category]
)

Inputs & Outputs

The following are the inputs expected by the step and the outputs it produces. These are generally columns (ds.first_name), datasets (ds or ds[["first_name", "last_name"]]) or models (referenced by name e.g. "churn-clf").

Inputs

Outputs

Configuration

The following parameters can be used to configure the behaviour of the step by including them in a json object as the last “input” to the step, i.e. step(..., {"param": "value", ...}) -> (output).

Parameters

Prepare

Report

Analyse

Usage

Inputs & Outputs

Configuration

Prepare

Report

Analyse

​Usage

​Inputs & Outputs

​Configuration

Usage

Inputs & Outputs

Configuration