Functions

abs

Returns the absolute value of the input.

Syntax

abs(expr)

add

Adds the two inputs.

Syntax

add(expr1, expr2)

and

Logical and operation of two or more boolean columns.

Syntax

and(expr1, expr2)

automated_readability_index

Returns the ARI (Automated Readability Index) which outputs a number that approximates the grade level needed to comprehend the text. For example if the ARI is 6.5, then the grade level to comprehend the text is 6th to 7th grade.

Syntax

bleu

Computes the BLEU score between two columns.

Syntax

character_count

Returns the number of characters in a text column.

Syntax

Aliases

  • num_chars

divide

Divides the two inputs.

Syntax

equal_to

Computes the element-wise equal to comparison of two columns.

Syntax

Aliases

  • eq

filter

Filters a column using another column as a mask.

Syntax

flesch_kincaid_grade

Returns the Flesch-Kincaid Grade of the given text. This is a grade formula in that a score of 9.3 means that a ninth grader would be able to read the document.

Syntax

greater_than

Computes the element-wise greater than comparison of two columns. input1 > input2

Syntax

Aliases

  • gt

greater_than_or_equal_to

Computes the element-wise greater than or equal to comparison of two columns. input1 >= input2

Syntax

Aliases

  • gte

is_valid_json

Returns true if the input string is valid json.

Syntax

less_than

Computes the element-wise less than comparison of two columns. input1 < input2

Syntax

Aliases

  • lt

less_than_or_equal_to

Computes the element-wise less than or equal to comparison of two columns. input1 <= input2

Syntax

Aliases

  • lte

levenshtein

Returns Damerau-Levenshtein distance between two strings.

Syntax

list_has_duplicate

Returns True if the list has duplicated items.

Syntax

list_len

Returns the length of lists in a list column.

Syntax

list_most_common

Most common item in list.

Syntax

multiply

Multiplies the two inputs.

Syntax

negate

Returns the negation of the input.

Syntax

not

Logical not operation of a boolean column.

Syntax

not_equal_to

Computes the element-wise not equal to comparison of two columns.

Syntax

Aliases

  • neq

or

Logical or operation of two or more boolean columns.

Syntax

rouge1

Returns the rouge1 score between two columns.

Syntax

rouge2

Returns the rouge2 score between two columns.

Syntax

rougeL

Returns the rougeL score between two columns.

Syntax

rougeLsum

Returns the rougeLsum score between two columns.

Syntax

sentence_count

Returns the number of sentences in a text column.

Syntax

Aliases

  • num_sentences

subtract

Subtracts the two inputs.

Syntax

token_count

Returns the number of tokens in a text column.

Syntax

word_count

Returns the number of words in a text column.

Syntax

Aliases

  • num_words

Was this helpful?