Text Moderation (Google)

This evaluator uses Google's text analysis to score content against safety attributes such as toxicity, profanity, and threats. It is designed to identify potentially harmful or sensitive content in text documents.

Configuration

Option	Description	Type	Default	Required	Constraints
Attributes	Specific attributes to check within the text. An empty list implies all are checked.	`[]string`	`[]`	`false`	`Enum: TOXICITY, DEROGATORY, VIOLENCE, SEX, INSULT, PROFANITY, THREAT, WEAPONS, PUBLIC_SAFETY, HEALTH, RELIGION, ILLICIT_DRUGS, WAR, FINANCIAL, POLITICAL, LEGAL`
Threshold	The confidence threshold to flag an attribute as concerning.	`float64`	`0.7`	`true`	`Min: 0` `Max: 1`
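As a rough illustration of the constraints in the table above, the following sketch validates the two options in Python. The class name `TextModerationConfig` and its methods are hypothetical and are not part of any documented API; only the option names, defaults, enum values, and the 0 to 1 range come from the table.

```python
from dataclasses import dataclass, field

# Attribute names copied from the Constraints column of the table above.
ALLOWED_ATTRIBUTES = (
    "TOXICITY", "DEROGATORY", "VIOLENCE", "SEX", "INSULT", "PROFANITY",
    "THREAT", "WEAPONS", "PUBLIC_SAFETY", "HEALTH", "RELIGION",
    "ILLICIT_DRUGS", "WAR", "FINANCIAL", "POLITICAL", "LEGAL",
)


@dataclass
class TextModerationConfig:
    """Hypothetical container for the two documented options."""

    attributes: list = field(default_factory=list)  # documented default: []
    threshold: float = 0.7                          # documented default: 0.7

    def __post_init__(self):
        # Reject any name that is not in the documented enum.
        unknown = [a for a in self.attributes if a not in ALLOWED_ATTRIBUTES]
        if unknown:
            raise ValueError(f"unknown attributes: {unknown}")
        # Enforce the documented Min: 0 / Max: 1 constraint.
        if not 0 <= self.threshold <= 1:
            raise ValueError("threshold must be between 0 and 1")

    def effective_attributes(self):
        """An empty list implies that all attributes are checked."""
        if self.attributes:
            return list(self.attributes)
        return list(ALLOWED_ATTRIBUTES)
```

With the defaults, `TextModerationConfig().effective_attributes()` returns all 16 attribute names, which mirrors the "empty list implies all are checked" rule.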

Metrics

This evaluator reports the following metrics, one per attribute:

Name	Key	Description	Max
Toxicity	toxicity	The probability that the text is toxic.	1
Derogatory	derogatory	The probability that the text is derogatory.	1
Violence	violence	The probability that the text contains violent content.	1
Sex	sex	The probability that the text contains sexual content.	1
Insult	insult	The probability that the text contains insulting content.	1
Profanity	profanity	The probability that the text contains profanity.	1
Threat	threat	The probability that the text contains threats of violence.	1
Weapons	weapons	The probability that the text mentions firearms and weapons.	1
Public Safety	public_safety	The probability that the text mentions public safety issues.	1
Health	health	The probability that the text mentions health issues.	1
Religion	religion	The probability that the text mentions religion and belief.	1
Illicit Drugs	illicit_drugs	The probability that the text mentions illicit drugs.	1
War	war	The probability that the text mentions war and conflict.	1
Financial	financial	The probability that the text mentions finance-related topics.	1
Political	political	The probability that the text mentions political content.	1
Legal	legal	The probability that the text discusses legal issues.	1
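To show how these metrics might relate to the `Threshold` option, here is a minimal sketch that picks out the metric keys whose score reaches the threshold. The function name `flagged_attributes` is hypothetical, the scores are made-up values, and the use of `>=` rather than `>` is an assumption, since the source does not say how a score exactly equal to the threshold is treated.

```python
def flagged_attributes(scores, threshold=0.7):
    """Return the metric keys whose score is at or above the threshold.

    `scores` maps metric keys (e.g. "toxicity") to probabilities in [0, 1].
    Treating a score equal to the threshold as flagged is an assumption.
    """
    return sorted(key for key, score in scores.items() if score >= threshold)


# Made-up scores for illustration only.
scores = {"toxicity": 0.91, "insult": 0.74, "profanity": 0.12}
print(flagged_attributes(scores))  # ['insult', 'toxicity']
```

Lowering the threshold flags more attributes and raising it flags fewer, so the value is a trade-off between missed content and false positives.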

Additional Information

To learn more, see Google Cloud's text moderation documentation.
