Severe Harm

B.L.O.B.I. (Big List of Bad Ideas)

Idea Title: B.L.O.B.I. (Big List of Bad Ideas)
Idea Description: An AI model designed to detect and track emerging hate groups by analyzing and learning from existing datasets of hate speech. In collaboration with trusted partners, we aim to …

AI Safety Network Initiative

Idea Title: AI Safety Network Initiative
Idea Description: An AI-powered connection model for sharing information on cross-platform abuse. Any industry could spin up its own central clearinghouse for sharing hashed content that identifies abusive users. E.g., make it easy for Bumble to tell …

UnSafeSets

Idea Title: UnSafeSets
Idea Description: Real-world datasets for training models on harmful content are siloed and insufficient to support emerging platform needs. We propose UnSafeSets: generating synthetic datasets to train models on harmful, high-sensitivity, low-prevalence content.
Contributors: …