Facebook hopes its new AI moderation tools can further counter hate speech

The company's human moderators remain unconvinced.

·Former Senior Editor

Updated November 19, 2020 at 1:01 p.m.·6 min read

Facebook has waged a long-fought and sometimes seemingly losing battle against hate speech and misinformation spreading across its platform. On Thursday, the company rolled out the latest implements of its automated anti-trolling arsenal in an effort to further curb bigots and bad actors on the site.

The company’s CTO, Mike Schroepfer, noted that Facebook has taken a number of proactive steps in the last year to combat hate speech and that those efforts have already begun to show results. In the first quarter of 2020, the company took action against 9.6 million pieces of content, almost double the 5.7 million in the quarter prior. “Q3 of last year to Q3 of this year, on Facebook, we've actually done over three times as much content takedowns via our automated systems, detecting hate speech,” Schroepfer told an assembly of reporters via Zoom on Wednesday. “There's not a lot in life that improves three x over a year. So I think that's, that's pretty good.”

Instagram also saw a large influx of automated takedowns in the last quarter, effectively doubling the rate of the same period before it. “[We] are now at a similar practice rate on Instagram, as we are on Facebook,” Schroepfer continued. “So we're seeing about a 95 percent proactive rate on both of those platforms.”

Of course, the baselines for those figures are continually in flux. “COVID misinformation didn't exist in Q4 of 2019, for example,” he said. “And there can be quite a change in a conversation during an election. So what I'd say is you always have to look at all these metrics together, in order to get the biggest picture.”

In addition to Facebook’s existing array of tools including semi-supervised self-learning models and XLM-R, the company unveiled and implemented a pair of new technologies. The first, Schroepfer said, is Linformer, “which is basically an optimization of how these large language models work that allow us to deploy them sort of at the massive scale, we need to address all the content we have on Facebook.”

Linformer is a first-of-its-kind Transformer architecture. Transformers are the model of choice for a number of natural language processing (NLP) applications because, unlike the recurrent neural networks that came before them, they can process data in parallel, which makes training models faster. But that parallel processing is resource hungry: in a conventional Transformer, the memory and compute needed for self-attention grow quadratically as the input length increases. Linformer is different. Its resource needs scale linearly with input length, allowing it to process more inputs using fewer resources than conventional Transformers.
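To make the scaling difference concrete, here is a minimal NumPy sketch, not Facebook's implementation. Conventional self-attention builds a score matrix with one entry per pair of tokens, while Linformer-style attention first projects the keys and values along the sequence axis down to a fixed number of rows. The sizes `n`, `d` and `k` are illustrative assumptions, and the projection matrices `E` and `F` are random here purely for demonstration; in a trained model they would be learned.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def full_attention(Q, K, V):
    # Conventional attention: the score matrix is (n, n).
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    weights = softmax(scores)
    return weights @ V, weights

def linformer_attention(Q, K, V, E, F):
    # Linformer-style attention: E and F have shape (k, n) and shrink
    # the keys and values from n rows to k rows before attention.
    K_proj = E @ K  # (k, d)
    V_proj = F @ V  # (k, d)
    scores = Q @ K_proj.T / np.sqrt(Q.shape[-1])  # (n, k)
    weights = softmax(scores)
    return weights @ V_proj, weights

# Illustrative sizes (assumptions): sequence length, head width, projected length.
n, d, k = 512, 64, 32
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))
# Random stand-ins for the projections; a real model would learn these.
E = rng.standard_normal((k, n)) / np.sqrt(n)
F = rng.standard_normal((k, n)) / np.sqrt(n)

out_full, w_full = full_attention(Q, K, V)
out_lin, w_lin = linformer_attention(Q, K, V, E, F)
print(w_full.shape, w_lin.shape)
```

With `k` held fixed, doubling `n` doubles the size of the Linformer-style score matrix but quadruples the size of the conventional one.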

The other new tech is called RIO. “Instead of the traditional model for all of the things I talked about over the last five years,” Schroepfer said, “take a classifier, build it, train it, test it offline, maybe test it with some online data and then deploy it into production, we have a system that can end-to-end learn.”

Specifically, RIO is an end-to-end optimized reinforcement learning (RL) framework that generates classifiers -- the tests that trigger an enforcement action against a specific piece of content based on the class associated with its datapoint (think, the process that determines whether or not an email is spam) -- using online data.

“What we typically try to do is set up our classifiers to work at a very high threshold, which means sort of when in doubt, it doesn't take an action,” Schroepfer said. “So we only take an action when the classifier is highly confident, or we're highly confident based on empirical testing, that that classifier is going to be right.”

Those thresholds regularly change depending on the sort of content that is being examined. For example, the threshold for hate speech on a post is quite high because the company prefers not to mistakenly take down non-offending posts. The threshold for spammy ads, on the other hand, is quite low.
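The threshold logic described above can be sketched in a few lines of Python. This is only an illustration: the category names, the threshold values and the `should_act` helper are hypothetical and are not taken from Facebook's systems.

```python
# Hypothetical per-category confidence thresholds (illustrative values only).
THRESHOLDS = {
    "hate_speech": 0.95,  # high bar: avoid removing non-offending posts
    "spam_ad": 0.60,      # lower bar: a mistaken removal is less costly
}

def should_act(category: str, score: float) -> bool:
    """Return True only if the classifier score clears the category's threshold."""
    return score >= THRESHOLDS[category]

# The same classifier confidence leads to different outcomes per category.
print(should_act("hate_speech", 0.80))  # False: when in doubt, take no action
print(should_act("spam_ad", 0.80))      # True
```

Raising a threshold trades missed violations for fewer mistaken takedowns, which is why the value differs by content type.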

In Schroepfer’s hate speech example, the metrics RIO is pulling are regarding prevalence rates. “It's actually using some of the prevalence metrics and others that we released as its sort of score and it's trying to take those numbers down,” Schroepfer explained. “It is really optimizing from the end objective all the way backwards, which is a pretty exciting thing.”

“If I take down 1000 pieces of content that no one was going to see anyway, it doesn't really matter,” Schroepfer stated. “If I catch the one piece of content that it was about to go viral before it does that, that can have a massive, massive impact. So I think that prevalence is our end goal in terms of the impact that has on users, in terms of how we're making progress on these things.”
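A small worked example shows why a view-weighted metric rewards catching the viral post. It assumes prevalence is measured as the share of all content views that land on violating content; the `Post` type, the `prevalence` function and every number below are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Post:
    views: int
    violating: bool

def prevalence(posts):
    """Share of all views that landed on violating content (assumed definition)."""
    total = sum(p.views for p in posts)
    if total == 0:
        return 0.0
    bad = sum(p.views for p in posts if p.violating)
    return bad / total

# Illustrative data: 100 benign posts, 1,000 violating posts nobody saw,
# and a single violating post that went viral.
benign = [Post(views=10_000, violating=False) for _ in range(100)]
unseen = [Post(views=0, violating=True) for _ in range(1_000)]
viral = [Post(views=50_000, violating=True)]

baseline = prevalence(benign + unseen + viral)
without_unseen = prevalence(benign + viral)   # removed the 1,000 unseen posts
without_viral = prevalence(benign + unseen)   # removed only the viral post

print(f"{baseline:.4f} {without_unseen:.4f} {without_viral:.4f}")
```

Under this assumed definition, removing the 1,000 unseen posts leaves prevalence unchanged, while removing the single viral post drives it to zero.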

One immediate application will be automatically identifying subtly changed clones of already-known violating images -- whether the change is the addition of text or a border, or a slight overall blurring or crop. “The challenge here is we have very, very, very high thresholds, because we don't want to accidentally take anything down, you know, adding a single ‘not’ or ‘no’ or ‘this is wrong’ on this post completely changes the meaning of it,” he continued.

Memes continue to be one of the company’s most vexing hate speech and misinformation vectors, due in part to their multimodal nature. Moderating them requires a great deal of subtle understanding, according to Schroepfer. “You have to understand the text, the image, you may be referring to current events and so you have to encode some of that knowledge. I think from a technology standpoint, it's one of the most challenging areas of hate speech.”

But as RIO continues to generate increasingly accurate classifiers, it will grant Facebook’s moderation teams far more leeway and opportunity to enforce the community guidelines. The advances should also help moderators more easily root out hate groups lurking on the platform. “One of the ways you'd want to identify these groups is if a bunch of the content in it is tripping our violence or hate speech classifiers,” Schroepfer said. “The content classifiers are immensely useful, because they can be input signals into these things.”
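One way to picture content classifiers acting as "input signals" for group-level detection is to aggregate per-post scores into a single rate for the group. The sketch below is hypothetical: the function names, the score threshold and the minimum flag rate are assumptions chosen for illustration, not details Facebook has described.

```python
def group_flag_rate(scores, threshold):
    """Fraction of a group's posts whose classifier score meets the threshold."""
    if not scores:
        return 0.0
    return sum(s >= threshold for s in scores) / len(scores)

def needs_review(scores, threshold=0.9, min_rate=0.3):
    """Flag a group for human review when enough of its posts trip the classifier."""
    return group_flag_rate(scores, threshold) >= min_rate

# Hypothetical per-post hate speech scores for two groups.
group_a = [0.95, 0.97, 0.20, 0.92, 0.10]  # 3 of 5 posts trip the classifier
group_b = [0.10, 0.20, 0.95, 0.30, 0.05, 0.10, 0.20, 0.10, 0.30, 0.20]  # 1 of 10

print(needs_review(group_a), needs_review(group_b))
```

A single high-scoring post is not enough to flag a group here; the signal only fires when violations make up a meaningful share of its content.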

Facebook has spent the past half decade developing its automated detection and moderation systems, yet its struggles with moderation continue. Earlier this year, the company settled a case brought by 11,000 traumatized moderators for $52 million. And earlier this week, moderators issued an open letter to Facebook management arguing that the company’s policies were putting their “lives in danger” and that the AI systems designed to alleviate the psychological damage of their jobs are still years away.

“My goal is to continue to push this technology forward,” Schroepfer concluded, “so that hopefully, at some point, zero people in the world who have to encounter any of this content that violates our community standards.”
