Synthetic data to train machine learning models may be key in building stakeholder trust in AI

Nick Rockel

March 29, 2024 at 10:33 a.m.·3 min read

Companies can’t avoid working with data, but management of that data can pose serious challenges.

Customer and other personal data keep escaping, courtesy of breaches that surged 78% last year in the U.S., hitting a record 3,205. Total victims? An eye-popping 353 million.

And don’t forget the trust issues created by using real-world data to train AI. That hasn’t worked out so well for accident-prone autonomous cars, or for reliably racist chatbots.

Part of the solution? Synthetic data.

To be clear, synthetic data isn’t fake. In fact, it can be better than the real thing. Let me explain, with help from executives at a pair of synthetic data providers.

Synthetic data falls into two buckets, says Yashar Behzadi, founder and CEO of San Francisco–based Synthesis AI.

Structured data is what you find in database tables from industries like banking and health care. Let’s say a hospital doesn’t want to expose any patient data. “What you can essentially do is create a copy of that data that has all the statistical properties, but none of the actual information or data,” Behzadi says. “That allows folks to then work on it or share it and take it outside of specific safety bounds.”

Then there’s unstructured data—images and video used by applications based on computer vision. That’s where Synthesis plays, using CGI and generative AI to create data that helps train the systems behind technologies such as identity verification, extended reality (XR), and driver monitoring.

For example, if a facial recognition model is trained without a balanced dataset, it might have biases against dark-skinned or older people. To avoid that, Synthesis builds digital humans and uses them to generate high-quality data. “We can easily represent every ethnicity, every age, every different demographic to ensure our systems are completely bias-free,” says Behzadi, whose customers include Fortune 500 companies. “If it’s synthetic, it’s completely privacy-compliant as well.”

Alexandra Ebert is chief trust officer of Vienna-headquartered MOSTLY AI, which provides AI-generated, structured synthetic data for banks, insurers, telecoms, and health care companies. “They have plenty of existing data, but of course, it’s privacy-sensitive,” says Ebert, who runs an online course on synthetic data. “What they want to use synthetic data for is to basically anonymize it so that they’re out of scope from privacy laws.”

One of MOSTLY’s clients, bank Erste Group, likes synthetic data because it’s considered superior to traditional anonymization methods, which offer ways to piece the original data back together.

Synthetic data is taking off. By this year, 60% of the data used to train Al models will be synthetic, Gartner has predicted. That’s a huge jump from just 1% in 2021.

With help from generative AI, it’s now possible to create sophisticated simulations using unstructured synthetic data, Behzadi notes. Because that data is easier and cheaper to generate than real data, some applications will explode, he reckons. Rather than spend billions deploying fleets, autonomous vehicle makers can build simulations that include so-called edge cases, like a child running in front of a car. Another use: creating digital doubles of robots.

Ebert highlights data augmentation—using a synthetic data generator to create information that wasn’t in the original data set. For instance, a bank could take that approach to better understand fraud cases.

She also sees a chance for companies to democratize data by launching internal synthetic data hubs. The goal: “to go from synthetic data as a resource that belongs to the high priests of data science within an organization to data that is used by everyone.”

That would be real progress.

Nick Rockel
nick.rockel@consultant.fortune.com

This story was originally featured on Fortune.com

WPTV- West Palm Beach Scripps
Tornado deposits dumpster onto roof of Palm Beach Gardens home
One of the tornadoes that hit Florida was so powerful that it tossed a giant dumpster — which typically weighs a couple of tons — onto the roof of a home in the Avenir community of Palm Beach Gardens.
The Canadian Press
Northwestern exploits Maryland's turnovers, rolls 37-10 to earn first Big Ten victory
COLLEGE PARK, Md. (AP) — Defensive end Aidan Hubbard returned a fumble recovery for a touchdown in the fourth quarter, and Northwestern pulled away from Maryland 37-10 on Friday night for its first Big Ten victory this season.
The Canadian Press
Gravel made 31 saves as Saint John blanks Sherbrooke in QMJHL action
ST. JOHN, N.B. — Charles-Édward Gravel made 31 saves for the shutout as Saint John downed Sherbrooke 3-0 in Quebec Maritimes Junior Hockey League action Friday night.
The Canadian Press
Eichel, Barbashev and Theodore have a goal and assist to lead Golden Knights to 4-3 win over Blues
LAS VEGAS (AP) — Jack Eichel, Ivan Barbashev and Shea Theodore each had a goal and an assist on Friday night to lead the Vegas Golden Knights to a 4-3 victory over the St. Louis Blues.
NBA.com
Trayce Jackson-Davis rises to block the shot
Trayce Jackson-Davis rises to block the shot, 10/11/2024
NBA.com
Keegan Murray sinks it from downtown
Keegan Murray sinks it from downtown, 10/11/2024
NBA.com
De'Aaron Fox drills the trey
De'Aaron Fox drills the trey, 10/11/2024
NBA.com
DeMar DeRozan with a last basket of the period vs the Golden State Warriors
DeMar DeRozan (Sacramento Kings) with a last basket of the period vs the Golden State Warriors, 10/11/2024
NBA.com
What a shot by Buddy Hield
What a shot by Buddy Hield, 10/11/2024
NBA.com
Domantas Sabonis rocks the rim
Domantas Sabonis rocks the rim, 10/11/2024
NBA.com
Moses Moody scores and draws the foul
Moses Moody scores and draws the foul, 10/11/2024
NBA.com
Lindy Waters III gets the And-1
Lindy Waters III gets the And-1, 10/11/2024
NBA.com
Stephen Curry sinks it from downtown
Stephen Curry sinks it from downtown, 10/11/2024
NBA.com
Mason Jones scores and draws the foul
Mason Jones scores and draws the foul, 10/11/2024
NBA.com
Keon Ellis gets the And-1
Keon Ellis gets the And-1, 10/11/2024
NBA.com
Pat Spencer with a 2-pointer vs the Sacramento Kings
Pat Spencer (Golden State Warriors) with a 2-pointer vs the Sacramento Kings, 10/11/2024
The Canadian Press
As Hezbollah and Israel battle on the border, Lebanon's army watches from the sidelines
BEIRUT (AP) — Since Israel launched its ground invasion of Lebanon, Israeli forces and Hezbollah militants have clashed along the border while the Lebanese army has largely stood on the sidelines.
The Canadian Press
Padres muster no offense to support Yu Darvish's inspired pitching in NLDS loss to Dodgers
LOS ANGELES (AP) — Yu Darvish tapped the “PS” patch on his uniform and then went to work on the way to an inspired performance.
The Independent
Hurricane Milton leaves trail of destruction across Florida with 1.8m without power as death toll mounts: Live updates
Wind and storm surge warnings for Milton have been discontinued but hazards in the hurricane’s aftermath remain
People
Tori Spelling Kisses “DWTS” Pro Ezra Sosa at Gala, Tells Anna Delvey ‘You Were Missed’
The 'Beverly Hills, 90210' alum is showing some major love!

S&P/TSX

S&P 500

DOW

CAD/USD

CRUDE OIL

Bitcoin CAD

XRP CAD

GOLD FUTURES

RUSSELL 2000

10-Yr Bond

NASDAQ

VOLATILITY

FTSE

NIKKEI 225

CAD/EUR

Synthetic data to train machine learning models may be key in building stakeholder trust in AI

Latest Stories

Tornado deposits dumpster onto roof of Palm Beach Gardens home

Northwestern exploits Maryland's turnovers, rolls 37-10 to earn first Big Ten victory

Gravel made 31 saves as Saint John blanks Sherbrooke in QMJHL action

Eichel, Barbashev and Theodore have a goal and assist to lead Golden Knights to 4-3 win over Blues

Trayce Jackson-Davis rises to block the shot

Keegan Murray sinks it from downtown

De'Aaron Fox drills the trey

DeMar DeRozan with a last basket of the period vs the Golden State Warriors

What a shot by Buddy Hield

Domantas Sabonis rocks the rim

Moses Moody scores and draws the foul

Lindy Waters III gets the And-1

Stephen Curry sinks it from downtown

Mason Jones scores and draws the foul

Keon Ellis gets the And-1

Pat Spencer with a 2-pointer vs the Sacramento Kings

As Hezbollah and Israel battle on the border, Lebanon's army watches from the sidelines

Padres muster no offense to support Yu Darvish's inspired pitching in NLDS loss to Dodgers

Hurricane Milton leaves trail of destruction across Florida with 1.8m without power as death toll mounts: Live updates

Tori Spelling Kisses “DWTS” Pro Ezra Sosa at Gala, Tells Anna Delvey ‘You Were Missed’