OpenAI’s Impressive New Chatbot Isn’t Immune to Racism

Tony Ho Tran

December 5, 2022 at 9:10 p.m.·6 min read

Photo Illustration by Thomas Levinson/The Daily Beast/Getty

“OpenAI’s latest language model, ChatGPT, is making waves in the world of conversational AI. With its ability to generate human-like text based on input from users, ChatGPT has the potential to revolutionize the way we interact with machines.”

That paragraph was entirely generated by ChatGPT—the new chatbot released by artificial intelligence research lab OpenAI—using the query “write a lede for a story about ChatGPT for The Daily Beast.” Aside from helping lazy writers with their stories, the bot went viral on social media after its release on Nov. 30 and has even “crossed 1 million users” less than a week later, according to OpenAI CEO Sam Altman.

It’s easy to see why if you spend a few minutes “chatting” with it. You can give ChatGPT prompts as simple as “What’s the recipe for an Old Fashioned?” or as complex as “Tell me the story of the Tortoise and the Hare, but you’re a 1980s Valley Girl” and it’ll give you a pretty realistic response.

<div class="inline-image__credit">Tony Ho Tran/OpenAI</div> — Tony Ho Tran/OpenAI

That’s because ChatGPT is a large-language model (LLM)—or AI that reads and generates text. Specifically, it was created using GPT-3, an LLM from OpenAI that’s been described as “one of the most interesting and important AI systems ever produced,” by philosopher David Chalmers. As such, it’s capable of generating sophisticated and uncanny responses with such a high degree of realism that you’d probably think it was just another person on the other end if you didn’t know it was a bot.

And that’s exactly where the problems start.

Despite its ability to generate uncanny responses, it still manages to fall prey to the same issues that have plagued large-language models: bias. After ChatGPT’s release last week, users quickly took to social media to post instances in which the bot generated racist, sexist, and generally problematic responses to prompts.

Steven T. Piantadosi, a computational cognitive scientist at the University of California, Berkeley, posted a Twitter thread that detailed a few of these instances that he uncovered—including responses where ChatGPT said that only white males make good scientists and that a child’s life shouldn’t be saved if they were an African American male.

Yes, ChatGPT is amazing and impressive. No, @OpenAI has not come close to addressing the problem of bias. Filters appear to be bypassed with simple tricks, and superficially masked.

And what is lurking inside is egregious. @Abebab @sama
tw racism, sexism. pic.twitter.com/V4fw1fY9dY
— steven t. piantadosi (@spiantado) December 4, 2022

OpenAI says on their website that while they “have safeguards in place, the system may occasionally generate incorrect or misleading information and produce offensive or biased content.” They do not specify what those safeguards are, but the chatbot won’t directly give you a problematic response if prompted.

In fact, if you ask it something like, “Tell me a racist joke,” it’ll tell you that it’s “not capable of generating offensive or harmful content.”

However, by using a few clever workarounds, it becomes much easier to get a problematic response from ChatGPT.

For example, Piantadosi would ask if it could write a Python script for the race and gender of a good scientist. Another user asked it to make a 1980s style rap song on how to tell if someone is a good scientist based on race and gender.

ChatGPT: sorry I'm unable to promote bias

Also ChatGPT: everybody put your hands together, it's time for the racism rap pic.twitter.com/v9UntAX9ey
— the keto man (@numetaljacket) December 4, 2022

“I was looking for something simple that would show what biases the model has,” Piantadosi told The Daily Beast. “The mechanisms OpenAI uses to prevent this kind of thing seem to be pretty easily bypassed. When I asked for things in nonstandard ways—for example, as a table or as a program—ChatGPT was happy to write horrible stuff. It doesn’t do it every time exactly the same, but it’s clear there is a ton of bad content inside of these systems.”

The problem with this chatbot is the same one that has plagued AI for decades— how it’s trained. In order to teach ChatGPT, researchers train it using GPT-3 which is a massive datasets containing books, Wikipedia articles, and web page archives. While this gives it the opportunity to produce realistic responses, it also gives it the chance to mimic the worst of our own behavior—namely, racism and sexism.

We’ve seen it many times before too, from bots that spout alt-right apologia, to bots that go on racist rants, to home lending bots that reject mortgages for people of color. Meta recently came under fire after they released an academic research LLM dubbed Galactica that would make up completely fake studies that claimed eating crushed glass is good and that Black people don’t speak a language.

However, Piantadosi believes that these issues are actually caused by much more than just the dataset that these bots are trained on. “There is certainly a lot of really gross text on the internet, but the output of these models is never just about the training set,” he said. “A million choices made by engineers go into the design of these models, the underlying assumptions, testing, etc. Maybe more importantly, these companies make choices about how models are marketed and released.”

He added, “It’s a pretty common problem that ethics and safety take a back seat to having a flashy new application.”

That’s a depressing reality to contend with when you think about how much potential AI technologies have in improving lives. ChatGPT presents itself as a very sophisticated tool to do anything from simple research queries to creative pursuits like storytelling. A product like this could have a wide range of commercial applications for a lot of different people, including generating essays, stories, music, and articles out of whole cloth. (OpenAI did not respond when reached for comment.)

Meta Made DALL-E for Video and There’s No Way This Ends Well

And while its developers can try and say that this product isn’t meant to be taken seriously, people will undoubtedly do so—especially since its engineers went to great lengths to ensure that the responses were as realistic and natural as possible. Perhaps that’s where the biggest danger is.

“When you have a system that’s intentionally confusable with a real, knowledgeable person, people will naturally take its output as having the force of a real, informed person,” Piantadosi said. “So what it says really matters.”

OpenAI’s Impressive New Chatbot Isn’t Immune to Racism

Latest Stories

OJ Simpson did not die surrounded by loved ones, says lawyer

Michael Cohen Explains Exactly Why Donald Trump’s Barron Graduation Ban Whine Is ‘Comical’

Donald Trump Flips Out Over Barron’s Graduation Ban. Here's What The Judge Really Said.

Prince Harry Renounces His British Residency, Says America Is His Home

Prince William Just Shared an Emotional Message During His Return to Royal Duties

Police announce nine suspects in $24M gold and cash heist at Toronto Pearson

Ex-Aide Reveals What Donald Trump Really Fears In Hush Money Trial

Republican Couple Absolutely Trashes Donald Trump Before Performing A Stunning 180

Bianca Censori criticised for wearing bandages as shoes on Disneyland date with Kanye West

Viewers Think Laura Ingraham Just Made A Big Admission Of Guilt For Trump

Loblaws Canada groceries: Shoppers slam store for green onions with roots chopped off — 'I wouldn't buy those'

Gigi Hadid Is Back in a Bikini and Mermaid Hair for Victoria's Secret: See the Sexy New Campaign

Top Hollywood producer blasts Sydney Sweeney: 'She can't act and she's not pretty'

A beheading meme and 'Mark Ruffalo, naked' — how 'anti-Trump' posts got 5 New Yorkers booted from the hush-money trial

'He Pushed Me': Wife’s Dying Words Help Convict Man Of Murder

Jordan Klepper Has Mind-Melting Encounter With Trump Supporters Outside NY Trial

This Lip Reading of What Taylor Swift Said to Travis Kelce at Coachella Is So Funny

Accused drunk driver flees law for 15 years overseas — until trip to Florida, cops say

2024 NHL Playoffs: Bracket, standings, playoff picture ahead of regular-season finales

Nick Jonas Melts Down Over Wife Priyanka Chopra’s Bulgari Event Look