A professor says he's stunned that ChatGPT went from a D grade on his economics test to an A in just 3 months

Ryan Hogg

Updated March 26, 2023 at 5:00 a.m.·4 min read

ChatGPT-4 scored 73% on Bryan Caplan's economics test.Getty Images

The progress that ChatGPT made in an exam in just three months stunned an economics professor.
Bryan Caplan of George Mason University said the chatbot got a D on his economics test in January.
He tried again with the GPT-4 update last week and its score improved to an A.

An economics professor said the progress ChatGPT made — it improved its score from a D to an A on his economics test in just three months — has stunned him.

Bryan Caplan, an economics professor at George Mason University, told Insider that the latest version of ChatGPT could now be responsible for the first big bet he's ever lost.

ChatGPT-3.5 didn't understand basic theory

Writing in a blog post on his Substack "Bet On It" in January, Caplan said he gave ChatGPT questions from his fall midterms.

Caplan said his exam questions test students' understanding of economics rather than have them regurgitate textbooks or complete what are essentially memory exercises.

It's here where the old version of ChatGPT tripped up. The bot scored 31 out of a possible 100 on his test, equivalent to a D and well below his 50% median.

Caplan told Insider that the bot failed to understand basic concepts, such as the principle of comparative and absolute advantage. Its answers were also more political than economic, he said.

"ChatGPT does a fine job of imitating a very weak GMU econ student," Caplan wrote in his January blog post.

He isn't the only academic that ChatGPT has disappointed. While it passed a Wharton Business School exam in January, its professor said it made "surprising mistakes" on simple calculations.

Big bet

Caplan likes to bet. He's previously placed 23 public bets and won them all. They're usually for modest sums of about $100, and often on technical subjects like predicted unemployment rates and inflation readings.

He also narrowly won a 2008 bet that no member state would leave the European Union before 2020 — the UK left in January of that year.

ChatGPT's responses underwhelmed him so much that Caplan bet an AI model wouldn't score an A on six out of seven of his exams before 2029.

But when ChatGPT-4 was released, its progress stunned Caplan. It scored 73% on the same midterm test, equivalent to an A and among the best scores in his class.

ChatGPT's paywalled upgrade sought to fix some of the early issues with the beta version, GPT-3.5. This purportedly included making ChatGPT 40% more likely to return accurate responses, as well as making it able to handle more nuanced instructions.

For Caplan, the improvements were obvious. The bot gave clear answers to his questions, understanding principles it previously struggled with. It also scored perfect marks explaining and evaluating concepts that economists like Paul Krugman have championed.

"The only thing I can say is it just seems a lot better," Caplan said.

Caplan thought ChatGPT's training data might have picked up his previous blog post where he explained his answers, but colleagues told him this was highly unlikely.

He added that he's already fed the bot new tests it hadn't seen before, where it did even better than its previous 73% grade. "I was very smug in my judgment, and I'm not smug anymore," Caplan said.

Caplan is more confident he'll win his next AI-related wager. He has a bet with Eliezer Yudkowsky, an AI doomer who has sparred with Sam Altman, the creator of ChatGPT, that AI will lead to the end of the world before January 1, 2030.

"I'm probably going to lose this AI bet, but I am totally on board to do a bunch more end-of-the-world AI bets because I think these people are out of their minds," he said.

Tough to test

AI bots have caused headaches for examiners. Professors told Insider that plagiarism can be hard to prove with material from ChatGPT because there is no material evidence of wrongdoing.

Caplan said he's thinking of doing away with graded homework in the wake of ChatGPT's rise. He hopes his habit of regularly changing questions will be enough to stop students from learning and regurgitating ChatGPT's responses in exam settings.

Read the original article on Business Insider

HuffPost
Trump's Chilling New Courthouse Rant Gets Put On Ice By Critics
The former president's latest complaint gets a cool reception on social media.
6 hours ago
NY Daily News
OJ Simpson did not die surrounded by loved ones, says lawyer
The family of O.J. Simpson announced last week the former football star died on April 10 “surrounded by his children and grandchildren.” But according to Simpson’s longtime lawyer Malcolm LaVergne, the 76-year-old father of four was a sole visitor away from dying alone. LaVergne declined to tell The Associated Press who was at Simpson’s bedside when the acquitted double-murder defendant ...
2 days ago
HuffPost
George Conway Details ‘Oh, It’s Daddy’ Call To Ivanka That Exposed Trump’s Fears
It showed the then-president "was very, very concerned," said the conservative attorney.
a day ago
The Daily Beast
Sheep Suspected in the Double Killing of Husband and Wife
Newshub YouTubeA man in New Zealand went looking for his elderly parents on Thursday morning after becoming concerned that he had not heard from them for days, reports say. At their rural rented property in Waitākere, West Auckland, he found a ram in a paddock alongside the lifeless bodies of his parents.The unnamed couple in their early 80s are believed to have both been killed by the sheep, according to The New Zealand Herald. Authorities believe the man had gone out to feed the ram and never
21 hours ago
People
Kourtney Kardashian Celebrates Her Appearance After Fan Comments on Bikini Pic: 'I Love This Body'
The Lemme co-founder is thankful for her body because it "gave me my 3 big babies and my little baby"
8 hours ago
The Daily Beast
Trump’s Trial Now Has 12 Jurors—and One Angry Man
Jabin Botsford/ReutersDespite the troubles plaguing Donald Trump’s first criminal trial in New York City, the process reached a milestone Thursday afternoon when the judge filled all 12 seats of the jury that will determine his fate.But the slog is far from over, as prosecutors and defense lawyers must now screen dozens of other jurors to pick the half-dozen New Yorkers who will serve as alternates during the next month or two—and might not even make it into the deliberation room.The new additio
20 hours ago
Cosmo
Shania Twain is unrecognisable with butt-skimming peroxide blonde hair
Shania Twain just shared snaps with super long peroxide blonde hair. It's giving 00's Jessica Simpson and we're not mad at it.
23 hours ago
Yahoo Canada Style
Sangita Patel says her husband was 'freaking out' after reading her cancer report
The 45-year-old mother-of-two spoke to "The Ladygang" podcast about learning she had a rare and aggressive form of thyroid cancer.
16 hours ago
The Daily Beast
Fox News Anchor Reminds GOP Senator That Trump Killed His Border Deal
Fox NewsSen. James Lankford (R-OK), the GOP co-architect of the Senate’s failed immigration bill earlier this year, made what were perhaps his most critical comments yet on Donald Trump’s role in scuttling the legislation, alluding to Fox News Thursday that the former president was motivated by his political self-interest.On Your World, Lankford was confronted by anchor Neil Cavuto about the players behind the bill’s demise.“You are a real gentleman about this, and I know you’re not trying to zi
10 hours ago
BANG Showbiz
Kanye West involved in alleged altercation with man who 'assaulted' his wife
Kanye West is being investigated for battery after allegedly punching a man who his representatives claim "battered and sexually assaulted" his wife, Bianca Censori.
a day ago
The Independent
Bianca Censori criticised for wearing bandages as shoes on Disneyland date with Kanye West
‘How is this allowed in Disneyland?’ perplexed viewer questions
2 days ago
InStyle
Prince William Just Shared an Emotional Message During His Return to Royal Duties
Princess Kate cosigned the powerful statement.
2 days ago
HuffPost
Lara Trump's Take On Father-In-Law's Hush Money Charges Is A Real Doozy
She may have understated the allegations just a touch.
a day ago
The New York Times
Miscalculation Led to Escalation as Israel and Iran Clash
TEL AVIV, Israel — Israel was mere moments away from an airstrike on April 1 that killed several senior Iranian commanders at Iran’s embassy complex in Syria when it told the United States what was about to happen. Israel’s closest ally had just been caught off guard. Aides quickly alerted Jake Sullivan, President Joe Biden’s national security adviser; Jon Finer, the deputy national security adviser; Brett McGurk, Biden’s Middle East coordinator; and others, who saw that the strike could have se
22 hours ago
People
Gigi Hadid Is Back in a Bikini and Mermaid Hair for Victoria's Secret: See the Sexy New Campaign
The supermodel joins Emily Ratajkowski, Paloma Elsesser and Tina Kunakey in the new summer 2024 campaign
2 days ago
Hypebae
Willow Smith Bares It All For Her New Album, 'Empathogen'
Our unconventional beauty bae, Willow Smith, is back again, serving us an all-natural look, with...
20 hours ago
USA TODAY
'Cowardly judge:' Dismissed Trump hush money trial juror number 4 shares his story: Exclusive
Herson Cabreras said he was taken aback when prosecutors moved to oust him from the jury in Donald Trump's criminal trial.
11 hours ago
HuffPost
Donald Trump Confuses Jimmy Kimmel For Al Pacino In Weird Rant
"In fairness to our former president, many stable geniuses confuse me with Al Pacino," Kimmel said in response to the mix-up.
2 days ago
USA TODAY
Trump is funneling campaign money into cash-strapped businesses. Experts say it looks bad.
Trump's campaign and affiliated committees have spent more than $800,000 at Trump properties since the start of 2023.
a day ago
BuzzFeed
Here's How To Protect Yourself From The "Can You Hear Me?" Phone Scam That's Going Around Right Now
“We don’t want people to operate in this fear mode,” Nofziger said. “We want people to operate in the empowerment mode.”
a day ago

ChatGPT-3.5 didn't understand basic theory

Big bet

Tough to test

Latest Stories