Researchers just solved AI’s biggest conundrum

Andrew Tarantola

June 26, 2024 at 3:48 p.m.·3 min read

The Harth Sleep-Shift Light Bulb running next to a bed. — Harth / Amazon

The large language models that power today’s chatbots like ChatGPT, Gemini, and Claude are immensely powerful generative AI systems, and immensely power-hungry ones to boot.

They apparently don’t need to be, as recent research out of University of California, Santa Cruz has shown that modern LLMs running billions of parameters can operate on just 13 watts of power without a loss in performance. That’s roughly the draw of a 100W light bulb, and a 50x improvement over the 700W that an Nvidia H100 GPU consumes.

“We got the same performance at way less cost — all we had to do was fundamentally change how neural networks work,” lead author of the paper, Jason Eshraghian, said. “Then we took it a step further and built custom hardware.” They did so by doing away with the neural network’s multiplication matrix.

Matrix multiplication is a cornerstone of the algorithms that power today’s LLMs. Words are represented as numbers and then organized into matrices where they are weighted and multiplied against one another to produce language outputs depending on the importance of certain words and their relationship to other words in the sentence or paragraph.

These matrices are stored on hundreds of physically separate GPUs and fetched with each new query or operation. The process of shuttling data that needs to be multiplied among the multitude of matrices costs a significant amount of electrical power, and therefore money.

To get around that issue, the UC Santa Cruz team forced the numbers within the matrices into a ternary state — every single number carried a value of either negative one, zero, or positive one. This allows the processors to simply sum the numbers instead of multiplying them, a tweak that makes no difference to the algorithm but saves a huge amount of cost in terms of hardware. To maintain performance despite the reduction in the number of operations, the team introduced time-based computation to the system, effectively creating a “memory” for the network, increasing the speed at which it could process the diminished operations.

“From a circuit designer standpoint, you don’t need the overhead of multiplication, which carries a whole heap of cost,” Eshraghian said. And while the team did implement its new network on custom FGPA hardware, they remain confident that many of the efficiency improvements can be retrofitted to existing models using open-source software and minor hardware tweaks. Even on standard GPUs, the team saw a 10 times reduction in memory consumption while improving operational speed by 25%.

With chip manufacturers like Nvidia and AMD continually pushing the boundaries of GPU processor performance, electrical demands (and their associated financial costs) for the data centers housing these systems have soared in recent years. With the increase in computing power comes a commensurate increase in the amount of waste heat the chips produce — waste heat that now requires resource-intensive liquid cooling systems to fully dissipate.

Arm CEO Rene Haas warned The Register in April that AI data centers could consume as much as 20-25% of the entire U.S. electrical output by the end of the decade if corrective measures are not taken, and quickly.

Business Insider
A Chinese firm's answer to SpaceX's Falcon 9 blew up in a giant fireball after it accidentally launched during a test
Tianbing Aerospace Technology said it was testing its rocket engine when the Tianlong-3 left the launchpad due to a "structural failure."
CNN
See Chinese rocket crash after accidentally launching
On June 30, a Chinese rocket crashed into the hills of Gongyi in central China after being accidentally launched during a ground test, according to a statement from the company, Space Pioneer. No injuries have been reported.
Futurism
NASA Is Having a Spacesuit Crisis
Spacesuit Setback Earlier this week, NASA astronaut Tracy Dyson discovered to her horror that water was squirting from her spacesuit 31 minutes into her and fellow astronaut Mike Barratt's spacewalk outside of the International Space Station. Unsurprisingly, the space agency was forced to cut their journey short, with crews on board the orbital outpost investigating […]
Futurism
China Cracks Open First Ever Sample From Moon’s Far Side
"Thicker and Stickier" After boldly going to the Moon's far side, China is now in possession of more than four pounds of lunar samples — the first ever collected in human history from that mysterious region. The state-run China Daily newspaper reports that the Chang'e 6 robotic lunar lander, which touched down back on Earth […]
Futurism
Scientists Identify Plant That Could Grow on Mars
Mars Moss Scientists in China claim to have discovered a kind of desert moss that thrives in a variety of conditions, from Antarctica to the Mojave desert — that could survive on the surface of Mars without being sheltered inside a greenhouse. As The Guardian reports, the moss called Syntrichia caninervis could help us transform the […]
Associated Press
An Arizona museum tells the stories of ancient animals through their fossilized poop
One way to help tell how a Tyrannosaurus rex digested food is to look at its poop. Bone fragments in a piece of fossilized excrement at a new museum in northern Arizona — aptly called the Poozeum — are among the tinier bits of evidence that indicate T. rex wasn’t much of a chewer, but rather swallowed whole chunks of prey. The sample is one of more than 7,000 on display at the museum that opened in May in Williams, a town known for its Wild West shows along Route 66, wildlife attractions and a railway to Grand Canyon National Park.
Futurism
Trees Blamed for Air Pollution
In a controversial new study, scientists are claiming that trees in Los Angeles are contributing to the city's air pollution, challenging conventional notions about the positive role they play in their ecosystems. As New Scientist explains, this bold theory was born of a strange conundrum: despite efforts to decrease traffic exhaust and increase environmental protections, […]
CNN
NASA administrator weighs in on China’s historic lunar far side samples — and potential US access
China now has the first samples ever collected from the far side of the moon and says it will share them with scientists around the world. But a 2011 law complicates access for the US.
Popular Mechanics
Tiny Quantum Ghosts Might Be Creating Brand-New Elements
These hidden forces may reshape our approach to particle physics.
Futurism
James Webb Observes Mysterious Structures Above Jupiter's Great Red Spot
The remarkable James Webb Space Telescope has been used to image the furthest reaches of the cosmos. But now, astronomers have leveraged its immense powers on a target far closer to home, Jupiter — and in so doing, they've found mysterious features and structures on the surface of the gas giant that have never been […]
United Press International
University of Michigan wins NASA's lunar lander challenge award
The University of Michigan may have topped their college football title with NASA naming its team as the winner of its 2024 Human Lander Challenge at a forum in Huntsville, Ala., on Friday.
Time
Spotted Lanternflies: Scientists Studying How to Kill them
Scientists have been researching the best method for killing spotted lanternflies, and they may have gotten some new leads this year through the insects' attraction to vibrations.
Robb Report
The 7 Best Aviator Sunglasses to Channel Your Inner Maverick
These cool, technical shades would make Tom Cruise drool.
Reuters
SERA names India as partner country for Blue Origin space flight
The U.S.-based Space Exploration and Research Agency (SERA) on Monday announced India as a partner country in its human spaceflight programme, which will see six citizen astronauts from across the world launched into space. The programme, being executed in collaboration with Blue Origin, is meant for people from countries who have sent "few or no astronauts" to space, the agency said. The selected citizens will undertake the 11-minute journey in New Shepard, Blue Origin's reusable suborbital rocket, after undergoing training at its launch site in West Texas.
BBC
Extreme heat across island now more likely - study
Temperatures of 33C, which last occurred over 80 years ago, are now much more probable, the study says.
People
Kate Beckinsale Cheekily Moons Department Store as Way of Dealing with ‘Horrific News’
"when the bottom falls out of your world the only response after crying till you’re sick is your own bottom," the actress wrote
HuffPost
Reporter Reveals 'Real Anger' From Biden White House Aides After Debate
They were "shocked" and felt "they had not been told the truth," said Axios' Alex Thompson.
People
2 Missing Ga. Firefighters Who'd Been High School Sweethearts Are Found Dead Days After Woman Keyed Man's Car
The two firefighters had dated for about seven years before breaking up, according to Kuhbander's parents
People
Husband and Wife, 70 and 71, Die Together Through Euthanasia: 'There Is No Other Solution'
"I’ve lived my life, I don’t want pain anymore,” said Jan Faber before his death
The Hill
Pat Tillman’s mother ‘shocked’ by Prince Harry getting son’s award
The mother of U.S. soldier and former NFL star Pat Tillman said she was “shocked” to hear Prince Harry, Duke of Sussex, would be the recipient of an annual award made in the name of her son, who was killed by friendly fire in Afghanistan in 2004. Mary Tillman, the mother of the Pat Tillman,…

Latest Stories