Unpacking the hype around OpenAI’s rumored new Q* model (2024)

This story is from The Algorithm, our weekly newsletter on AI.

Ever since last week’s dramatic events at OpenAI, the rumor mill has been in overdrive about why the company’s chief scientist, Ilya Sutskever, and its board decided to oust CEO Sam Altman.

While we still don’t know all the details, there have been reports that researchers at OpenAI had made a “breakthrough” in AI that had alarmed staff members. Reuters and The Information both report that researchers had come up with a new way to make powerful AI systems and had created a new model, called Q* (pronounced Q star), that was able to perform grade-school-level math. According to the people who spoke to Reuters, some at OpenAI believe this could be a milestone in the company’s quest to build artificial general intelligence, a much-hyped concept referring to an AI system that is smarter than humans. The company declined to comment on Q*.

Social media is full of speculation and excessive hype, so I called some experts to find out how big a deal any breakthrough in math and AI would really be.

Researchers have for years tried to get AI models to solve math problems. Language models like ChatGPT and GPT-4 can do some math, but not very well or reliably. We currently don’t have the algorithms or even the right architectures to be able to solve math problems reliably using AI, says Wenda Li, an AI lecturer at the University of Edinburgh. Deep learning and transformers (a kind of neural network), which are what language models use, are excellent at recognizing patterns, but that alone is likely not enough, Li adds.

Math is a benchmark for reasoning, Li says. A machine that is able to reason about mathematics could, in theory, learn to do other tasks that build on existing information, such as writing computer code or drawing conclusions from a news article. Math is a particularly hard challenge because it requires AI models to have the capacity to reason and to really understand what they are dealing with.

A generative AI system that could reliably do math would need to have a really firm grasp on concrete definitions of particular concepts that can get very abstract. A lot of math problems also require some level of planning over multiple steps, says Katie Collins, a PhD researcher at the University of Cambridge, who specializes in math and AI. Indeed, Yann LeCun, chief AI scientist at Meta, posted on X and LinkedIn over the weekend that he thinks Q* is likely to be “OpenAI attempts at planning.”

People who worry about whether AI poses an existential risk to humans, one of OpenAI's founding concerns, fear that such capabilities might lead to rogue AI. Safety concerns might arise if such AI systems are allowed to set their own goals and start to interface with a real physical or digital world in some ways, says Collins.

But while math capability might take us a step closer to more powerful AI systems, solving these sorts of math problems doesn’t signal the birth of a superintelligence.

“I don’t think it immediately gets us to AGI or scary situations,” says Collins. It’s also very important to underline what kind of math problems AI is solving, she adds.

“Solving elementary-school math problems is very, very different from pushing the boundaries of mathematics at the level of something a Fields medalist can do,” says Collins, referring to a top prize in mathematics.

Machine-learning research has focused on solving elementary-school problems, but state-of-the-art AI systems haven’t fully cracked this challenge yet. Some AI models fail on really simple math problems, but then they can excel at really hard problems, Collins says. OpenAI has, for example, developed dedicated tools that can solve challenging problems posed in competitions for top math students in high school, but these systems outperform humans only occasionally.

Nevertheless, building an AI system that can solve math equations is a cool development, if that is indeed what Q* can do. A deeper understanding of mathematics could open up applications to help scientific research and engineering, for example. The ability to generate mathematical responses could help us develop better personalized tutoring, or help mathematicians do algebra faster or solve more complicated problems.

This is also not the first time a new model has sparked AGI hype. Just last year, tech folks were saying the same things about Google DeepMind’s Gato, a “generalist” AI model that can play Atari video games, caption images, chat, and stack blocks with a real robot arm. Back then, some AI researchers claimed that DeepMind was “on the verge” of AGI because of Gato’s ability to do so many different things pretty well. Same hype machine, different AI lab.

And while it might be great PR, these hype cycles do more harm than good for the entire field by distracting people from the real, tangible problems around AI. Rumors about a powerful new AI model might also be a massive own goal for the regulation-averse tech sector. The EU, for example, is very close to finalizing its sweeping AI Act. One of the biggest fights right now among lawmakers is whether to give tech companies more power to regulate cutting-edge AI models on their own.

OpenAI’s board was designed as the company’s internal kill switch and governance mechanism to prevent the launch of harmful technologies. The past week’s boardroom drama has shown that the bottom line will always prevail at these companies. It will also make it harder to make a case for why they should be trusted with self-regulation. Lawmakers, take note.
