
We went head-to-head with AI and LOST as 30 of Earth's top brains left 'frightened' after secret battle with chatbot
Thirty of the world's most renowned mathematicians gathered in Berkeley, California, in mid-May for a secret maths battle against a machine.
The bot runs on a large language model (LLM) called o4-mini, which was produced by ChatGPT creator OpenAI.
And it proved itself to be smarter than some of the human geniuses graduating from universities today, according to Ken Ono, a mathematician at the University of Virginia and a leader and judge at the meeting.
It was able to answer some of the toughest math equations out there in mere minutes - problems that would have taken a human expert weeks or months to solve.
OpenAI had asked Epoch AI, a nonprofit that benchmarks AI models, to come up with 300 math questions whose solutions had not yet been published.
This meant the AI couldn't just trawl the internet for the answer; it had to solve it on its own.
The group of mathematicians, hand-selected by Elliot Glazer, a recent math Ph.D. graduate hired by Epoch AI, were tasked with coming up with the hardest equations they could.
Everyone who participated had to sign a nondisclosure agreement requiring them to communicate only through the secure messaging app Signal.
This would prevent the AI from potentially seeing their conversations and using them to train its robot brain.
Only a small group of people in the world are capable of developing such questions, let alone answering them.
Each problem that o4-mini couldn't solve would earn the mathematician who devised it a $7,500 reward.
By April 2025, Glazer found that o4-mini could solve around 20 percent of the questions.
Then at the in-person, two-day meeting in May, participants finalised their last batch of challenge questions.
The 30 attendees were split into groups of six, and competed against each other to devise problems that they could solve but would stump the AI reasoning bot.
By the end of that Saturday night, the bot's mathematical prowess was proving too much for the mathematicians.
"I came up with a problem which experts in my field would recognize as an open question in number theory — a good Ph.D.-level problem," said Ken Ono, a mathematician at the University of Virginia and a leader and judge at the meeting, reported by
Early that Sunday morning, Ono alerted the rest of the participants.
"I was not prepared to be contending with an LLM like this," he said.
"I've never seen that kind of reasoning before in models. That's what a scientist does. That's frightening."
Over the two days, the bot was able to solve some of the world's trickiest math problems.
"I have colleagues who literally said these models are approaching mathematical genius," added Ono.
"I've been telling my colleagues that it's a grave mistake to say that generalised artificial intelligence will never come, [that] it's just a computer.
"I don't want to add to the hysteria, but in some ways these large language models are already outperforming most of our best graduate students in the world."
Just 10 questions stumped the bot, according to researchers.
Yang Hui He, a mathematician at the London Institute for Mathematical Sciences and an early pioneer of using AI in maths, said: "This is what a very, very good graduate student would be doing - in fact, more."