
AI breakthrough: AlphaGeometry 2 and Symbolic AI outperform Maths Olympiad gold medallists
The world was astonished a year ago, but it is now shocked. Last year, AlphaGeometry, an AI problem solver developed by Google DeepMind, astonished the world by placing second in the International Mathematical Olympiad (IMO). The DeepMind team now claims that the performance of their improved system, AlphaGeometry 2, has surpassed that of the typical gold medallist. The findings are detailed in a preprint available on the arXiv service.
International Mathematical Olympiad
The International Mathematical Olympiad (IMO) is the world's most prominent mathematics competition. The inaugural competition, which took place in Romania in 1959 among seven Soviet Bloc nations, began to grow swiftly, reaching 50 nations in 1989 and surpassing 100 countries for the first time in 2009. The competition has always aimed to help school-age mathematicians improve their problem-solving abilities.
In India, the Homi Bhabha Centre for Science Education (HBCSE) organises the Mathematical Olympiad Programme on behalf of the National Board for Higher Mathematics (NBHM) of the Department of Atomic Energy (DAE), Government of India. The Indian team to compete in the international competition is chosen using a broad-based Indian Olympiad Qualifier in Mathematics (IOQM). For additional information, click here.
Questions are picked from four topic areas: algebra, combinatorics, geometry, and number theory, with no necessity or expectation that students can utilise calculus.
The competition consists of six problems. The tournament lasts two days and consists of three problems per day; each day, participants get four and a half hours to complete three questions. Each problem is worth 7 points, with a maximum of 42 points.
AI in the race
In 2024, the IMO was hosted in Bath, United Kingdom, with 609 high school students from 108 countries participating. Chinese student Haojia Shi finished first in the individual rankings with a perfect score- 42 points.
In the country rankings, the United States team came out on top, and China came in second. The human problem-solvers won 58 gold medals, 123 silver and 145 bronze. One of the event's highlights was the presence of two unofficial contestants: AlphaGeometry 2 and AlphaProof, both artificial intelligence algorithms built by Google DeepMind.
The two programs were able to solve four out of six tasks. Mathematician and Fields Medallist Timothy Gowers, a past IMO gold medallist, and mathematician Joseph K. Myers, another previous IMO gold medallist, evaluated the two AI systems' solutions using the same criteria as the human competitors. According to these standards, the programs received an excellent 28 points out of a potential 42 points, equivalent to a silver medal.
This means that the AI came close to earning a gold medal, which was granted for a score of 29 points or higher. Furthermore, just 60 pupils achieved higher completion scores. Furthermore, AlphaGeometry 2 solved the geometry problem correctly in just 19 seconds. Meanwhile, AlphaProof solved one number theory and two algebra problems, including one that only five human participants could figure out.
Training the tools
Training an AI requires a large quantity of data. AlphaProof's training was restricted by the amount of mathematical material accessible in a formal mathematical language. The DeepMind researchers then used the Gemini AI tool to translate millions of problems on the Internet that people have solved step-by-step in natural language into the Lean programming language, allowing the proof assistant to learn about them.
Using this huge data, AlphaProof was taught using reinforcement learning, as AI systems were taught to master chess, shogi, and Go previously. Reinforcement Learning (RL) is similar to instructing a dog. The dog (agent) learns tricks by performing actions (such as sitting). If it sits appropriately, it gets a treat (reward); otherwise, no treat. Over time, it learns which acts result in rewards.
Similarly, RL systems learn using trial and error to maximise rewards in tasks such as gaming or robotics. AlphaProof repeatedly competes with itself and improves step by step; if the process does not result in a win, it is penalised and learns to explore alternative techniques.
What works for number theory or algebra does not work in geometry, necessitating a new methodology. As a result, DeepMind created AlphaGeometry, a unique AI system designed to solve geometry difficulties.
The experts initially created an exhaustive list of geometric 'premises,' or basic building pieces of geometry, such as a triangle having three sides, and so on. Just as the Architect studies the design, AlphaGeometry's deduction engine algorithm evaluates the 'problems'. It picks the appropriate blocks (premises) assembled step by step to build the home (proof).
The AI was able to manipulate the geometric objects around a 2D plane, like adding a fourth point and converting a triangle into a quadrilateral or moving a point to change the triangle's height. The 'proof' is complete when all components fit together correctly, and the home is sturdy. Unlike trial-and-error learning (RL), this is equivalent to following an instruction manual with unlimited LEGO parts.
Going for gold
The DeepMind team has now produced an improved version, AlphaGeometry 2, that trains the model with more data and accelerates the process. The AI system is now able to solve linear equations.
With the upgrade, the AI was recently proved capable of answering 84% of all geometry problems set in IMOs during the last 25 years, compared to 54% for the previous version of AlphaGeometry. Future developments in AlphaGeometry will include dealing with mathematical problems containing inequalities and nonlinear equations, which will be necessary to solve geometry completely.
A team of researchers, including IIIT Hyderabad's Ponnurangam Kumaraguru, has made a breakthrough with their 'Symbolic AI', outperforming AlphaGeometry's capabilities. Furthermore, the hybrid symbolic AI, which complemented Wu's technique with AlphaGeometry, outperformed human gold medallists on IMO geometry problems.
This Symbolic AI system solves geometry problems by combining algebraic methods—primarily Wu's Method—with synthetic approaches, such as the deduction engine algorithm. The heart of this technique is 'Wu's method,' which is analogous to systematically completing a gigantic jigsaw puzzle.
Consider solving a jigsaw puzzle; it is challenging to complete if some elements (such as variables and equations in geometry) are concealed behind clutter. Thus, initial decluttering is valuable, such as sorting the puzzle pieces by colour/edge.
Wu's approach rearranges geometric equations into a more organised hierarchy. We can answer one equation at a time, just as we would place corner pieces in a jigsaw puzzle first, then use the results to simplify the next. Wu's Method simplifies complex geometry into a step-by-step assembly line.
'With very low computational requirements, this performs comparably to an IMO silver medalist. When combined with AlphaGeometry, the hybrid system successfully solves 27 out of 30 IMO problems,' according to Mr. Kumaraguru. 'This system is remarkably efficient — on most consumer laptops, with no access to a GPU, it can solve these problems within a few seconds. It also requires no learning or training phase.'
China's endeavours are not far behind. TongGeometry, a system for proposing and solving Euclidean geometry problems that bridge numerical and spatial reasoning, was developed by scholars at the Beijing Institute for General Artificial Intelligence (BIGAI) and the Institute for Artificial Intelligence, Peking University, Beijing. It has solved all International Mathematical Olympiad geometry problems for the last 30 years, outperforming gold medallists for the first time. 'Their work, including their analysis on novel problem generation, is quite interesting,' says Mr. Kumaraguru. However, he declined to express 'informed opinions' on their work due to the lack of publicly available material.
Are mathematicians redundant?
Are we nearing the point when mathematicians are obsolete? Gowers concurs: 'I would guess that we are still a breakthrough or two short of that.'
While the AlphaProof outperformed the humans, it took almost 60 hours to answer the problem, when the humans were only given 41/2 hours. If human competitors had been given that time for each task, they would surely have scored higher.
Another prerequisite is that problems were manually translated into the proof assistant Lean, meaning humans did the auto formalisation. In contrast, the AI program did the necessary mathematics.
Autoformalization converts ambiguous human language into precise logical or mathematical assertions that computers can reason about. For example, the sentence 'If it rains, then the ground gets wet' in plain English has to be translated into Propositional Logic: Rain → WetGround, which is symbolic form: First-Order Logic: ∀x (rain(x) → wetground(x)). Humans could do their own auto formalisation.
Furthermore, in the previous IMO, DeepMind's algorithms did not even attempt to solve combinatorial issues since they were difficult to transfer into programming languages like Lean. Alphageometry could only work with Euclidean plane geometry problems in 2D. Another significant inherent difficulty with AI is 'hallucinations', which are nonsensical or erroneous assertions that might occur, especially when dealing with intricate thinking.
(T.V. Venkateswaran is a science communicator and visiting faculty member at the Indian Institute of Science Education and Research, Mohali.)
Hashtags

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles
&w=3840&q=100)

Business Standard
36 minutes ago
- Business Standard
Use of AI likely to help enhance sugarcane production by 30%, say expert
The use of Artificial Intelligence (AI) could bring down the need of water in sugarcane farming by 50 per cent and increase the per acre produce by nearly 30 per cent, an expert in the field said on Wednesday. Notably, a meeting was recently held in Pune in the presence of Maharashtra Deputy Chief Minister Ajit Pawar and former Union Agriculture Minister Sharad Pawar where the use of AI in sugarcane farming was discussed. A Memorandum of Understanding (MoU) between the Vasantdada Sugar Institute and Agricultural Development Trust was signed to explore ways to ensure this technology reaches as many farmers as possible. "Microsoft has already worked for a long time on the use of AI for sugarcane cultivation and is assuring 30 per cent growth in sugarcane production and curtailing water usage (in its farming) to half. This will help to run the sugar mills for a longer time (more than 110 days) and losses will also be less," Maharashtra State Co-Operative Sugar Factories Federation Ltd director Jayprakash Dandegaonkar told PTI. 'The 40 (23 co-operative and 17 private) sugar mills in Maharashtra which do not owe any debt to the VSI will be included in this project (of AI use in sugarcane cultivation)," he said. A group of 25 sugarcane cultivating farmers in a two-km radius will have a station (automated AI facility) which will report to war rooms of the Krishi Vigyan Kendra and the VSI, the former state minister said. An amount of Rs 25,000 may be needed initially for the cost incurred by a farmer. The war rooms will alert these farmers in a few seconds about steps to be taken during the cultivation, he said. This technology will work on forecasting, soil testing, watering alerts, limiting the use of pesticides and protecting nutrients of the soil, he added. The tonnage of sugarcane in Maharashtra has gone down, Dandegaonkar pointed out. "Due to less rainfall, the per acre production in the state has come down to 73 tonnes. The use of AI can definitely help us to reach at least 150 tonnes per acre in the near period," he said. "Farmers need to install drips in their fields for this (irrigation). We can expect the first such station (automated AI facility) installed and put into operation by August-end or the first week of September," he added. (Only the headline and picture of this report may have been reworked by the Business Standard staff; the rest of the content is auto-generated from a syndicated feed.)


Hindustan Times
37 minutes ago
- Hindustan Times
OpenAI taps Google in unprecedented cloud deal despite AI rivalry
OpenAI plans to add Alphabet's Google cloud service to meet its growing needs for computing capacity, three sources told Reuters, marking a surprising collaboration between two prominent competitors in the artificial intelligence sector. The deal, which has been under discussion for a few months, was finalized in May, one of the sources added. It underscores how massive computing demands to train and deploy AI models are reshaping the competitive dynamics in AI, and marks OpenAI's latest move to diversify its compute sources beyond its major supporter Microsoft, including its high-profile Stargate data center project. It is a win for Google's cloud unit, which will supply additional computing capacity to OpenAI's existing infrastructure for training and running its AI models, sources said, who requested anonymity to discuss private matters. The move also comes as OpenAI's ChatGPT poses the biggest threat to Google's dominant search business in years, with Google executives recently saying that the AI race may not be winner-take-all. OpenAI, Google and Microsoft declined to comment. Alphabet's stock was up 2.1% on Tuesday afternoon following the news, while Microsoft shares were down 0.6%. Scotiabank analysts called the development "somewhat surprising" in a note on Tuesday, highlighting the growth opportunities for Google's Cloud unit, while expressing caution regarding competition from ChatGPT. "The deal ... underscores the fact that the two are willing to overlook heavy competition between them to meet the massive computing demands. Ultimately, we view this as a big win for Google's cloud unit, but ... there are continued worries that ChatGPT is becoming an incrementally larger threat to Google's search dominance," the analysts wrote. Since ChatGPT burst onto the scene in late 2022, OpenAI has dealt with increasing demand for computing capacity - known in the industry as compute - for training large language models, as well as for running inference, which involves processing information so people can use these models. OpenAI said on Monday that its annualized revenue run rate surged to $10 billion as of June, positioning the company to hit its full-year target amid booming adoption of AI. Earlier this year, OpenAI partnered with SoftBank and Oracle on the $500 billion Stargate infrastructure program, and signed deals worth billions with CoreWeave for more compute. It is on track this year to finalize the design of its first in-house chip that could reduce its dependency on external hardware providers, Reuters reported in February. The partnership with Google is the latest of several maneuvers made by OpenAI to reduce its dependency on Microsoft, whose Azure cloud service had served as the ChatGPT maker's exclusive data center infrastructure provider until January. Google and OpenAI discussed an arrangement for months but were previously blocked from signing a deal due to OpenAI's lock-in with Microsoft, a source told Reuters. Microsoft and OpenAI are also in negotiations to revise the terms of their multibillion-dollar investment, including the future equity stake Microsoft will hold in OpenAI. For Google, the deal comes as the tech giant is expanding external availability of its in-house chip known as tensor processing units, or TPUs, which were historically reserved for internal use. That helped Google win customers including Big Tech player Apple as well as startups like Anthropic and Safe Superintelligence, two OpenAI competitors launched by former OpenAI leaders. Google's addition of OpenAI to its customer list shows how the tech giant has capitalized on its in-house AI technology from hardware to software to accelerate the growth of its cloud business. Google Cloud, whose $43 billion in sales comprised 12% of Alphabet's 2024 revenue, has positioned itself as a neutral arbiter of computing resources in an effort to outflank Amazon and Microsoft as the cloud provider of choice for a rising legion of AI startups whose heavy infrastructure demands generate costly bills. Alphabet faces market pressure to demonstrate financial returns on its AI-related capital expenditures, which are expected to hit $75 billion this year, while maintaining its bottom line against the threat of competing AI offerings, as well as antitrust enforcement. Google's DeepMind AI unit also competes directly with OpenAI and Anthropic in a race to develop the best models and integrate those advances into consumer applications. Selling computing power reduces Google's own supply of chips while bolstering capacity-constrained rivals. The OpenAI deal will further complicate how Alphabet CEO Sundar Pichai allocates the capacity between the competing interests of Google's enterprise and consumer business segments. Google already lacked sufficient capacity to meet its cloud customers' demands as of the last quarter, Chief Financial Officer Anat Ashkenazi told analysts in April. Although ChatGPT holds a large lead over Google's competing chatbot in terms of monthly users and analysts have predicted it could reduce Google's dominant search market share, Pichai has brushed aside concerns that OpenAI will usurp Google's business dominance. First Published Date: 11 Jun, 11:21 IST


Hans India
39 minutes ago
- Hans India
WWDC 2025: Apple Introduces Liquid Glass Tech and Next-Gen Live Translate
Apple launched Apple WWDC 2025 on June 9 while worldwide audiences watched for its response to President Donald Trump's mandate to manufacture in the U.S. to avoid high Trump tariffs on products produced in China and India. The software and developer-focused WWDC anticipates geopolitical concerns about Apple's financial stability in attendees' thoughts. The company Apple encounters increasing examination regarding its failure to deliver the promised Apple Intelligence (AI) features like the AI-enhanced Siri assistant on schedule. The company revealed new software updates for all its devices and products. The standout feature of the presentation was likely the new 'Apple Liquid Glass' design update which aims to provide a more visually appealing workflow for Apple users. WWDC 2025 highlights Apple executives gave only brief mention to the issues the company has faced in delivering the AI-enhanced Siri without providing a revised release deadline. Liquid Glass display technology shows continuous expansion and development as it gains capabilities to execute tasks from call screening to motivating runners. Users of Apple Watch, iPad, iPhone, and MacBook will receive software updates that improve both their visual experience and functionality while Visual Intelligence now displays screen content. Apple innovation 2025 update and the growing capabilities of Apple Intelligence are highlighted by Tim Cook. Before he ends his presentation he expresses his gratitude to Apple's developer community amidst a rainbow backdrop. The Apple iPad receives a fresh design update and incorporates many iOS 19 features that were previously announced like phone controls and gaming apps. In the new system developers created a window management system to improve multi-tasking capabilities. Users can adjust the size of apps/windows while enjoying improved pointer responsiveness. Apple's headset users will access spatial widgets which will reside within their virtual reality environments along with photo-to-portal conversion features and persistent app placement memory to maintain content stability.