Which AI model is best? Big chess battle between top engines to decide


First Post, 2 days ago
Eight top AI models from Google, OpenAI, Anthropic, and more are battling it out in the Kaggle AI Chess Exhibition Tournament to find out which model is best at chess. All four Day 1 matches ended in dominant 4-0 wins. The semi-finals will be held on Wednesday, August 6, and the final is slated for Thursday.
The world's top AI models are competing in a chess exhibition tournament to find out which is the best. Image: AFP
A new and exciting tournament is being held to find out which AI model is the best at playing chess. The event is being organised by Kaggle Game Arena in partnership with Google DeepMind. It is called the AI Chess Exhibition Tournament, and is being held from August 5 to August 7.
This special tournament will feature eight popular Large Language Models (LLMs) from top companies like Google, OpenAI, Anthropic, and more. These models are not made for chess, but for other things like coding, writing, and answering questions. However, they now face each other on the chessboard.
The competing AIs and format
Gemini 2.5 Pro (Google)
Gemini 2.5 Flash (Google)
o3 (OpenAI)
o4-mini (OpenAI)
Claude Opus 4 (Anthropic)
Grok 4 (xAI, Elon Musk's company)
DeepSeek R1 (DeepSeek)
Kimi K2 (Moonshot AI)
The tournament follows a knockout format, which means a model is eliminated as soon as it loses a match. The AIs will be ranked based on how they play, and the results will be shared on Chess.com and on Kaggle. American Grandmaster Hikaru Nakamura is also covering the event on his Twitch channel.
AI Chess Exhibition Tournament Day 1 results
Day 1 of the Kaggle AI Chess Tournament saw dominant performances, with four models, OpenAI's o4-mini and o3, Google's Gemini 2.5 Pro, and xAI's Grok 4, cruising into the semifinals with identical 4-0 wins.
OpenAI's models impressed early, as o4-mini and o3 swept past DeepSeek R1 and Kimi K2 Instruct respectively. On the other side of the bracket, Google's Gemini 2.5 Pro outplayed Anthropic's Claude Opus 4, while xAI's Grok 4 defeated Gemini 2.5 Flash in a surprise result.
In the semifinals on August 6, o4-mini will face o3, and Gemini 2.5 Pro will lock horns with Grok 4. The gold-medal match and bronze-medal match will be held on August 7.
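For readers who want to see how the knockout bracket works, here is a minimal Python sketch. It records the Day 1 results reported above and pairs the winners of adjacent matches for the next round. The `Match` type, the `next_round_pairings` helper, and the order in which the quarterfinals are listed are assumptions made for illustration, not part of Kaggle's tooling.

```python
from typing import NamedTuple


class Match(NamedTuple):
    winner: str
    loser: str
    score: str


# Day 1 results as reported in this article.
# The bracket order of the four matches is an assumption.
QUARTERFINALS = [
    Match("o4-mini", "DeepSeek R1", "4-0"),
    Match("o3", "Kimi K2", "4-0"),
    Match("Gemini 2.5 Pro", "Claude Opus 4", "4-0"),
    Match("Grok 4", "Gemini 2.5 Flash", "4-0"),
]


def next_round_pairings(matches):
    """Pair the winners of adjacent matches: (1st, 2nd), (3rd, 4th), ..."""
    winners = [m.winner for m in matches]
    return [(winners[i], winners[i + 1]) for i in range(0, len(winners), 2)]


SEMIFINALS = next_round_pairings(QUARTERFINALS)

if __name__ == "__main__":
    for first, second in SEMIFINALS:
        print(f"{first} vs {second}")
```

Under these assumptions, the derived pairings match the semifinal line-up announced for August 6.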
In 2017, Google DeepMind's AlphaZero shocked the world when it taught itself chess in about four hours and beat Stockfish, widely considered the world's best chess engine. Unlike AlphaZero, however, the models in this tournament are general-purpose language models that were not built for chess.
Even so, the tournament should be interesting to watch, as it will show which of these AI models is best at chess.

