Latest news with #KaggleGameArena

When Magnus Carlsen met Elon Musk and played chess with Sam Altman, Mark Zuckerberg and Demis Hassabis

Indian Express

a day ago

Indian Express

When Magnus Carlsen met Elon Musk and played chess with Sam Altman, Mark Zuckerberg and Demis Hassabis

The world's greatest chess player of this generation Magnus Carlsen recently switched roles to do commentary instead of playing chess for an AI chess exhibition tournament on Google's Kaggle Game Arena. Carlsen had some sharp words for the world's leading general-purpose large language models (LLMs) such as xAI's Grok 4, OpenAI's o3, Gemini 2.5 Pro (Google), Gemini 2.5 Flash (Google), o4-mini (OpenAI), Claude 4 Opus (Anthropic), DeepSeek R1 and Kimi k2 (Moonshot AI). Carlsen was doing commentary for the Take Take Take app for the final of the AI chess exhibition tournament, where OpenAI's o3 (backed by Sam Altman) humbled xAI's Grok 4 (backed by Elon Musk). Carlsen at one point said that watching the world's top LLMs play was 'like watching kids' games'. But what does Carlsen think about the men behind the world's AI's movers-and-shakers? For one, Carlsen revealed that he had played a couple of events with some of the world's most influential men in the AI space. 'I have played in the same chess tournament as Sam Altman. It was an alternate moves tournament in Silicon Valley. It was around the Champions Chess Tour final in 2022. And I think Anish Giri was on the team with him. Sam wasn't very good,' Carlsen laughed. 'So Giri was very unhappy about being teamed with him. But he actually learned a lot through the experience. You could tell that he was very smart.' Carlsen also spoke about playing in a tournament with Facebook's Mark Zuckerberg. 'He didn't know much chess, but he was a little bit the same as Sam Altman. I thought he was even better. Was learning very, very quickly, he was forming his own opinions very quickly, which I thought was impressive. It's a useful skill. They were not necessarily right, which you wouldn't expect, but they were always well-reasoned,' Carlsen said. Carlsen then spoke about Musk and spoke about the X owner's disdain for chess, before explaining why he understands it to an extent. 'Elon I've seen in person but I haven't talked to him. You know he famously doesn't have the greatest respect for chess players or the game which, to some extent, I understand because it is not a very complicated game, right? It is very simple in many ways but I think that's also the beauty of the game. Like it's obviously simple since it took computers not that long to master it, right? Compared to some other games. But chess is also rich and very difficult to play as well. Like it's simple enough to play that you can get joy from playing after practice. Like not maybe immediately but after practice practicing a bit but hard enough that you can never actually get particularly good at it as a human which we've found out by seeing engines play,' Carlsen said. Carlsen shared a funny anecdote about playing chess with Demis Hassabis, the CEO of DeepMind Technologies and an AI Adviser to the UK Government, who was recently awarded a Nobel Prize in Chemistry. 'I played an alternate moves event with Demis quite a few years ago at the London Chess Classic. It was a lot of fun. Demis is fantastic. I've met him many times, but in terms of chess, he's kind of set in his ways: like he loves playing the Botvinnik System as white. And I asked him, 'Do you know anything else?' And he was like, 'No, not really. That's what I like to play.' And I was like, 'Yeah, fine. That's great.' And we actually beat some really good players in that line. It's a good system. But then it ended up where I think we had a tiebreaker in the finals. And as a tiebreak the two amateur players had to play each other, right? And then we discussed the game before and he said, 'My opponent's going to play 1.c4. What should I do?' And I said, 'What do you normally do?' Well, I play 1.c5, then I play g6 and so on. So I told him ways you could get around (the opponent's tricks in the opening). And lo and behold, the game starts, he forgets everything, and he ends up in a horrible passive position. He fought really well but we lost,' Carlsen said before adding: 'What he's done for chess is fantastic, but also what he's done for humanity with medicine is obviously even better.'

Gemini vs Grok vs Claude vs DeepSeek: Which AI tool is best for chess?

Time of India

4 days ago

Business
Time of India

Gemini vs Grok vs Claude vs DeepSeek: Which AI tool is best for chess?

A detailed view of a chess board and pieces (Photo by) The inaugural day of the AI chess exhibition tournament, hosted by Google's Kaggle Game Arena project, witnessed four Large Language Models (LLMs) securing dominant 4-0 victories to advance to the semifinals. Gemini 2.5 Pro, o4-mini, Grok 4, and o3 defeated their respective opponents Claude 4 Opus, DeepSeek R1, Gemini 2.5 Flash, and Kimi k2, showcasing the capabilities of general-purpose AI models in strategic gameplay. The Kaggle Game Arena, a new initiative by Google-owned Kaggle, aims to evaluate how LLMs perform in competitive environments. The tournament features eight leading LLMs competing in a single-elimination knockout bracket, with games broadcast live on multiple platforms. Go Beyond The Boundary with our YouTube channel. SUBSCRIBE NOW! Google has partnered with DeepMind to organise this unique tournament, where LLMs use a universal controller called "harness" to visualise positions and make moves. Each AI has four attempts to make a legal move, failing which results in losing the game. The match between Kimi k2 and o3 ended quickly, with none of the games lasting beyond eight moves. Kimi k2 consistently failed to make legal moves, despite showing the ability to follow opening theory for initial moves. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Cyprus investments might surprise you Investments Cyprus | Search Ads Get Info Undo O4-mini's victory against DeepSeek R1 displayed a pattern of strong opening moves followed by declining play quality. Despite the inconsistencies, o4-mini managed to achieve two checkmates during the match. "This is a side effect btw. @xAI spent almost no effort on chess," posted Elon Musk on X, responding to Grok 4's impressive performance in the tournament. Gemini 2.5 Pro's match against Claude 4 Opus featured more checkmates than illegal move forfeits. The first game showed both AIs maintaining good moves until move nine, when Claude 4 Opus made a critical error with 10...g5. Grok 4 delivered the strongest performance of the day, demonstrating particular skill in identifying and capitalising on undefended pieces in its match against Gemini 2.5 Flash. The tournament has revealed three primary challenges for LLMs in chess: visualising the entire board, understanding piece interactions, and making legal moves. These limitations vary among the different AI models. The competition continues on Wednesday, August 6, starting at 1 p.m. ET / 19:00 CEST / 10:30 p.m. IST. Viewers can watch the event live on GM Hikaru Nakamura's Twitch and YouTube channels, as well as on the tournament's dedicated events page. Catch Rani Rampal's inspiring story on Game On, Episode 4. Watch Here!

Latest news with #KaggleGameArena

When Magnus Carlsen met Elon Musk and played chess with Sam Altman, Mark Zuckerberg and Demis Hassabis

Gemini vs Grok vs Claude vs DeepSeek: Which AI tool is best for chess?

Get Started Now: Download the App