logo
Why the tech industry is obsessed with Chatbot Arena, the AI-ranking platform

Why the tech industry is obsessed with Chatbot Arena, the AI-ranking platform

Indian Express21-04-2025

With tech companies like OpenAI, Google, and Meta launching new AI models within weeks of each other, it's becoming increasingly difficult not only to keep track but also to assess how sophisticated each model truly is. That's where Chatbot Arena comes in—a free, crowdsourced benchmarking platform that tests newly launched AI models and pits them against each other across various parameters. In fact, ever since it was launched, Chatbot Arena has become the silicon valley's new obsession.
Here's everything you need to know about Chatbot Arena—how it works and why the crowdsourced ranking site has become so popular.
What is Chatbot Arena?
Most companies measure their AI models against a set of general capability benchmarks, but there is no industry-standard benchmark or universally accepted method for assessing large language models (LLMs). Founded in 2023 by researchers affiliated to UC Berkeley's Sky Computing Lab, Chatbot Arena has emerged as the most practical—and virtually the only—tool to determine which AI model is the best on the market.
Essentially, it's an interactive platform where users can pit multiple AI chatbots against each other in real-time conversations.
Ranking of various AI models on the chatbot arena.
What sets Chatbot Arena apart is that it allows AI models to converse freely across a wide range of topics, offering a more holistic assessment of their conversational skills. This is important criteria – after all, even small differences to factors like prompts, datasets, and formatting can have a huge impact on how a model performs.
In Chatbot Arena, users can interact with the chatbots, get side-by-side comparisons of various AI tools with complete weakness and strengths, and vote on which one performs better. The tool becomes a testing ground for AI developers, researchers, and anyone who is interested in benchmarking these AI tools.
Chatbot Arena recently transitioned into a full fledged company called LMArena, operating under Arena Intelligence Inc. The new company is co-founded by Dimitris Angelopoulos, Wei-Lin Chiang—another former UC Berkeley postdoctoral researcher—and Ion Stoica, a professor and tech entrepreneur. Chatbot Arena is funded through a combination of grants and donations, including support from Google 's Kaggle data science platform, Andreessen Horowitz, and Together AI.
How does Chatbot Arena works?
Perhaps the biggest reason why the AI benchmarking tool is so popular in the first place is that it makes it easy to compare two AI models side by side. Not only does Chatbot Arena allow users to pit the latest AI chatbots from OpenAI, Google, Anthropic, and Meta against each other, but its scoreboard also ranks over 100 AI models (developed by organisations or individuals) based on nearly 1.5 million votes. These rankings span a wide range of categories, including coding, long-form queries, mathematics, 'hard prompts,' and various languages such as English, French, Chinese, Japanese, Spanish, among others. It all adds up to why the AI benchmarking tool is so popular among the global community users.
The industry also praises Chatbot Arena for offering neutral benchmarking, making the platform largely free of bias—an important factor for objective comparisons between different AI models. Chatbot Arena has partnerships with OpenAI, Google, and Anthropic to make their flagship models available for the community to evaluate.
How to use Chatbot Arena
Chatbot Arena offers two ways of evaluate different AI models, and if you are keen to try the AI benchmarking tool, make sure to try their 'battle' modes. The first mode is the Arena Battle, where your prompts is answered by Model A and Model B
However, you don't know the model names until you click a button at the bottom. Then the model names appear. It's completely anonymous.
To use Chatbot Arena battle:
*Click OK on the popup that indicates this is a research preview.
*Make sure to read the Terms of use to better understand how the battle works, then scroll down to the field that reads Enter text and press Enter.
*Enter your prompt.
*Click the Send button.
Read the results and click the appropriate button. Now you should now see the name of the LLMs used for the battle.
Another way to evaluate two AI models is through a side-by-side comparison. This allows you to choose the AI models of your choice and see how they perform against each other. It's definitely a better approach—at the very least, it helps you identify which model best suits your needs
To use Chatbot Arena (side-by-side comparison):
*Open https://arena.lmsys.org/ in your web browser.
*Click OK to the research preview popup.
*Now click the top tab labeled 'Arena (side-by-side)'.
*Click in the field showing the model name.
*Select your model name from the drop-down list. You can also clear the field and start typing letters.
*Scroll down and enter your prompt.
*Click Send.
*Review your responses and then Cast your vote using the buttons at the bottom.
In case, you don't like the responses, you always click the Regenerate button.

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

French energy giant TotalEnergies reaffirms support for Adani Green's expansion in India
French energy giant TotalEnergies reaffirms support for Adani Green's expansion in India

India Gazette

time36 minutes ago

  • India Gazette

French energy giant TotalEnergies reaffirms support for Adani Green's expansion in India

By Shailesh Yadav Paris [France], 2 June (ANI): TotalEnergies is 'committed to continue to support the expansion of Adani Green,' Patrick Pouyanne, Chairman and CEO of the French energy giant, said on Monday. With Adani Green Energy Limited (AGEL )'s current capacity of 14 gigawatts and plans for continued expansion, the partnership with TotalEnergies is expected to play a crucial role in India's ambitious renewable energy targets and the country's transition toward cleaner energy sources. Pouyanne said this during his meeting with Indian Commerce and Industry Minister Piyush Goyal, who is currently on a three-day visit to France from June 1. 'We are committed to continue to support the expansion of Adani Green, which already has 14 gigawatts of capacity,' Pouyanne stated, adding, 'We will continue to support this growth.' The French energy major has invested approximately USD 5 billion in India over the past five years, focusing on natural gas infrastructure, city gas development, and renewable energy projects, particularly solar and wind installations, in partnership with Adani entities. TotalEnergies' involvement with Adani Green began in January 2021 when the company acquired a minority stake in the publicly listed renewable energy firm. As part of its strategy to strengthen renewable energy development in India, TotalEnergies also secured a 50 per cent stake in three joint ventures with AGEL operating renewable energy assets. During the Paris meetings, Pouyanne outlined TotalEnergies' broader Indian expansion plans, including increased energy exports from the United States, where the company serves as the largest energy exporter. The executive also highlighted plans to restart operations in Mozambique, which could provide additional energy supplies to India. This commitment marks a significant turnaround from TotalEnergies' position following allegations made by short-seller Hindenburg Research against the Adani Group last year. In response to those allegations, the French company had announced it would suspend further financial contributions to its Adani investments pending clarification of the accusations and their consequences. TotalEnergies emphasised that its investments in Adani entities were conducted in full compliance with applicable laws and the company's internal governance processes based on thorough due diligence and declarations provided by the investment partners. The renewed partnership comes as both companies look to capitalise on India's growing renewable energy market. Minister Goyal reportedly encouraged TotalEnergies to expand its presence in India further, building on the substantial investments already made in the country's energy infrastructure. The announcement also follows a memorandum of understanding signed with Gujarat State Petroleum Corporation (GSPC) during the India Energy Summit, signalling TotalEnergies' broader commitment to the Indian energy sector. (ANI)

Auto companies flag China's magnet supply risks
Auto companies flag China's magnet supply risks

Time of India

timean hour ago

  • Time of India

Auto companies flag China's magnet supply risks

NEW DELHI: Raising an alarm over challenges in procuring rare earth magnets from China, the auto industry has told govt that the matter will lead to stoppage of production of certain models from this week, while heading to a complete shutdown by the middle of next month. China has put the export curbs on rare earth magnets to ensure they are not used for making defence and weapon systems. This threatens launch of many new models, apart from disturbing the entire value chain across passenger vehicles, two-wheelers, and commercial vehicles. Companies at risk of production disruptions include, Maruti Suzuki, Mahindra, Hyundai and Kia, Hero Moto, TVS, and Bajaj Auto. Over the past month, auto companies discussed the matter with various govt departments to get clarity regarding the process to be adopted for obtaining the EUC (end-use case). On suggestions of the ministry of external affairs (MEA), they also had a discussion with the embassy of China in India. However, there has been no solution so far. The rare earth magnets are used for components like speedometers, electric motors, e-axles, electric water pumps, automatic transmission kits, speakers, sensors, and ignition coils in engines. Companies say an elaborate approval process has to be followed before the magnets can be procured, and a final approval from China's ministry of commerce is also required. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like 2025 Top Trending local enterprise accounting software [Click Here] Esseps Learn More Undo Heavy industries minister H D Kumaraswamy on Monday said govt is preparing to send a delegation of industry executives to China in two-three weeks to discuss the issue. Govt is working overtime to work out a solution to the issue, which began after the Chinese govt on April 4, imposed certain requirements in the export permit system for medium and heavy rare earth metals, its alloys, magnets and related products. On duration of inventory Maruti has before any impact on production, the automaker said it submitted an import application and it would be difficult to give "very specific details" until it receives a response. "It is not a restriction. It is an endorsement of end-use. In case there is an issue, we will inform all our stakeholders, including the stock exchange," Rahul Bharti, senior executive director, corporate affairs, said. Bajaj Auto said the issue could have a "serious impact" on their EV production by July. The industry's warning comes even as delegations of industry body Siam and component makers ACMA plan to visit China "at the earliest" to expedite permissions to procure the magnets used across automotive applications, both in petrol, diesel engines and EVs. Stay informed with the latest business news, updates on bank holidays and public holidays . AI Masterclass for Students. Upskill Young Ones Today!– Join Now

Donald Trump-Xi Jinping call ‘likely' this week, says White House, amid stalled trade tariff talks
Donald Trump-Xi Jinping call ‘likely' this week, says White House, amid stalled trade tariff talks

Mint

timean hour ago

  • Mint

Donald Trump-Xi Jinping call ‘likely' this week, says White House, amid stalled trade tariff talks

US President Donald Trump and Chinese President Xi Jinping are expected to speak this week, according to White House Press Secretary Karoline Leavitt. The call would come amid rising tensions after Trump accused Beijing of breaching last month's tariff rollback agreement, reached in Geneva, and Beijing asserting that Washington 'has made bogus charges and unreasonably accused China of violating the consensus". Leavitt is the third senior Trump official in recent days to suggest a phone call is imminent. The exact date and time of the conversation remain unconfirmed. A temporary US-China agreement to suspend tariffs for 90 days triggered a strong relief rally in global stock markets. Earlier this month, the two sides agreed to a temporary easing of trade tensions. China cut tariffs on American goods from 125% to 10% for 90 days, while the US proposed reducing its tariffs on Chinese imports from 145% to 30%. Despite this breakthrough, progress has since stalled amid new disputes, including US export controls on AI chips and the revocation of Chinese student visas. China warned that if the US continues on its current path, it 'will continue to resolutely take strong measures to uphold its legitimate rights and interests.' However, the temporary ceasefire failed to address fundamental US grievances over China's export-driven, state-led economic practices. These include issues like forced technology transfers, industrial subsidies, and limited market access for foreign firms. While the short-term tariff freeze offers breathing room, it leaves the more complex issues to be hashed out in future negotiations. Trump reignited the US-China trade war on Friday with an explosive post on Truth Social, accusing Beijing of failing to honor the recent tariff rollback agreement. 'The bad news is that China, perhaps not surprisingly to some, HAS TOTALLY VIOLATED ITS AGREEMENT WITH US. So much for being Mr. NICE GUY!' Trump wrote. Trump did not provide specific details about how China allegedly broke the deal, but claimed the violations were severe and deliberate. The comments come less than a month after both nations agreed in Geneva to reduce tit-for-tat tariffs for a 90-day cooling-off period. In the same post, Trump claimed his aggressive tariffs had left China's economy 'in grave danger,' leading to factory closures and unrest. 'Two weeks ago China was in grave economic danger! The very high Tariffs I set made it virtually impossible for China to TRADE into the United States marketplace,' he said. He further claimed that a wave of 'mild civil unrest' in China prompted him to pursue a quick resolution. 'I saw what was happening and didn't like it, for them, not for us. I made a FAST DEAL with China in order to save them… and I didn't want to see that happen,' Trump added. China's Commerce Ministry responded swiftly and sharply, rejecting Trump's accusations and reaffirming its commitment to the Geneva consensus. 'China has been firm in safeguarding its rights and interests, and sincere in implementing the consensus,' the ministry said, according to AFP. Beijing also accused Washington of 'unreasonably' blaming China while taking discriminatory actions of its own. 'Washington has made bogus charges and unreasonably accused China of violating the consensus, which is seriously contrary to the facts,' the statement said. 'We urge the U.S. to meet China halfway, immediately correct its wrongful actions, and jointly uphold the consensus from the Geneva trade talks.' With President Trump and Chinese leader Xi Jinping expected to speak in the coming days, the future of the trade truce hangs in the balance. Trump's accusations and Beijing's stern rebuttal signal that tensions remain high despite diplomatic efforts. Separately, the US trade court ruled on Wednesday that President Donald Trump exceeded his legal authority by using emergency powers to impose the majority of his tariffs on Chinese and other foreign goods. The ruling cast doubt on the legality of the broader tariff regime enacted during the Trump administration. But in a swift reversal, a federal appeals court temporarily reinstated those tariffs less than 24 hours later. The court issued a stay on the lower court's decision while it reviews the government's appeal. It set a fast-track schedule, ordering the plaintiffs to respond by June 5 and the Biden administration to reply by June 9.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store