logo
#

Latest news with #R1Model

China's DeepSeek Upgrades R1 AI Model, Narrowing Gap with Western Counterparts
China's DeepSeek Upgrades R1 AI Model, Narrowing Gap with Western Counterparts

Entrepreneur

time2 days ago

  • Business
  • Entrepreneur

China's DeepSeek Upgrades R1 AI Model, Narrowing Gap with Western Counterparts

The latest version of the R1 model reportedly performs just below OpenAI's o3 and o4-mini, based on evaluations by LiveCodeBench You're reading Entrepreneur India, an international franchise of Entrepreneur Media. Chinese AI startup DeepSeek has quietly released an upgraded version of its R1 model, 'DeepSeek-R1-0528', on the open-source platform Hugging Face, claiming substantial improvements in mathematical reasoning, programming, and logic, alongside a reduction in hallucination rates. The update positions DeepSeek's model closer to top-tier systems such as OpenAI's o3 and Google's Gemini 2.5 Pro, according to performance data cited on Hugging Face. The company's low-key release strategy continues its trend of disrupting the AI landscape without formal fanfare. "In terms of comprehensive performance, the updated model is approaching the level of the industry's leading systems," DeepSeek said in its Hugging Face release. DeepSeek first drew global attention in January with the debut of its original R1 model, which delivered competitive performance despite limited resources. The company's progress raised concerns in the West, with R1's launch contributing to brief market volatility and a dip in Nvidia's stock price. The latest version of the R1 model reportedly performs just below OpenAI's o3 and o4-mini, based on evaluations by LiveCodeBench, an AI benchmarking platform. The model's capabilities are built on cost-efficiency and reasoning-focused design—traits that have made DeepSeek a symbol of China's broader AI ambitions. Founded in Hangzhou, DeepSeek has become a key player in China's push to rival American AI giants, navigating restrictions on advanced semiconductors by focusing on optimisation and resourceful development. The company's founder, Liang Wenfeng, has gained domestic prominence, recently participating in a high-level economic forum chaired by President Xi Jinping. His rise reflects growing confidence in China's home-grown AI talent, amid intensifying global competition in the sector.

Chinese AI start-up DeepSeek pushes US rivals with R1 model upgrade
Chinese AI start-up DeepSeek pushes US rivals with R1 model upgrade

Yahoo

time2 days ago

  • Business
  • Yahoo

Chinese AI start-up DeepSeek pushes US rivals with R1 model upgrade

By Brenda Goh and Eduardo Baptista SHANGHAI/BEIJING -Chinese artificial intelligence startup DeepSeek released the first update to its hit R1 reasoning model in the early hours of Thursday, stepping up competition with U.S. rivals such as OpenAI. DeepSeek said via developer platform Hugging Face that R1-0528 was a minor version upgrade of R1 that nevertheless significantly improved its depth of reasoning and inference capabilities, including better handling of complex tasks, bringing its performance closer to OpenAI's o3 reasoning models and Google's Gemini 2.5 Pro. The launch of R1 in January went globally viral, sent tech shares outside China plummeting, and challenged the view that scaling AI requires vast computing power and investment. Since R1's release, Chinese tech giants like Alibaba and Tencent have released models claiming to surpass DeepSeek's. Thursday's update was initially light on details in contrast to the launch of R1 in January which was accompanied by a multi-authored academic paper that the AI community worldwide has parsed to understand the firm's strategies. The Hangzhou-based firm said later in a short post on X that R1-0528 featured improved performance. In a longer post on WeChat, DeepSeek said the rate of "hallucinations", false or misleading output, was reduced by about 45-50% in scenarios such as rewriting and summarizing. It said the update also enabled it to creatively write essays, novels and other genres, and had improved capabilities in areas such as generating front-end code and role-playing. "The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic," DeepSeek said. DeepSeek's success has upended beliefs that U.S. export controls were holding back China's AI advancements, after it released AI models that were on a par or better than industry-leading models in the United States at a fraction of the cost. The startup added on Thursday that a variant of its update was created by taking the reasoning process used by the R1-0528 model, to then further enhance Chinese tech giant Alibaba's Qwen 3 8B Base model, a process known as distillation. The result was a performance surpassing the original Qwen 3 model by over 10%. "We believe that the chain-of-thought from DeepSeek-R1-0528 will hold significant importance for both academic research on reasoning models and industrial development focused on small-scale models," DeepSeek added. Bloomberg reported the update on Wednesday. It said that a DeepSeek representative had told a WeChat group it had completed what it described as a "minor trial upgrade" and that users could start testing it. In response to competition from Deepseek, Google's Gemini has introduced discounted tiers of access while OpenAI cut prices and released an o3 Mini model that relies on less computing power. Deepseek is still widely expected to release R2, a successor to R1. Reuters reported in March, citing sources, that R2's release was initially planned for May. DeepSeek also released an upgrade to its V3 large language model in March.

China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year
China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year

CNN

time3 days ago

  • Business
  • CNN

China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year

Chinese artificial intelligence startup DeepSeek released an update to its R1 reasoning model in the early hours of Thursday, stepping up competition with US rivals such as OpenAI. DeepSeek launched R1-0528 on developer platform Hugging Face, but has yet to make an official public announcement. It did not publish a description of the model or comparisons. But the LiveCodeBench leaderboard, a benchmark developed by researchers from UC Berkeley, MIT, and Cornell, ranked DeepSeek's updated R1 reasoning model just slightly behind OpenAI's o4 mini and o3 reasoning models on code generation and ahead of xAI's Grok 3 mini and Alibaba's Qwen 3. Bloomberg earlier reported the update on Wedneday. It said that a DeepSeek representative had told a WeChat group that it had completed what it described as a 'minor trial upgrade' and that users could start testing it. DeepSeek earlier this year upended beliefs that US export controls were holding back China's AI advancements after the startup released AI models that were on a par with or better than industry-leading models in the United States at a fraction of the cost. The launch of R1 in January sent tech shares outside China plummeting in January and challenged the view that scaling AI requires vast computing power and investment. Since R1's release, Chinese tech giants like Alibaba and Tencent have released models claiming to surpass DeepSeek's. Google's Gemini has introduced discounted tiers of access while OpenAI cut prices and released an o3 mini model that relies on less computing power. The company is still widely expected to release R2, a successor to R1. Reuters reported in March, citing sources, that R2's release was initially planned for May. DeepSeek also released an upgrade to its V3 large language model in March.

China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year
China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year

Yahoo

time3 days ago

  • Business
  • Yahoo

China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year

Chinese artificial intelligence startup DeepSeek released an update to its R1 reasoning model in the early hours of Thursday, stepping up competition with US rivals such as OpenAI. DeepSeek launched R1-0528 on developer platform Hugging Face, but has yet to make an official public announcement. It did not publish a description of the model or comparisons. But the LiveCodeBench leaderboard, a benchmark developed by researchers from UC Berkeley, MIT, and Cornell, ranked DeepSeek's updated R1 reasoning model just slightly behind OpenAI's o4 mini and o3 reasoning models on code generation and ahead of xAI's Grok 3 mini and Alibaba's Qwen 3. Bloomberg earlier reported the update on Wedneday. It said that a DeepSeek representative had told a WeChat group that it had completed what it described as a 'minor trial upgrade' and that users could start testing it. DeepSeek earlier this year upended beliefs that US export controls were holding back China's AI advancements after the startup released AI models that were on a par with or better than industry-leading models in the United States at a fraction of the cost. The launch of R1 in January sent tech shares outside China plummeting in January and challenged the view that scaling AI requires vast computing power and investment. Since R1's release, Chinese tech giants like Alibaba and Tencent have released models claiming to surpass DeepSeek's. Google's Gemini has introduced discounted tiers of access while OpenAI cut prices and released an o3 mini model that relies on less computing power. The company is still widely expected to release R2, a successor to R1. Reuters reported in March, citing sources, that R2's release was initially planned for May. DeepSeek also released an upgrade to its V3 large language model in March. Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year
China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year

CNN

time3 days ago

  • Business
  • CNN

China's DeepSeek releases update to AI model that sent US shares tumbling earlier this year

Chinese artificial intelligence startup DeepSeek released an update to its R1 reasoning model in the early hours of Thursday, stepping up competition with US rivals such as OpenAI. DeepSeek launched R1-0528 on developer platform Hugging Face, but has yet to make an official public announcement. It did not publish a description of the model or comparisons. But the LiveCodeBench leaderboard, a benchmark developed by researchers from UC Berkeley, MIT, and Cornell, ranked DeepSeek's updated R1 reasoning model just slightly behind OpenAI's o4 mini and o3 reasoning models on code generation and ahead of xAI's Grok 3 mini and Alibaba's Qwen 3. Bloomberg earlier reported the update on Wedneday. It said that a DeepSeek representative had told a WeChat group that it had completed what it described as a 'minor trial upgrade' and that users could start testing it. DeepSeek earlier this year upended beliefs that US export controls were holding back China's AI advancements after the startup released AI models that were on a par with or better than industry-leading models in the United States at a fraction of the cost. The launch of R1 in January sent tech shares outside China plummeting in January and challenged the view that scaling AI requires vast computing power and investment. Since R1's release, Chinese tech giants like Alibaba and Tencent have released models claiming to surpass DeepSeek's. Google's Gemini has introduced discounted tiers of access while OpenAI cut prices and released an o3 mini model that relies on less computing power. The company is still widely expected to release R2, a successor to R1. Reuters reported in March, citing sources, that R2's release was initially planned for May. DeepSeek also released an upgrade to its V3 large language model in March.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store