Latest news with #DeepSeek-GRM

China's DeepSeek Teams Up With Tsinghua University To Raise AI Bar, Boost Reasoning Capabilities

Yahoo

07-04-2025

Business

China's DeepSeek, in collaboration with researchers from Tsinghua University, developed a technique to improve the reasoning capabilities of large language models (LLMs) that combines generative reward modeling (GRM) and self-principled critique tuning, SCMP reported, citing a paper published on Friday. The dual approach aims to enable LLMs to deliver better and faster results to general queries. The DeepSeek-GRM models outperformed existing methods, according to SCMP, which cited the researchers. DeepSeek aimed to make the GRM models open source.

The emergence of DeepSeek and its claims of affordable AI models fueled a $1 trillion market wipeout in the U.S. and a domestic price war, prompting Chinese Big Tech companies to roll out affordable AI models of their own. In March, DeepSeek said its upgraded V3 model offered enhanced reasoning capabilities, optimized front-end web development, and upgraded Chinese writing proficiency. In February, it also open-sourced five of its code repositories. In late February, DeepSeek founder Liang Wenfeng participated in a symposium with tech entrepreneurs hosted by Chinese President Xi Jinping in Beijing.

Chinese e-commerce juggernaut Alibaba Group Holding (NYSE:BABA) plans to release an upgraded version of its flagship AI model by April. DeepSeek's claims prompted China's tech leaders to flood the market with affordable AI services. OpenAI, Alphabet Inc's (NASDAQ:GOOG) (NASDAQ:GOOGL) Google, and Anthropic have similarly released new models. Meta Platforms Inc (NASDAQ:META) announced the release of its new Llama 4 artificial intelligence models, which the company describes as being built on one of the world's most advanced large language models.

The iShares China Large-Cap ETF (NYSE:FXI) has gained 10% year-to-date, while the Invesco QQQ Trust (NASDAQ:QQQ) has lost over 17%.
This article originally appeared on Benzinga. © 2025 Benzinga.

DeepSeek unveils new AI reasoning method as anticipation for its next-gen model rises

South China Morning Post

05-04-2025

Business

Chinese artificial intelligence (AI) start-up DeepSeek has introduced a novel approach to improving the reasoning capabilities of large language models (LLMs), as the public awaits the release of the company's next-generation model.

In collaboration with researchers from Tsinghua University, DeepSeek developed a technique that combines methods referred to as generative reward modelling (GRM) and self-principled critique tuning, according to a paper published on Friday. The dual approach aims to enable LLMs to deliver better and faster results to general queries.

The resulting DeepSeek-GRM models outperformed existing methods, having 'achieved competitive performance' with strong public reward models, the researchers wrote. Reward modelling is a process that guides an LLM towards human preferences.

DeepSeek intended to make the GRM models open source, according to the researchers, but they did not give a timeline.

The academic paper, published on the online scientific paper repository arXiv, comes amid speculation about the start-up's next move following the global attention garnered by the firm's V3 foundation model and R1 reasoning model.

Reuters reported last month that DeepSeek-R2, the successor to R1, could be released as soon as this month, as the company rushes to capitalise on its rising profile. The release of DeepSeek-R1 rocked the global tech community with its cost-efficient performance that rivalled leading models.

DeepSeek has remained tight-lipped about the rumoured R2 release. It has not commented on the matter through official public channels, but a customer service account denied the report in a group chat with business clients, Chinese media outlets reported last month.
