ByteDance advances DeepSeek work in AI reasoning with open-source project led by intern

21-03-2025

TikTok owner ByteDance, which has invested heavily in artificial intelligence (AI), has unveiled a new system that it says improves on the work done by DeepSeek in training AI reasoning models.
DAPO, or Decoupled Clip and Dynamic Sampling Policy Optimisation, is a scalable reinforcement learning algorithm that helps a large language model (LLM) achieve better complex reasoning behaviour such as self-verification and iterative refinement, according to a research paper published earlier this week by ByteDance and Tsinghua University's Institute for AI Industry Research.
The algorithm outperformed the reinforcement learning approach in DeepSeek's R1 reasoning model, scoring 50 points in the American Invitational Mathematics Examination (AIME) 2024 using Alibaba Group Holding's Qwen2.5-32B base model, compared with 47 points attained by R1 when applying the same Alibaba model, the paper showed. Alibaba owns the South China Morning Post.
Notably, DAPO achieved the better result with 50 per cent fewer training steps.
The achievement drew positive academic and industry comments. Google DeepMind engineer Philipp Schmid, who shared the project on X, said the new method was 'better than' DeepSeek's 'group relative policy optimisation (GRPO)' in reinforcement learning. GRPO is one of DeepSeek's training methods that enables a model to learn by comparing different actions and making updates with a 'group' of observations.
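The 'group' comparison described above can be illustrated with a minimal Python sketch. It assumes the commonly described form of GRPO, in which several answers are sampled for the same prompt and each answer's reward is normalised against the mean and standard deviation of its own group. The function name, the example rewards, the use of the population standard deviation and the small `eps` constant are illustrative assumptions, not code from DeepSeek or ByteDance.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sampled answer relative to the other answers in its group.

    rewards: one scalar reward per answer sampled for the same prompt.
    eps: small constant (an assumption) that avoids division by zero
         when every answer in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation, an assumption
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical group of four answers: 1.0 = correct, 0.0 = incorrect.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

# Answers that beat the group average get a positive advantage,
# answers below the average get a negative one.
for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.1f} advantage={advantage:+.3f}")
```

In this sketch a group in which every answer earns the same reward yields all-zero advantages, so it would contribute no learning signal to the update.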

Chinese start-up joins NetDragon-owned Cherrypicks to push AI solutions overseas

South China Morning Post

5 hours ago

Beijing-based Zhongke WengAI, whose services are used by various Chinese ministries and state media outlets, will also jointly develop enterprise AI solutions for industries such as finance and healthcare with Cherrypicks, which is owned by Hong Kong-listed NetDragon Websoft, the partners said in a statement on Friday. This collaboration 'exemplifies the convergence of China's AI "go-global" strategy with Hong Kong's innovation strengths', said Simon Leung Lim-kin, vice-chairman at NetDragon. He also pointed out that the strategic partnership would help 'further cement Hong Kong's position as an international innovation hub'. Shares of NetDragon closed unchanged at HK$11.61 on Friday. The partnership reflects efforts by Chinese AI firms to expand the reach of their operations beyond the mainland, while bolstering Hong Kong's campaign to reposition itself as an international innovation and technology hub.

China's leading supplier of 12-inch silicon wafers given greenlight for Shanghai IPO

South China Morning Post

19 hours ago

Eswin, China's largest supplier of 12-inch silicon wafers, has received approval for an initial public offering (IPO) in Shanghai, the latest sign of increased efforts by Chinese chip companies to raise funds amid the rapid development of artificial intelligence (AI) and the country's tech self-sufficiency drive. The company, which produces monocrystalline silicon polished wafers and epitaxial wafers for the manufacture of integrated circuits, received approval on Thursday for a listing on the Nasdaq-style Star Market, to raise 4.9 billion yuan (US$682.8 million), according to information released by the Shanghai Stock Exchange. Eswin's capability in supplying 12-inch wafers is seen as an important asset for China's strategic drive towards semiconductor self-sufficiency amid the US-China tech war. 'The era of AI demands greater computing power, faster data transmission, larger data storage, and more responsive human-computer interaction,' the company said in its updated prospectus filed last week. 'To achieve these functional technologies and processes, the most mainstream and advanced logic and memory chips as well as some high-end analogue and sensor chips are manufactured using 12-inch wafers.' The company said it supplied silicon wafers to 'first-tier wafer foundries' and mainstream memory chipmakers in China, without disclosing the customer names. Three years after its founding in Beijing in 2016, Eswin secured the expertise of Wang Dongsheng, the founder of BOE Technology, the world's No 1 display maker and supplier to Apple and Huawei Technologies.

Alibaba's new AI agent to 'revolutionise' how merchants source goods online

South China Morning Post

2 days ago

Alibaba Group Holding's international commerce arm on Thursday released an artificial intelligence agent to help merchants source products and supplies, a development that could change the way online business is conducted. The Accio Agent unveiled by Alibaba International Digital Commerce Group (AIDC) was designed to 'revolutionise international commerce' by automating 70 per cent of the traditional time-consuming work, including product ideation, prototyping, compliance checks and supplier sourcing, the company said in a statement. The AI agent marks a step forward in the process of 'agentic purchase', which is the use of AI agents to handle everything from product discovery to fulfilment. It could fundamentally change existing models of online search, advertising and e-commerce as tech giants roll out their own agents. Alibaba said the agent could reduce weeks of market research and product sourcing work to just a few minutes. This would cut costs and speed up tasks for merchants, some of whom are small and medium-sized businesses run by solo entrepreneurs, enabling them to streamline their operations. '[Accio Agent] is designed to help you do business,' Zhang Kuo, vice-president at AIDC, said, adding that it could handle multiple tasks simultaneously and operate like a team of professionals such as sourcing specialists, developers and engineers, and market researchers.
