logo
ByteDance advances DeepSeek work in AI reasoning with open-source project led by intern

ByteDance advances DeepSeek work in AI reasoning with open-source project led by intern

TikTok owner ByteDance, which has invested heavily in artificial intelligence (AI), has unveiled a new system that claims to improve on the work done by DeepSeek in training AI reasoning models.
Advertisement
DAPO, or Decoupled Clip and Dynamic Sampling Policy Optimisation, is a scalable reinforcement learning algorithm that helps a large language model (LLM) achieve better complex reasoning behaviour such as self-verification and iterative refinement, according to a research paper published earlier this week by ByteDance and Tsinghua University's Institute for AI Industry Research.
The algorithm outperformed the reinforcement learning approach in DeepSeek's R1 reasoning model, scoring 50 points in the American Invitational Mathematics Examination (AIME) 2024 using Alibaba Group Holding's Qwen2.5-32B base model, compared with 47 points attained by R1 when applying the same Alibaba model, the paper showed. Alibaba owns the South China Morning Post.
Notably, DAPO achieved the better result with 50 per cent fewer training steps.
TikTok owner ByteDance has invested heavily in artificial intelligence. Photo: Digitimes
The achievement drew positive academic and industry comments. Google DeepMind engineer Philipp Schmid, who shared the project on X, said the new method was 'better than' DeepSeek's 'group relative policy optimisation (GRPO)' in reinforcement learning. GRPO is one of DeepSeek's training methods that enables a model to learn by comparing different actions and making updates with a 'group' of observations.

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

AI content detector: why does China dismiss it as ‘superstition tech'?
AI content detector: why does China dismiss it as ‘superstition tech'?

South China Morning Post

time18 hours ago

  • South China Morning Post

AI content detector: why does China dismiss it as ‘superstition tech'?

With the graduation season approaching, many Chinese universities have introduced regulations setting clear requirements for the proportion of artificial intelligence -generated content – or the 'AI rate', as it is called – in theses. Advertisement Some universities have used the AI rate as a deciding factor in whether a thesis is approved. The rule is intended to prevent academic misconduct, as educators have become increasingly concerned about the unregulated use of AI in producing scholarly literature, including data falsification and content fabrication, since the public debut of generative AI models such as ChatGPT However, an official publication of the Ministry of Science and Technology has warned that using AI content detectors to identify AI writing is essentially a form of 'technological superstition' that could cause many unintended side effects. AI detection tools could produce false results, the Science and Technology Daily said in an editorial last Tuesday, adding that some graduates had complained that content clearly written by them was labelled as AI-generated. Advertisement Even a very famous Chinese essay written 100 years ago was evaluated as more than 60 per cent AI-generated, when analysed by these tools, the article said.

Beijing academy unveils open-source ‘RoboBrain' AI model for China's humanoid robots
Beijing academy unveils open-source ‘RoboBrain' AI model for China's humanoid robots

South China Morning Post

timea day ago

  • South China Morning Post

Beijing academy unveils open-source ‘RoboBrain' AI model for China's humanoid robots

The Beijing Academy of Artificial Intelligence (BAAI), a non-profit research laboratory in China, launched on Friday a series of new open-source artificial intelligence (AI) models designed to be the 'brain' of robots, as the country rushes to build smarter machines. Advertisement The use of powerful AI models in China's booming robotics industry could accelerate the development and adoption of humanoids, as the sector addresses challenges such as limited model capabilities and a lack of training data, according to BAAI head Wang Zhongyuan during the institute's annual conference in Beijing. Wang described BAAI's RoboBrain 2.0 as the world's most powerful open-source AI model designed to improve various types of robots, including humanoids. The launch of this general-purpose AI model coincides with the Chinese robotics industry's rapid growth, positioning BAAI as a potential major player in the local sector. Beijing Academy of Artificial Intelligence director Wang Zhongyuan speaks at the institute's annual conference on Friday. Photo: Handout 'We sincerely hope that various stakeholders in the embodied intelligence industry will collaborate with the Zhiyuan Institute,' Wang said, referring to the local name for BAAI. Advertisement 'Currently, we are partnering with over 20 leading companies in the sector and are looking for additional collaborators to drive growth.'

Beijing academy unveils open-source ‘RoboBrain' AI model for China's humanoid robots
Beijing academy unveils open-source ‘RoboBrain' AI model for China's humanoid robots

South China Morning Post

timea day ago

  • South China Morning Post

Beijing academy unveils open-source ‘RoboBrain' AI model for China's humanoid robots

The Beijing Academy of Artificial Intelligence (BAAI), a non-profit research laboratory in China, launched on Friday a series of new open-source artificial intelligence (AI) models designed to be the 'brain' of robots, as the country rushes to build smarter machines. The use of powerful AI models in China's booming robotics industry could accelerate the development and adoption of humanoids, as the sector addresses challenges such as limited model capabilities and a lack of training data, according to BAAI head Wang Zhongyuan during the institute's annual conference in Beijing. Wang described BAAI's RoboBrain 2.0 as the world's most powerful open-source AI model designed to improve various types of robots, including humanoids. The launch of this general-purpose AI model coincides with the Chinese robotics industry's rapid growth, positioning BAAI as a potential major player in the local sector. Beijing Academy of Artificial Intelligence director Wang Zhongyuan speaks at the institute's annual conference on Friday. Photo: Handout 'We sincerely hope that various stakeholders in the embodied intelligence industry will collaborate with the Zhiyuan Institute,' Wang said, referring to the local name for BAAI. 'Currently, we are partnering with over 20 leading companies in the sector and are looking for additional collaborators to drive growth.' According to Wang, RoboBrain 2.0 features significant upgrades in spatial intelligence and task planning, achieving 17 per cent faster performance and 74 per cent greater accuracy compared to its predecessor, which was introduced three months ago.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store