logo

DeepSeek rushes to launch new AI model as China goes all in

Zawya25-02-2025

DeepSeek is looking to press home its advantage.
The Chinese startup triggered a $1 trillion-plus sell-off in global equities markets last month with a cut-price AI reasoning model that outperformed many Western competitors. Now, the Hangzhou-based firm is accelerating the launch of the successor to January's R1 model, according to three people familiar with the company.
Deepseek had planned to release R2 in early May but now wants it out as early as possible, two of them said, without providing specifics. The company says it hopes the new model will produce better coding and be able to reason in languages beyond English. Details of the accelerated timeline for R2's release have not been previously reported.
DeepSeek did not respond to a request for comment for this story.
Rivals are still digesting the implications of R1, which was built with less-powerful Nvidia chips but is competitive with those developed at the costs of hundreds of billions of dollars by U.S. tech giants. "The launch of DeepSeek's R2 model could be a pivotal moment in the AI industry," said Vijayasimha Alilughatta, chief operating officer of Indian tech services provider Zensar. DeepSeek's success at creating cost-effective AI models "would likely spur companies worldwide to accelerate their own efforts ... breaking the stranglehold of the few dominant players in the field," he said. R2 is likely to worry the U.S. government, which has identified leadership of AI as a national priority. Its release may further galvanize Chinese authorities and companies, dozens of which say they have started integrating DeepSeek models into their products. Little is known about DeepSeek, whose founder Liang Wenfeng became a billionaire through his quantitative hedge fund High-Flyer. Liang, who was described by a former employer as "low-key and introverted," has not spoken to any media since July 2024.
Reuters interviewed a dozen former employees, as well as quant fund professionals knowledgeable about the operations of DeepSeek and its parent company High-Flyer. It also reviewed state media articles, social-media posts from the companies and research papers dating back to 2019.
They told a story of a company that functioned more like a research lab than a for-profit enterprise and was unencumbered by the hierarchical traditions of China's high-pressure tech industry, even as it became responsible for what many investors see as the latest breakthrough in AI.
DIFFERENT PATH
Liang was born in 1985 in a rural village in the southern province of Guangdong. He later obtained communication engineering degrees at the elite Zhejiang University.
One of his first jobs was running a research department at a smart imaging firm in Shanghai. His then-boss, Zhou Chaoen, told state media on Feb. 9 that Liang had hired prize-winning algorithm engineers and operated with a "flat management style."
At DeepSeek and High-Flyer, Liang has similarly shunned the practices of Chinese tech giants known for rigid top-down management, low pay for young employees and "996" - working from 9 a.m. to 9 p.m. six days a week.
Liang opened his Beijing office within walking distance of Tsinghua University and Peking University, China's two most prestigious education institutions. He regularly delved into technical details and was happy to work alongside Gen-Z interns and recent graduates that comprised the bulk of its workforce, according to two former employees. They also described usually working eight-hour days in a collaborative atmosphere.
"Liang gave us control and treated us as experts. He constantly asked questions and learned alongside us," said 26-year-old researcher Benjamin Liu, who left the company in September. "DeepSeek allowed me to take ownership of critical parts of the pipeline, which was very exciting."
Liang did not respond to questions sent via DeepSeek.
While Baidu and other Chinese tech giants were racing to build their consumer-facing versions of ChatGPT in 2023 and profit off of the global AI boom, Liang told Chinese media outlet Waves last year that he deliberately avoided spending heavily on app development, focusing instead on refining the AI model's quality.
Both DeepSeek and High-Flyer are known for paying generously, according to three people familiar with its compensation practices. At High-Flyer, it is not uncommon for a senior data scientist to make 1.5 million yuan annually, while competitors rarely pay more than 800,000, said one of the people, a rival quant fund manager who knows Liang. The largesse was funded by High-Flyer, which became one of China's most successful quant funds and, even after a government crackdown on the sector, still manages tens of billions of yuan, according to two people in the industry.
COMPUTING POWER
DeepSeek's success with a low-cost AI model is based on High-Flyer's decade-long and substantial investment in research and computing power, three people said.
The quant fund was an earlier pioneer in AI trading and a top executive said in 2020 that High-Flyer was going "all in" on AI by re-investing 70% of its revenue, mostly into AI research.
High-Flyer spent 1.2 billion yuan on two supercomputing AI clusters in 2020 and 2021. The second cluster, Fire-Flyer II, was made up of around 10,000 Nvidia A100 chips, used for training AI models.
DeepSeek had not been established at that time, so the accumulation of computing power caught the attention of Chinese securities regulators, said a person with direct knowledge of officials' thinking.
"Regulators wanted to know why they need so many chips?" the person said. "How they were going to use it? What kind of impact would that have on the market?"
Authorities decided not to intervene, in a move that would prove crucial for DeepSeek's fortunes: the U.S. banned the export of A100 chips to China in 2022, at which point Fire-Flyer II was already in operation.
Beijing now celebrates DeepSeek, but has instructed it not to engage with the media without approval, according to a person familiar with Chinese official thinking.
Authorities had asked Liang to keep a low-profile because they were worried that too much hype in the media would draw unnecessary attention, the person said.
China's cabinet and commerce ministry, as well as China's securities regulator, did not respond to requests for comment.
As one of the few companies with a large A100 cluster, High-Flyer and DeepSeek were able to attract some of China's best research talent, two former employees said. "The key advantage of vast (computing) resources is that it allows for large-scale experimentation," said Liu, the former employee.
Some Western AI entrepreneurs, like Scale AI CEO Alexandr Wang, have claimed that DeepSeek had as many as 50,000 higher-end Nvidia chips that are banned for export to China. He has not produced evidence for the allegation or responded to Reuters' requests to provide proof. DeepSeek has not responded to Wang's claims. Two former employees attributed the company's success to Liang's focus on more cost-effective AI architecture.
The startup used techniques like Mixture-of-Experts (MoE) and multihead latent attention (MLA), which incur far lower computing costs, its research papers show.
The MoE technique divides an AI model into different areas of expertise and activates only those related to a query, as opposed to more common architectures that use the entire model.
MLA architecture allows a model to process different aspects of one piece of information simultaneously, helping it detect key details more effectively.
While competitors like France's Mistral have developed models based on MoE, DeepSeek was the first firm to depend heavily on this architecture while achieving parity with more expensively built models.
DeepSeek's pricing was 20 to 40 times cheaper than what OpenAI charged for equivalent models, analysts at Bernstein brokerage estimated in early February.
For now, Western and Chinese tech giants have signaled plans to continue heavy AI spending, but DeepSeek's success with R1 and its earlier V3 model has prompted some to alter strategies. OpenAI cut prices this month, while Google's Gemini has introduced discounted tiers of access. Since R1's launch, OpenAI has also released an O3-Mini model that relies on less computing power.
Adnan Masood of U.S. tech services provider UST told Reuters that his laboratory had run benchmarks that found R1 often used three times as many tokens, or units of data processed by the AI model, for reasoning as OpenAI's scaled-down model.
STATE EMBRACE
Even before R1 gripped global attention, there were signs that DeepSeek had caught Beijing's favor. In January, state media reported that Liang attended a meeting with Chinese Premier Li Qiang in Beijing as the designated representative of the AI sector, ahead of the leaders of better-known firms.
The subsequent fanfare over the cost competitiveness of its models has buoyed Beijing's belief that it can out-innovate the U.S., with Chinese companies and government bodies embracing DeepSeek models at a pace that has not been offered to other firms.
At least 13 Chinese city governments and 10 state-owned energy companies say they have deployed DeepSeek into their systems, while tech giants Lenovo, Baidu and Tencent - owner of China's largest social media app WeChat - have integrated DeepSeek's models into their products.
Chinese leader Xi Jinping and Li "have signalled they endorse DeepSeek," said Alfred Wu, an expert on Chinese policymaking at Singapore's Lee Kuan Yew School of Public Policy. "Now everyone just endorses it."
The Chinese embrace comes as governments from South Korea to Italy remove DeepSeek from national app stores, citing privacy concerns.
"If DeepSeek becomes the go-to AI model across Chinese state entities, Western regulators might see this as another reason to escalate restrictions on AI chips or software collaborations," said Stephen Wu, an AI expert and founder of hedge fund Carthage Capital.
Further limits on advanced AI chips are a challenge that Liang has acknowledged.
"Our problem has never been funding," he told Waves in July. "It's the embargo on high-end chips."
(Additional reporting by Samuel Shen, Gu Li, Larissa Liao, Aditya Soni and Shanghai Newsroom; Editing by Brenda Goh and Katerina Ang)

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Ex-CIA officer running Gaza aid security advised Boston Consulting Group
Ex-CIA officer running Gaza aid security advised Boston Consulting Group

Middle East Eye

time4 hours ago

  • Middle East Eye

Ex-CIA officer running Gaza aid security advised Boston Consulting Group

Former CIA paramilitary officer Phil Reilly, who heads a private military company that is guarding Israel's newly set up food distribution sites in Gaza, was a senior advisor at the US consultancy that is the latest partner to withdraw from the controversial aid project. The Boston Consulting Group admitted last week that it was involved in developing the Gaza Humanitarian Foundation (GHF), but has since halted its involvement and distanced itself as 110 Palestinians have been killed and 583 wounded trying to access aid during GHF's 10 days of operations, according to government sources. But Reilly's role with BCG, which ended only six months ago, raises questions about whether the consultancy was also involved in developing the security side of the aid operation. Questions about the firm's role come as BCG announced on Thursday that it had fired two partners for 'unauthorised work' in relation to GHF. BCG said the partners 'failed to disclose the full nature of the work during the client acceptance process' and carried out subsequent unauthorised work in violation of company policies and protocols. New MEE newsletter: Jerusalem Dispatch Sign up to get the latest insights and analysis on Israel-Palestine, alongside Turkey Unpacked and other MEE newsletters The firm said the subsequent work had "lacked visible multi-lateral support" and that it has engaged outside counsel to investigate its involvement in the project. 'We deeply regret that in this situation we did not live up to our standards. We are committed to taking all necessary steps to address the issues identified in the ongoing investigation,' the company said. MEE asked BCG on Friday whether the firm was involved in the security operations to support GHF, what role it had played with the foundation and who had asked the firm to get involved initially. BCG directed MEE to its statement from Thursday which did not answer the questions. Consulting on Gaza A 29-year veteran of the CIA, Reilly served as a senior advisor at BCG for eight years starting in January 2017, just as US President Donald Trump's first term started, according to his LinkedIn account. It was early last year, while still holding his advisory role, that Reilly reportedly first began discussing Gaza aid with Israeli civilians in early 2024. Later in 2024, he worked on a study for Orbis, another US consultancy, which reportedly outlined a plan to outsource food aid delivery to private companies and foundations, the NYT reported. Reilly's advisory role with BCG ended in December, a month after Safe Reach Solutions (SRS), the private military company now operating in Gaza, was registered in Wyoming, a known US tax haven. Mercenary firm set to oversee Gaza aid for Israel goes on LinkedIn hiring spree Read More » Public records show that SRS's registered agent, as first reported by All-Source Intelligence, is the Wyoming-based wealth management fund, Two Oceans Trust LLC. But the public records fail to reveal many more details about SRS, including its funders. It was reported this week that McNally Capital, a Chicago-based private equity firm, has an "economic interest" in SRS, although the scale of the interest remains unclear. The lack of clarity about funding is also true of GHF which is registered with scant few other details in Delaware, another notorious US tax haven. SRS is understood to be the main company currently securing the food distribution sites that are part of Israeli and American moves to take control of aid distribution in the enclave, which have been beset by controversy. The UN and international aid agencies have sounded the alarm for weeks over concerns that the plans which have unfolded at pace failed to meet humanitarian principles and would encourage forced displacement of Palestinians. Hours before GHF was to start distributing aid in Gaza late last month, executive director Jake Wood resigned over concerns that it was impossible for the organisation to operate independently or adhere to strict humanitarian principles. Wood was replaced this week by Johnnie Moore, an evangelical leader who has advised Trump on interfaith issues. Moore has denied reports that Palestinians were killed and injured while seeking aid at GHF's sites in Gaza and said he was demanding results "with Silicon Valley precision".

Metaplanet Unveils Unprecedented $5.4B Bitcoin Investment Initiative
Metaplanet Unveils Unprecedented $5.4B Bitcoin Investment Initiative

Arabian Post

time8 hours ago

  • Arabian Post

Metaplanet Unveils Unprecedented $5.4B Bitcoin Investment Initiative

Tokyo-based investment firm Metaplanet has announced a groundbreaking plan to raise approximately $5.4 billion through the issuance of 555 million moving-strike warrants, aiming to significantly expand its Bitcoin holdings. This move represents the largest stock acquisition rights issuance in Japanese capital markets history and marks the first instance of such warrants being issued above market price. The company's ambitious strategy, dubbed the '555 Million Plan,' sets a target to acquire 210,000 Bitcoins by the end of 2027, equating to roughly 1% of the total Bitcoin supply. This initiative follows Metaplanet's earlier '21 Million Plan,' under which it aimed to hold 21,000 Bitcoins by 2026. Having already surpassed interim goals with 8,888 Bitcoins, placing it tenth globally among corporate Bitcoin holders, Metaplanet is now intensifying its efforts. The issuance of 555 million new shares through moving-strike warrants is designed to optimize capital raising with minimal dilution. If fully exercised, this issuance could generate an estimated ¥770 billion at an initial strike price of ¥1,388 per share. The funds raised will be primarily allocated to purchasing Bitcoin, with smaller portions earmarked for bond redemptions and income-generating strategies like selling put options. ADVERTISEMENT Metaplanet's CEO, Simon Gerovich, highlighted the significance of this move, stating that it represents Asia's largest-ever equity raise dedicated to Bitcoin. He emphasized the company's commitment to accelerating its Bitcoin strategy, aiming for 30,000 Bitcoins by the end of 2025, 100,000 by 2026, and reaching the 210,000 target by 2027. The company's approach leverages Japan's deep capital markets, where demand for regulated Bitcoin exposure remains strong. Metaplanet's stock, known for its liquidity, offers domestic investors an accessible, tax-advantaged vehicle to gain Bitcoin exposure—a notable advantage given Japan's lack of spot Bitcoin ETFs. This strategic move positions Metaplanet alongside other major corporate Bitcoin holders, drawing parallels to MicroStrategy's pioneering Bitcoin-backed securities approach. The firm's execution to date has been swift, with its previous capital raise of ¥102.8 billion fueling a 225.4% increase in Bitcoin holdings year-to-date. Now, Metaplanet aims to deliver an even more aggressive 600% Bitcoin yield by the end of 2025.

Multiply Group unites media assets under one brand with launch of Multiply Media Group
Multiply Group unites media assets under one brand with launch of Multiply Media Group

Al Etihad

time10 hours ago

  • Al Etihad

Multiply Group unites media assets under one brand with launch of Multiply Media Group

6 June 2025 14:37 ABU DHABI (ALETIHAD) Multiply Group, the Abu Dhabi-based investment holding company, has consolidated its media assets under a single unified brand — Multiply Media Group (MMG). The newly launched entity brings together BackLite Media, Viola Media, and Media 247 to form one of the UAE's largest and most premium out-of-home (OOH) media networks, with over 3,000 advertising units across the announcement was made at the World Out of Home Organization (WOO) Annual Congress in Mexico City, accompanied by a bold global digital media campaign that lit up cities and screens launch of MMG represents one of the most significant media consolidations in the UAE. It signals Multiply Group's broader strategy to redefine the media landscape through scale, innovation, and Bouazza, Group CEO and Managing Director of Multiply Group, described the move as a pivotal step: 'By bringing together market-leading media assets under a single AI and tech-driven group, we are reinforcing our commitment to long-term value creation and shareholder returns. MMG lays a strong foundation for our global ambitions and forward-looking investment strategy.'Out-of-home (OOH) media refers to advertising that reaches consumers outside of their homes. This includes formats such as billboards, transit ads, street furniture displays, and digital signage placed in high-traffic public spaces. As traditional media consumption continues to fragment, OOH has remained a resilient and evolving channel for advertisers to reach audiences where they work, commute, and socialise. With MMG, Multiply Group aims to capitalise on this high-impact medium by leveraging data, AI, and strategic network includes more than 75 premium assets along Dubai's Sheikh Zayed Road and benefits from long-term partnerships with major government entities such as Dubai's Road and Transport Authority (RTA) and Abu Dhabi's Department of Municipalities and Transport (DMT). These partnerships ensure MMG's access to high-visibility locations and institutional support as it scales across the Bicknell, CEO of Multiply Media Group, emphasised the transformative nature of the new venture. 'MMG is a powerhouse that unites some of the region's most strategic media assets under one bold vision. Our mandate is clear: deliver context at scale and reach audiences where it truly matters. MMG is engineered to be agile, data-led, and deeply integrated — enabling our clients to engage audiences with greater relevance and resonance than ever before.'MMG's formation also aligns with Multiply Group's recent strategic partnerships aimed at global expansion. These include a joint venture with Arabian Contracting Services Company (Al Arabia) to invest in international OOH markets, and a memorandum of understanding with Saudi Media Company (SMC). These moves highlight MMG's potential as a regional launchpad for cross-border media Hassan, Head of the Media and Communications Vertical at Multiply Group, noted the group's broader vision: 'Multiply Group's growth strategy has taken us from three integrated companies to a media powerhouse poised to redefine the regional landscape. Through MMG, we are embracing emerging trends such as the transformative role of AI and remain committed to investing in technologies that drive dynamic, innovative campaigns.'In addition to its core OOH portfolio, Multiply Group's media holdings include Yieldmo, a contextual mobile advertising platform, and Firefly, a leading digital taxi-top advertising firm in North America. With these assets, MMG is positioned to deliver holistic, multi-channel media solutions both regionally and internationally. Multiply Group is $7.2 billion investment holding company based in Abu Dhabi. It is part of International Holding Company, the most valuable holding company in the Middle East with a market cap of over $240 billion.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store