DeepSeek: Smarter Software Vs. More Compute

07-05-2025

Daniel A. Keller, CEO and President of InFlux Technologies Limited. Cofounder of Flux.
When ChatGPT was released by OpenAI in 2022, it was the peak expression of AI chatbots built on large language models (LLMs). With an accessible interface and absolutely no need for external gadgets, it was the power of interactive AI in the palms of users, literally!
Barely five days after its launch, ChatGPT broke the 1 million user milestone. (For context, that took Facebook 10 months to achieve.) Of course, there were a few problems, like the occasional lags and hallucinations, but version after version, ChatGPT continued to expand its frontiers.
There were also apprehensions about the development cost of GPT-4, somewhere between $48 million and $71 million. But it was all completely justifiable. Sixteen thousand H100 GPUs don't come cheap, and salaries have to be paid.
Or was it?
Rise Of The Deep
On January 20, 2025, the world woke up to news that would change the trajectory of AI technology. A little-known Chinese company had launched DeepSeek R1, an AI with capabilities comparable to OpenAI's ChatGPT.
And the shocker?
The initial reports claimed it did it with fewer, cheaper and older GPUs at a development cost of only $5.6 million. The ripple effect sent shock waves across the markets. By the following Monday, Nvidia, the biggest supplier of AI GPU chips, had lost almost $600 billion in market value as investors started reconsidering their options. The Nasdaq index and companies like Microsoft and Alphabet also fell sharply. Within a week, DeepSeek had overtaken ChatGPT to become the most downloaded application on the Apple App Store.
But since then, DeepSeek has come under scrutiny, with the head of Google's DeepMind calling its claims "exaggerated" and one critic suggesting it actually cost DeepSeek over $1 billion to create its AI model.
Nevertheless, DeepSeek's arrival has caused a shift. The investment rationale for the supply chain had been quite simple: more spending meant better outcomes for AI.
Until now.
The Paradigm Shift
DeepSeek's story is exceptional for several reasons. First, due to the United States' efforts to stem the flow of advanced AI technology to competing nations, the Biden administration restricted the export of GPUs to China, limiting the availability of advanced AI GPUs like the A100 and the H100. As a result, DeepSeek presumably had to rely on less sophisticated but more available GPUs like the H800.
The ability of DeepSeek to turn this crippling limitation into one of the marvels of AI innovation raises a critical question: Are ingenuity and better software architecture a more sustainable alternative to advanced but expensive GPUs?
GPU availability (especially of advanced chips like the H100) is one of the rate-limiting steps for AI research and development; even in the U.S., Nvidia, the top producer of GPUs globally, continues to struggle to meet demand. A breakthrough demonstrating that companies and research labs can get more out of their computing power and cut costs is a game-changer for the entire industry. But how exactly did DeepSeek achieve this?
Flipping The Game
Before DeepSeek's emergence in AI, it had always been a game of who was bigger. Bigger financial investments translated into bigger LLMs, which in turn required more compute resources and, hopefully, delivered bigger innovative strides.
However, DeepSeek's approach was counterintuitive. Instead of slapping on more compute and developing bigger models, the Chinese company focused on optimizing for a more efficient use of available resources. This included enhancing its model abilities through reinforcement learning, leveraging improved software architecture and optimizing its algorithm.
Rather than overpowering prevailing challenges with sheer brute force, DeepSeek turned the game on its head. Early benchmarks showed it was 20 times more efficient and far less compute-intensive than its more prominent competitors.
Since it relied heavily on reinforcement learning, DeepSeek-R1 also reduced its reliance on large teams of human reviewers and on supervised fine-tuning, keeping operating costs to a minimum.
Another important choice DeepSeek made was its adoption of a mixture-of-experts (MoE) architecture. MoE uses multiple expert sub-models and a selective gating mechanism that activates only the most relevant parameters for each input. For context, the DeepSeek MoE framework comprises around 671 billion parameters; however, only about 37 billion of them (roughly 5.5%) are activated for any given token.
Picture a diverse team of seasoned experts across different disciplines. When needed, the gating mechanism dynamically selects the best combination of experts to solve the problem.
The result?
Dynamic routing lowers the amount of computation the model requires, because experts that are not selected for an input are never run. This approach also improves efficiency, promotes seamless scalability and supports progressive fine-tuning of different expert components for specific problems.
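To make the gating idea concrete, here is a minimal sketch of generic top-k expert routing in plain Python. It is an illustration under stated assumptions, not DeepSeek's implementation: the sizes (8 experts, 2 selected), the random weights and the helper names `make_expert` and `moe_forward` are all hypothetical, and production MoE layers add details such as load balancing that are omitted here.

```python
import math
import random

def softmax(scores):
    """Turn raw scores into weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def make_expert(seed, dim):
    """Hypothetical expert: a small random linear map (stand-in for a sub-network)."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]
    def expert(x):
        return [sum(w * xi for w, xi in zip(row, x)) for row in weights]
    return expert

def moe_forward(x, gate_weights, experts, top_k=2):
    """Route one input through only the top_k highest-scoring experts."""
    # The gate produces one relevance score per expert.
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in gate_weights]
    # Keep the indices of the top_k experts; the rest are never evaluated.
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:top_k]
    # Renormalise the gate scores over the chosen experts only.
    mix = softmax([scores[i] for i in chosen])
    output = [0.0] * len(x)
    for weight, idx in zip(mix, chosen):
        y = experts[idx](x)
        output = [o + weight * yi for o, yi in zip(output, y)]
    return output, chosen

# Toy sizes chosen for illustration only.
DIM, NUM_EXPERTS, TOP_K = 4, 8, 2
rng = random.Random(0)
experts = [make_expert(seed, DIM) for seed in range(NUM_EXPERTS)]
gate_weights = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

token = [0.5, -1.0, 0.25, 2.0]
output, chosen = moe_forward(token, gate_weights, experts, TOP_K)
active_fraction = len(chosen) / NUM_EXPERTS
print(f"experts used: {chosen}, fraction of experts active: {active_fraction:.0%}")
```

In this toy setup only 2 of the 8 experts run for each input, so the per-input compute is a quarter of what running every expert would cost, while the total parameter count stays the same.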
Implications For The Broader AI Industry
Compute-efficient AI solutions encourage democratization, allowing for dynamic innovations from different quarters. This could, in turn, promote cheaper access to AI resources, breaking Big Tech's monopoly on AI innovation.
DeepSeek's open-source nature provides a level playing field for researchers to engage in deep R&D without breaking the bank. Its lower energy requirements and smaller carbon footprint could also encourage more environmentally sustainable data center designs in the near future.
However, as revolutionary as the emergence of DeepSeek has been, there are also a few drawbacks (on top of the doubts about its claims).
First, while DeepSeek's open-source nature encourages technology sharing and participation, it also means malicious actors can repurpose it, raising fresh concerns about heightened misinformation, deepfakes and other sinister possibilities.
Another danger hinges on data sovereignty and the possibility of the Chinese government mining users' data.
Rounding Off
While DeepSeek has demonstrated capabilities that are comparable to OpenAI's ChatGPT in many ways, its long-term effect on AI technology, compute and market dynamics remains to be seen.
Whatever the future might hold, DeepSeek's successful deployment of a powerful open-source model has introduced a new level playing field for innovation in the AI industry. As this filters into the mainstream, its ripple effect could determine the face of the next iteration of artificial intelligence.
Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives.


Wearable Devices Collaborates with Leading Japanese E-Commerce Platform to Expand Mudra Wearable Devices in Tech-Savvy Market

Yahoo

27 minutes ago

Mudra wearable devices use neural sensors to enable touchless control of digital devices across Apple, Android and Windows platforms. Yokneam Illit, Israel, Aug. 20, 2025 (GLOBE NEWSWIRE) -- Wearable Devices Ltd. (Nasdaq: WLDS, WLDSW) (the 'Company' or 'Wearable Devices'), a technology growth company specializing in artificial intelligence ('AI')-powered touchless sensing wearables, recently announced a collaboration with Media Exceed Co., Ltd. ('Media Exceed'), a leading e-commerce company in Japan. Under this agreement, Media Exceed will serve as a non-exclusive reseller of the award-winning Mudra Band and Mudra Link, bringing Wearable Devices' innovative neural technology to Japan, one of the world's most tech-savvy consumer bases. This collaboration aims to enhance the availability of Wearable Devices' neural interface products in Japan, leveraging Media Exceed's robust e-commerce platform and market expertise. The collaboration supports both drop shipping and wholesale models, ensuring streamlined order fulfillment and localized customer support for Japanese buyers. See the Mudra Band and Mudra Link in action at 'This collaboration is a major step in our mission to revolutionize how people interact with technology,' said Asher Dahan, Chief Executive Officer of Wearable Devices. 'Japan's appetite for innovation makes it the perfect market to showcase our Mudra products, and we're thrilled to collaborate with Media Exceed to accelerate our global growth.' Shinya Kasuga, Chief Executive Officer of Media Exceed, said: 'We are eager to start working with Wearable Devices and bring the innovative Mudra products to the Japanese market. Their neural interface technology aligns perfectly with our vision to introduce cutting-edge solutions that enhance the way people interact with digital devices.' 
The Mudra Band, designed for Apple Watch users, and the Mudra Link, compatible with Android and Windows devices, utilize proprietary Surface Nerve Conductance sensors to detect neural signals from subtle finger movements. These signals are translated into intuitive commands, enabling touchless control of digital devices. The Mudra Link was recently showcased at CES® 2025, where it received an Innovation Award in the XR Technologies and Accessories category. Media Exceed will offer these products through its online platforms, providing Japanese consumers with direct access to Wearable Devices' innovative technology. The collaboration is expected to enhance user experience and satisfaction by combining advanced wearable technology with Media Exceed's customer-centric approach. About Wearable Devices Wearable Devices Ltd. (Nasdaq: WLDS, WLDSW) is a growth company pioneering human-computer interaction through its AI-powered neural input touchless technology. Leveraging proprietary sensors, software, and advanced AI algorithms, the Company's consumer products - the Mudra Band and Mudra Link - are defining the neural input category both for wrist-worn devices and for brain-computer interfaces. These products enable touch-free, intuitive control of digital devices using gestures across multiple operating systems. Operating through a dual-channel model of direct-to-consumer sales and enterprise licensing and collaborations, Wearable Devices empowers consumers with stylish, functional wearables for enhanced experiences in gaming, productivity, and extended reality (XR). In the business sector, the Company provides enterprise partners with advanced input solutions for immersive and interactive environments, from augmented reality/virtual reality/XR to smart environments. By setting the standard for neural input in the XR ecosystem, Wearable Devices is shaping the future of seamless, natural user experiences across some of the world's fastest-growing tech markets. 
Wearable Devices' ordinary shares and warrants trade on the Nasdaq Capital Market under the symbols 'WLDS' and 'WLDSW,' respectively. Forward-Looking Statements Disclaimer This press release contains 'forward-looking statements' within the meaning of Section 27A of the Securities Act of 1933, as amended, and Section 21E of the Securities Exchange Act of 1934, as amended, that are intended to be covered by the 'safe harbor' created by those sections. Forward-looking statements, which are based on certain assumptions and describe our future plans, strategies and expectations, can generally be identified by the use of forward-looking terms such as 'believe,' 'expect,' 'may,' 'should,' 'could,' 'seek,' 'intend,' 'plan,' 'goal,' 'estimate,' 'anticipate' or other comparable terms. For example, we are using forward-looking statements when we discuss the aim of our collaboration with Media Exceed, benefits and advantages of our products and technology, that this collaboration is a major step in our mission to revolutionize how people interact with technology, that collaboration with Media Exceed will accelerate our global growth and that the collaboration is expected to enhance user experience and satisfaction. All statements other than statements of historical facts included in this press release regarding our strategies, prospects, financial condition, operations, costs, plans and objectives are forward-looking statements. Forward-looking statements are neither historical facts nor assurances of future performance. Instead, they are based only on our current beliefs, expectations and assumptions regarding the future of our business, future plans and strategies, projections, anticipated events and trends, the economy and other future conditions. Because forward-looking statements relate to the future, they are subject to inherent uncertainties, risks and changes in circumstances that are difficult to predict and many of which are outside of our control. 
Our actual results and financial condition may differ materially from those indicated in the forward-looking statements. Therefore, you should not rely on any of these forward-looking statements. Important factors that could cause our actual results and financial condition to differ materially from those indicated in the forward-looking statements include, among others, the following: the trading of our ordinary shares or warrants and the development of a liquid trading market; our ability to successfully market our products and services; the acceptance of our products and services by customers; our continued ability to pay operating costs and ability to meet demand for our products and services; the amount and nature of competition from other security and telecom products and services; the effects of changes in the cybersecurity and telecom markets; our ability to successfully develop new products and services; our success establishing and maintaining collaborative, strategic alliance agreements, licensing and supplier arrangements; our ability to comply with applicable regulations; and the other risks and uncertainties described in our annual report on Form 20-F for the year ended December 31, 2024, filed on March 20, 2025 and our other filings with the Securities and Exchange Commission. We undertake no obligation to publicly update any forward-looking statement, whether written or oral, that may be made from time to time, whether as a result of new information, future developments or otherwise. Investor Relations Contact Michal Efraty IR@

Tech is showing signs of slowing down. Is there trouble ahead for Wall Street?

CNBC

27 minutes ago

Here's something you don't hear often on Wall Street: Tech had a bad day. Declines in Nvidia, Meta Platforms and Amazon, among others, sent the S&P 500 lower on Tuesday. The Nasdaq Composite suffered a worse fate, sliding 1.5%, its worst day since Aug. 1. The pullback comes as the major averages sit near record highs, perhaps encouraging investors to cash in chips on some of this year's major tech winners. Nvidia and Meta Platforms remain 38% and 43% higher for the year, respectively, even after the decline. The Technology Select Sector SPDR fund (XLK) is still up 12.2% in 2025 despite falling 1.8% Tuesday.
Taking some risk off the table is seldom a bad thing in investing, especially given the 2025 moves. What may be concerning is Tuesday's slide also coincided with investors appearing to load up on market hedges. Jeff Jacobson, 22V Research head of derivative strategy, noted Sunday that the put skew on the Invesco QQQ Trust, which tracks the Nasdaq-100 index, has jumped to a three-year high. Put skews signal that put options on the underlying asset are more expensive than call options. Or, expressed another way: Investors are scooping up downside protection on the QQQ at a fast pace.
"Should we see a pullback in tech that is more than just a shallow one, then the 200-day would be a likely level of support," Jacobson said. The QQQ closed Tuesday's session at $569.28. A move to its 200-day moving average of $515.93 would entail a 9.4% decline.
[Chart: QQQ year to date]
UBS is also urging clients to stay cautious near term but remains constructive on the AI trade long term thanks to robust earnings growth and sentiment indicators not pointing to investor euphoria just yet.
"We remain confident in the broader AI sector's long-term growth and resilience. We recommend investors seek balanced exposure across the AI value chain (infrastructure, semis and applications), with a preference for laggards offering a more attractive risk-reward balance. Investors seeking tech exposure may also consider structured investments, such as capital preservation and put-writing strategies, to take advantage of near term volatility," strategists at UBS wrote.

Y Combinator alum SRE.ai raises $7.2M for DevOps AI agents

TechCrunch

28 minutes ago

'It wasn't one big lightbulb; it was death by a thousand cuts,' Edward Aryee said when asked what led him and his co-founder, Raj Kadiyala, to launch SRE.ai.
The company is offering natural language AI agents that can perform complex enterprise DevOps workflows like continuous integration and testing. 'Instead of stitching together different low-code tools for enterprise applications like Salesforce, compared to products built on AWS, GCP, or Azure, teams can now move faster with context-driven, chat-like experiences that work across all of them,' Kadiyala, who is the company's CEO, told TechCrunch.
The duo thought of the product while working at Google Research and DeepMind. Aryee, CTO, said they noticed the divide between the infrastructure tooling they had access to versus what others who didn't work at Google had to use. Their engineer friends lamented tedious tasks, like untangling metadata conflicts. 'It gnawed at us,' Aryee said. He and Kadiyala realized: 'The next generation of DevOps experiences needed to be created.'
So they founded SRE.ai in 2024 to offer more modern tools to enterprises so they can avoid issues like metadata merge conflicts. Other competing players include Copado, Gearset, and Flosum. But Kadiyala said SRE.ai is different in that it works across multiple platforms spanning from AWS to ServiceNow.
The company officially came out of stealth on Wednesday and announced a $7.2 million seed round led by Salesforce Ventures and Crane Venture Partners.
Aryee said the onboarding process involves a setup where tools automatically connect with the user's integrations. The tool can then be customized for a user's needs like release pipelines, insight dashboards, and data monitoring. Meanwhile, SRE.ai has agents monitoring in the background to flag issues that need attention, such as security risks. The tool then offers recommendations on how to solve the problems. This leaves human IT teams free to tackle bigger, more meaningful projects, rather than being focused on tiresome tasks.
Kadiyala described the fundraising process as 'high conviction,' and noted the round was oversubscribed. The company took part in YC's Fall '24 cohort, which helped Aryee and Kadiyala meet their lead investors. They will use the fresh capital to hire AI engineers and Salesforce experts.
'We're seeing a lot of early traction, we're excited about building out our team to support new customers and extend the platform with new features,' he said.