
HBM And Emerging Memory Technologies Enable AI Training And Inference
During a congressional hearing of the House of Representatives' Energy & Commerce Committee Subcommittee on Communications and Technology, Ronnie Vasishta, senior VP of telecom at Nvidia, said that mobile networks will be called upon to support a new kind of traffic: AI traffic. This AI traffic includes the delivery of AI services to the edge, or inferencing at the edge. Such growth in AI data could reverse the general trend towards lower growth in traffic on mobile networks.
Many AI-enabled applications will require mobile connectivity, including autonomous vehicles, smart glasses and generative AI services. He said that the transmission of this massive increase in data needs to be resilient, fit for purpose, and secure. Supporting this creation of data from AI will require large amounts of memory, particularly very high bandwidth memory such as HBM. This will result in great demand for memory that supports AI applications.
Micron announced that it is now shipping HBM4 memory to key customers for early qualification efforts. The Micron HBM4 provides up to 2.0TB/s bandwidth and 24GB capacity per 12-high die stack. The company says that its HBM4 uses its 1-beta DRAM node and advanced through-silicon via technologies, and has a highly capable built-in self-test. See image below.
Micron HBM4 Memory
HBM consists of stacks of DRAM dies with massively parallel interconnects that provide high bandwidth. These stacks are combined with GPUs, such as those from Nvidia. Placing this memory close to the processor allows training and inference of various AI models. Current GPUs use HBM3e, the current generation of HBM. At the March 2025 GTC in San Jose, Jensen Huang said that Micron HBM memory was being used in some of Nvidia's GPU platforms.
The manufacturers of HBM memories are SK hynix, Samsung and Micron, with SK hynix and Samsung providing the majority of supply and Micron coming in third. SK hynix was the first to announce HBM memory, in 2013, and it was adopted as an industry standard by JEDEC that same year. Samsung followed in 2016, and in 2020 Micron said that it would create its own HBM memory. All of these companies expect to be shipping HBM4 memories in volume sometime in 2026.
Numem, a company involved in magnetic random access memory applications, recently talked about how traditional memories used in AI applications, such as DRAM and SRAM, have limitations in power, bandwidth and storage density. The company said that processing performance has skyrocketed by 60,000X over the past 20 years but DRAM bandwidth has improved only 100X, creating a 'memory wall.'
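To put the bandwidth figure in perspective, the following is a rough back-of-the-envelope sketch using only the per-stack numbers quoted above (2.0TB/s and 24GB). It assumes a purely memory-bound inference step in which every byte of model weights is read once per generated token, and that bandwidth adds up linearly across stacks. Both are simplifications. The stack count and model size are hypothetical illustration values, and decimal units (1TB = 1000GB) are assumed.

```python
# Back-of-the-envelope HBM estimate. Per-stack figures come from the
# article; stack count and model size are hypothetical illustration values.

STACK_BANDWIDTH_GBPS = 2000.0   # 2.0 TB/s per HBM4 stack, in GB/s (decimal units)
STACK_CAPACITY_GB = 24.0        # capacity per 12-high stack


def full_read_time_ms(capacity_gb: float, bandwidth_gbps: float) -> float:
    """Time to stream the entire capacity once, in milliseconds."""
    return capacity_gb * 1000.0 / bandwidth_gbps


def max_tokens_per_second(model_gb: float, stacks: int) -> float:
    """Upper bound on tokens/s if each token requires reading all weights once.

    Assumes the workload is purely memory-bandwidth-bound and that
    bandwidth adds up linearly across stacks (both are simplifications).
    """
    capacity_gb = stacks * STACK_CAPACITY_GB
    if model_gb > capacity_gb:
        raise ValueError("model does not fit in the available HBM capacity")
    return stacks * STACK_BANDWIDTH_GBPS / model_gb


if __name__ == "__main__":
    # One stack read end to end: 24 GB at 2000 GB/s
    print(full_read_time_ms(STACK_CAPACITY_GB, STACK_BANDWIDTH_GBPS), "ms")
    # Hypothetical 160 GB of weights spread over 8 hypothetical stacks
    print(max_tokens_per_second(model_gb=160.0, stacks=8), "tokens/s upper bound")
```

Under these assumptions a single stack can be read end to end in 12 ms, which is why memory bandwidth, rather than raw compute, often sets the ceiling for inference throughput.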
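The 60,000X and 100X figures are easier to compare as yearly rates. The short sketch below converts each into the constant annual multiplier that would compound to that total over 20 years. Smooth compound growth is an assumption made for illustration and is not something the company stated.

```python
# Annualized growth implied by the article's 20-year figures.
# Assumes smooth compound growth, which is a simplification.

YEARS = 20
COMPUTE_GAIN = 60_000.0   # processing performance improvement (from the article)
DRAM_BW_GAIN = 100.0      # DRAM bandwidth improvement (from the article)


def annual_factor(total_gain: float, years: int) -> float:
    """Constant yearly multiplier that compounds to total_gain over `years`."""
    return total_gain ** (1.0 / years)


compute_per_year = annual_factor(COMPUTE_GAIN, YEARS)
dram_per_year = annual_factor(DRAM_BW_GAIN, YEARS)
gap = COMPUTE_GAIN / DRAM_BW_GAIN  # how far compute has outrun bandwidth

print(f"compute: {compute_per_year:.2f}x per year")
print(f"DRAM bandwidth: {dram_per_year:.2f}x per year")
print(f"cumulative gap: {gap:.0f}x")
```

On these numbers, compute grew roughly 1.73X per year against about 1.26X per year for DRAM bandwidth, leaving a cumulative 600X gap.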
The company says that its AI Memory Engine is a highly configurable memory subsystem IP that enables significant improvements in power efficiency, performance, intelligence, and endurance. It supports not only Numem's MRAM-based architecture but also third-party MRAM, RRAM, PCRAM, and flash memory.
Numem said that it has developed next-generation MRAM supporting die densities up to 1GB which can deliver SRAM-class performance with up to 2.5X higher memory density in embedded applications and 100X lower standby power consumption. The company says that its solutions are foundry-ready and production-capable today.
Coughlin Associates and Objective Analysis, in their Deep Look at New Memories report, predict that AI and other memory-intensive applications will decrease the costs and increase production of MRAM, RRAM and other emerging memory technologies. Some of these applications, including AI inference in embedded devices such as smart watches and hearing aids, are already using these memories.
These memory technologies are already available from major semiconductor foundries. They scale to smaller lithographic nodes than DRAM and SRAM, and because they are non-volatile, they need no refreshes and so consume less power. As a result, these memories allow more memory capacity and lower power consumption in space- and power-constrained environments. MRAM and RRAM are also being built into industrial, enterprise and data center applications.
The figure below shows our projections for the replacement of traditional memories (SRAM, DRAM, NOR and NAND flash) by these emerging memories. NOR and SRAM in particular, when used as embedded memories, are projected to be replaced by these new memories within the next decade as part of a future $100B memory market.
Projected replacement of conventional memories with new memories
AI will generate increased demand for memory to support training and inference. It will also increase the demand for data over mobile networks. This will drive demand for HBM memory but also increase demand for new emerging memory technologies.