Latest news with #PEAK:AIO


Techday NZ
23-05-2025
- Business
- Techday NZ
PEAK:AIO unveils platform to tackle AI memory bottlenecks
PEAK:AIO has introduced a dedicated solution designed to unify KVCache acceleration and GPU memory expansion for large-scale artificial intelligence workloads, focusing on addressing memory bottlenecks in large language model inference and model development. The company's new platform, powered by CXL memory and integrated with Gen5 NVMe and GPUDirect RDMA, is positioned to provide up to 150 GB/sec sustained throughput with latency below five microseconds. This solution aims to support the growing demands of inference tasks, agentic systems, and model creation processes in AI deployments. Eyal Lemberger, Chief AI Strategist and Co-Founder of PEAK:AIO, described the current landscape of AI memory requirements as evolving beyond static prompts toward more complex workloads: "Whether you are deploying agents that think across sessions or scaling toward million-token context windows, where memory demands can exceed 500GB per model, this appliance makes it possible by treating token history as memory, not storage. It is time for memory to scale like compute has." As artificial intelligence models, particularly transformer-based architectures, increase in size and context, AI pipelines are encountering two main barriers: inefficiency with KVCache and saturation of GPU memory resources. According to the company, other vendors have attempted to adapt existing storage technologies or extend NVMe use to delay these limitations. The platform from PEAK:AIO, referred to as the 1U Token Memory Platform, adopts a token-centric architecture built specifically for scalable artificial intelligence. The company states that this approach allows KVCache reuse across multiple sessions, models, and nodes, along with expanded context windows for longer model history, GPU memory offload using CXL, and low latency access by means of RDMA over NVMe-oF. This platform diverges from traditional NVMe-based storage solutions by providing infrastructure that treats token memory as a primary resource rather than storing it as files. Teams are thus able to cache token history, attention maps, and streaming data at memory-like latency, which the company says is consistent with the performance requirements of advanced AI deployments. PEAK:AIO's system is designed to align specifically with NVIDIA's KVCache reuse and memory management frameworks, providing direct support for users employing TensorRT-LLM or Triton, which the company claims results in accelerated inference speeds with minimal integration work. Leveraging CXL for memory-class performance, the platform delivers token memory operations with characteristics similar to RAM. Lemberger commented further on the design philosophy behind the platform: "While others are bending file systems to act like memory, we built infrastructure that behaves like memory, because that is what modern AI needs. At scale, it is not about saving files; it is about keeping every token accessible in microseconds. That is a memory problem, and we solved it at embracing the latest silicon layer." The solution is fully software-defined and can be deployed on off-the-shelf servers. The company anticipates entering production by the third quarter. PEAK:AIO will be making early access and technical consultations available for organisations interested in integrating the platform within their own AI infrastructure. Mark Klarzynski, Co-Founder and Chief Strategy Officer at PEAK:AIO, highlighted the technical approach adopted by the company: "The big vendors are stacking NVMe to fake memory. We went the other way, leveraging CXL to unlock actual memory semantics at rack scale. This is the token memory fabric modern AI has been waiting for."
Yahoo
26-03-2025
- Business
- Yahoo
AI Chips Update - NVIDIA and PEAK:AIO Boost AI Infrastructure in EMEA Region
PEAK:AIO has collaborated with Scan Computers to enhance their AI infrastructure through a significant deployment of NVIDIA DGX Blackwell B200 clusters in the EMEA region. This development reflects a substantial investment in state-of-the-art AI capabilities, with PEAK:AIO's tailor-made AI data servers providing the necessary high-performance and efficient storage solutions. Key technologies, such as GPUDirect NVMe-oF and GPUDirect RDMA NFS, have been integrated to ensure optimal data movement between storage and GPU memory, accommodating complex AI workloads. This partnership underscores the growing demand for specialized storage solutions in the AI sector, allowing organizations to fully leverage the potential of advanced GPU technologies. last closed at $120.69 down 0.6%. In other market news, was a standout up 3.1% and ending the day at ₩214,500. Meanwhile, trailed, down 4.2% to close at $94.22, close to the 52-week low. NVIDIA's new Blackwell architecture may significantly enhance its AI offerings and data center revenue. Explore more about how this could impact NVIDIA's growth prospects by clicking here. Additionally, don't miss our Market Insights article, which urgently examines how DeepSeek's R1 model challenges the foundational AI chip market dynamics, potentially affecting major players like Nvidia. finished trading at $114.81 up 0.8%. closed flat at $160.15. closed flat at $24.20. Reveal the 51 hidden gems, such as Will Semiconductor, Applied Materials and Microchip Technology, among our AI Chip Stocks screener with a single click here. Seeking Other Investments? AI is about to change healthcare. These 23 stocks are working on everything from early diagnostics to drug discovery. The best part - they are all under $10b in market cap - there's still time to get in early. This article by Simply Wall St is general in nature. We provide commentary based on historical data and analyst forecasts only using an unbiased methodology and our articles are not intended to be financial advice. It does not constitute a recommendation to buy or sell any stock, and does not take account of your objectives, or your financial situation. We aim to bring you long-term focused analysis driven by fundamental data. Note that our analysis may not factor in the latest price-sensitive company announcements or qualitative material. Simply Wall St has no position in any stocks mentioned. Sources: Simply Wall St "PEAK:AIO Helps Scan Computers GPUaaS Lead EMEA AI with New NVIDIA DGX B200 Cluster and Next-Gen AI Storage" from PEAK:AIO on GlobeNewswire (published 25 March 2025) Companies discussed in this article include KOSE:A000660 NasdaqGS:AMD NasdaqGS:QCOM NasdaqGS:INTC NasdaqGS:NVDA and NasdaqGS:ENTG. Have feedback on this article? Concerned about the content? with us directly. Alternatively, email editorial-team@ Sign in to access your portfolio