Red Hat & NVIDIA unveil hybrid cloud solution for AI agents

Techday NZ, 21-05-2025

Red Hat has announced integration with the NVIDIA Enterprise AI Factory validated design, aiming to facilitate the deployment of agentic AI systems across hybrid cloud environments.
The collaboration utilises NVIDIA RTX PRO Servers and NVIDIA Blackwell B200 systems, running on the Red Hat AI portfolio including Red Hat OpenShift AI, to support advanced generative and agentic AI workloads. Red Hat states that this move will provide enterprises with a flexible software foundation to deploy, operate and scale AI agents reliably.
Chris Wright, Chief Technology Officer and Senior Vice President, Global Engineering at Red Hat, emphasised the significance of this integration, saying: "AI agents represent the near-term future of enterprise AI, where fast-moving, independent models can speed through any number of tasks to free organizations for higher level innovation. Red Hat OpenShift AI is an ideal platform to run these workloads delivered by NVIDIA Enterprise AI Factory validated designs, offering the scale, flexibility and power necessary to deliver product-ready generative AI across the hybrid cloud."
Red Hat's integration will enable NVIDIA Blackwell architecture support across Red Hat AI platforms, with a reference architecture for the NVIDIA Enterprise AI Factory developed on Red Hat OpenShift AI. This architecture has been fully verified and performance-tested to address the growing demand for scaling AI agents. The company highlights OpenShift AI's capabilities for consistent deployment and management of AI agents, using vLLM-based inference, as well as enhanced observability and monitoring features.
The reference architecture makes use of NVIDIA NIM microservices accessible through the application catalogue, permitting organisations to assemble a fully validated AI software stack. This design allows operation of on-premises AI factories equipped with NVIDIA RTX PRO Servers and NVIDIA Blackwell B200 systems.
Justin Boitano, Vice President, Enterprise AI at NVIDIA, said: "NVIDIA and Red Hat are pioneering the future of enterprise AI by integrating Red Hat OpenShift AI with the NVIDIA Enterprise AI Factory validated design. Together, we're creating a more seamless, full-stack platform that IT can use as the foundation for transforming business data into actionable agentic AI intelligence."
Red Hat stresses that its enterprise AI approach combines open source technologies, the adaptability of a hybrid cloud model, and partnership with multiple stakeholders across the AI development ecosystem. The company collaborates with hardware providers and system integrators, aiming to support customers with comprehensive AI solutions that match specific business requirements.
This broader ecosystem-centric position, Red Hat notes, enables wider adoption of tools like the NVIDIA Enterprise AI Factory validated design and is intended to meet customers' needs at various stages of the AI development lifecycle, both in the cloud and on premises.

Related Articles

Illumio & NVIDIA team up to boost Zero Trust for infrastructure

Techday NZ

a day ago

Illumio has announced a new integration with NVIDIA BlueField Data Processing Units (DPUs) aimed at strengthening Zero Trust security in critical infrastructure environments. The collaboration brings together the Illumio breach containment platform with the NVIDIA BlueField networking platform, designed to provide security and operational efficiency across both Information Technology (IT) and Operational Technology (OT) environments.
This integration allows critical infrastructure organisations to deploy Illumio directly on NVIDIA BlueField, giving security teams a holistic view of network dependencies and permitting security controls at both host and network levels. Through this solution, organisations gain visibility into traffic, protect vital assets, and are able to use NVIDIA BlueField DPUs as Zero Trust enforcement points.
The architecture is intended to simplify breach containment for critical systems and help maintain operational continuity while meeting increasingly strict compliance requirements. In addition to current features, future capabilities are planned, including the use of Illumio's AI-driven insights to identify risks and attacker patterns. This will aim to enable rapid detection of threats within Industrial Control Systems (ICS) and OT settings.
The integration comes in the context of escalating threats and higher regulatory demands worldwide for improving cyber resilience and reducing risks in OT infrastructure. Organisations are facing challenges from sophisticated cyber threats and the need for solutions that can bridge IT and OT security requirements.
One of the key advantages of the integration is expanded visibility and policy enforcement for traffic within and between IT and OT layers. Using Illumio's labelling architecture, teams can view all traffic to and from OT systems equipped with NVIDIA BlueField, enabling a greater understanding of cross-infrastructure communications.
The integration is positioned to help organisations rapidly deploy Zero Trust security strategies within critical infrastructure. By extending segmentation to OT and ICS environments, organisations are able to decrease deployment complexity, accelerate the implementation process, and contain breaches by limiting lateral movement risks.
Illumio also highlights the compliance and resilience benefits of this integration. Organisations can identify assets, monitor traffic, identify threats, and enforce security policies across integrated IT and OT environments without compromising system performance or requiring significant architectural changes. The microsegmentation provided is designed to be consistent and reliable, supporting diverse environments and maintaining uptime and resilience.
Todd Palmer, Senior Vice President of Global Partner Sales and Alliances at Illumio, commented: "The integration between Illumio and NVIDIA will significantly strengthen security for cyber-physical systems and bring us closer to achieving our vision of a world without cyber disasters. Critical infrastructure is under threat like never before. Together with NVIDIA, we're making it easier for organisations to protect critical systems, ensure operational continuity, and meet stringent compliance requirements in an increasingly complex landscape."
Ofir Arkin, Senior Distinguished Architect for Cybersecurity at NVIDIA, added: "Cyber risks against critical infrastructure are more sophisticated and disruptive than ever, and lateral movement remains a key factor in successful attacks. Integrating the Illumio and NVIDIA BlueField platforms enables organisations to enhance visibility and control across IT and OT networks, reduce risk, contain attacks, and strengthen operational resilience."
Illumio is recognised as a vendor within the NVIDIA partner ecosystem and was named a leader in The Forrester Wave: Microsegmentation Solutions, Q3 2024. Its AI-powered security graph underpins the breach containment platform, which comprises Illumio Insights for AI cloud detection and response, and Illumio Segmentation for Zero Trust segmentation. The objective is to enable organisations to promptly identify risks and contain threats for a Zero Trust security posture.

Intel unveils Xeon 6 CPUs to boost GPU-driven AI systems

Techday NZ

23-05-2025

Intel has introduced three new Intel Xeon 6 series processors designed to enhance performance in GPU-accelerated artificial intelligence (AI) systems. The new processors feature Performance-cores (P-cores) and incorporate Intel's Priority Core Turbo (PCT) technology and Intel Speed Select Technology – Turbo Frequency (Intel SST-TF). These features allow for customisable CPU core frequencies, which are expected to support GPU performance across demanding AI workloads.
One of the newly released Xeon 6 processors, the Intel Xeon 6776P, serves as the host CPU in the NVIDIA DGX B300 AI-accelerated system. The Xeon 6776P manages, orchestrates, and supports the overall AI-accelerated system within the DGX B300 architecture. With an extensive memory capacity and robust bandwidth, the processor is engineered to cater to the expanding requirements of AI models and large datasets.
Karin Eibschitz Segal, Corporate Vice President and Interim General Manager of the Data Center Group at Intel, commented, "These new Xeon SKUs demonstrate the unmatched performance of Intel Xeon 6, making it the ideal CPU for next-gen GPU-accelerated AI systems. We're thrilled to deepen our collaboration with NVIDIA to deliver one of the industry's highest-performing AI systems, helping accelerate AI adoption across industries."
According to Intel, the pairing of Priority Core Turbo and Intel SST-TF brings significant advancements to AI system efficiency. The PCT technology allows high-priority cores to run at increased turbo frequencies for time-critical tasks, while lower-priority cores operate at base speed. This segregation ensures an optimal allocation of CPU resources, which is regarded as crucial for AI tasks that require significant sequential or serial processing. The intent is to enable CPUs to feed data to GPUs more rapidly, streamlining system effectiveness in demanding scenarios.
The new Xeon 6 processors with P-cores are built to offer high core counts and improved single-threaded performance. Each processor can reach up to 128 P-cores, aimed at facilitating a balanced distribution of intensive AI workloads across the available cores. Intel reports that memory speeds with the Xeon 6 processors can be 30% faster compared to competing solutions, particularly in high-capacity DRAM configurations. This is supported by both the latest MRDIMMs and Compute Express Link standards, which are designed to improve memory bandwidth for data-intensive applications.
The processors also offer greater input/output capabilities, providing up to 20% more PCIe lanes than earlier generations. This feature is intended to enhance data transfer rates for input/output-intensive workloads commonly found in enterprise AI and data centre applications.
Intel emphasises strong reliability and serviceability features in the design of these new CPUs, aiming for extended system uptime and reduced risk of disruptions to enterprise operations. Additional support comes from Intel Advanced Matrix Extensions, which enable FP16 precision arithmetic for efficient data preprocessing and critical CPU-driven tasks within AI environments.
The adoption of these processors is intended for enterprises looking to modernise infrastructure in anticipation of increasingly complex AI workloads. The energy efficiency and performance characteristics are targeted at a broad array of data centre and network applications.
