F5 and NVIDIA to meet the needs of accelerated computing and AI


Tahawul Tech, 24-06-2025
F5 has announced new capabilities for F5 BIG-IP Next for Kubernetes accelerated with NVIDIA BlueField-3 DPUs and the NVIDIA DOCA software framework, underscored by customer Sesterce's validation deployment.
Sesterce is a leading European operator specialising in next-generation infrastructures and sovereign AI, designed to meet the needs of accelerated computing and artificial intelligence.
Extending the F5 Application Delivery and Security Platform, BIG-IP Next for Kubernetes running natively on NVIDIA BlueField-3 DPUs delivers high-performance traffic management and security for large-scale AI infrastructure, unlocking greater efficiency, control, and performance for AI applications. In tandem with the compelling performance advantages announced along with general availability earlier this year, Sesterce has successfully completed validation of the F5 and NVIDIA solution across a number of key capabilities, including the following areas:
– Enhanced performance, multi-tenancy, and security to meet cloud-grade expectations, initially showing a 20% improvement in GPU utilisation.
– Integration with NVIDIA Dynamo and KV Cache Manager to reduce latency for reasoning in large language model (LLM) inference systems and to optimise GPU and memory resources.
– Smart LLM routing on BlueField DPUs, running effectively with NVIDIA NIM microservices for workloads requiring multiple models, providing customers the best of all available models.
– Scaling and securing Model Context Protocol (MCP) including reverse proxy capabilities and protections for more scalable and secure LLMs, enabling customers to swiftly and safely utilise the power of MCP servers.
– Powerful data programmability with robust F5 iRules capabilities, allowing rapid customisation to support AI applications and evolving security requirements.
'Integration between F5 and NVIDIA was enticing even before we conducted any tests', said Youssef El Manssouri, CEO and Co-Founder at Sesterce. 'Our results underline the benefits of F5's dynamic load balancing with high-volume Kubernetes ingress and egress in AI environments. This approach empowers us to more efficiently distribute traffic and optimise the use of our GPUs while allowing us to bring additional and unique value to our customers. We are pleased to see F5's support for a growing number of NVIDIA use cases, including enhanced multi-tenancy, and we look forward to additional innovation between the companies in supporting next-generation AI infrastructure'.
Highlights of new solution capabilities include:
LLM Routing and Dynamic Load Balancing with BIG-IP Next for Kubernetes
With this collaborative solution, simple AI-related tasks can be routed to less expensive, lightweight LLMs that support generative AI, while advanced models are reserved for complex queries. This level of customisable intelligence also enables routing functions to leverage domain-specific LLMs, improving output quality and significantly enhancing customer experiences. F5's advanced traffic management ensures queries are sent to the most suitable LLM, lowering latency and improving time to first token.
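The routing idea described above can be illustrated with a minimal Python sketch. It is not F5's implementation, which runs on BlueField-3 DPUs. The model names, costs, keyword hints, and the 60-word threshold are all hypothetical, chosen only to show the three cases in the text: simple queries, complex queries, and domain-specific queries.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTarget:
    """A hypothetical backend model; names and costs are illustrative."""
    name: str
    cost_per_1k_tokens: float


# Assumed model tiers, not taken from the article.
LIGHTWEIGHT = ModelTarget("small-general", 0.1)
ADVANCED = ModelTarget("large-reasoning", 2.0)
DOMAIN_MODELS = {"legal": ModelTarget("legal-tuned", 1.0)}

# Assumed keyword hints that mark a query as complex.
COMPLEX_HINTS = ("prove", "step by step", "analyse", "compare", "derive")


def classify(prompt: str) -> str:
    """Label a prompt as a domain name, 'complex', or 'simple'."""
    text = prompt.lower()
    for domain in DOMAIN_MODELS:
        if domain in text:
            return domain
    if len(text.split()) > 60 or any(hint in text for hint in COMPLEX_HINTS):
        return "complex"
    return "simple"


def route(prompt: str) -> ModelTarget:
    """Pick the cheapest model tier that the classifier considers suitable."""
    label = classify(prompt)
    if label in DOMAIN_MODELS:
        return DOMAIN_MODELS[label]
    return ADVANCED if label == "complex" else LIGHTWEIGHT


if __name__ == "__main__":
    for question in (
        "What is the capital of France?",
        "Compare these two designs step by step",
        "Summarise this legal clause",
    ):
        print(question, "->", route(question).name)
```

A production router would likely replace the keyword classifier with a trained model, but the dispatch structure stays the same: classify first, then map the label to a backend.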
'Enterprises are increasingly deploying multiple LLMs to power advanced AI experiences—but routing and classifying LLM traffic can be compute-heavy, degrading performance and user experience', said Kunal Anand, Chief Innovation Officer at F5. 'By programming routing logic directly on NVIDIA BlueField-3 DPUs, F5 BIG-IP Next for Kubernetes is the most efficient approach for delivering and securing LLM traffic. This is just the beginning. Our platform unlocks new possibilities for AI infrastructure, and we're excited to deepen co-innovation with NVIDIA as enterprise AI continues to scale'.
Optimizing GPUs for Distributed AI Inference at Scale with NVIDIA Dynamo and KV Cache Integration
Earlier this year, NVIDIA Dynamo was introduced, providing a supplementary framework for deploying generative AI and reasoning models in large-scale distributed environments. NVIDIA Dynamo reduces the complexity of running AI inference in distributed environments by orchestrating tasks like scheduling, routing, and memory management to ensure seamless operation under dynamic workloads. Offloading specific operations from CPUs to BlueField DPUs is one of the core benefits of the combined F5 and NVIDIA solution. With F5, the Dynamo KV Cache Manager feature can intelligently route requests based on capacity. It uses Key-Value (KV) caching to accelerate generative AI use cases by retaining information from previous operations rather than requiring resource-intensive recomputation. From an infrastructure perspective, organisations storing and reusing KV cache data can do so at a fraction of the cost of using GPU memory for this purpose.
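As a rough illustration of capacity-based, cache-aware routing, the Python sketch below prefers a worker that already holds the KV cache for a request's prefix and otherwise falls back to the least-loaded worker. It does not use the NVIDIA Dynamo API. The `Worker` class, the `prefix_key` identifier, and the concurrency-based capacity metric are assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Worker:
    """A hypothetical inference worker with a simple concurrency limit."""
    name: str
    capacity: int                      # assumed metric: max concurrent requests
    active: int = 0
    cached_prefixes: set[str] = field(default_factory=set)

    def has_room(self) -> bool:
        return self.active < self.capacity


def pick_worker(workers: list[Worker], prefix_key: str) -> Worker:
    """Choose a worker with spare capacity, preferring a KV cache hit."""
    candidates = [w for w in workers if w.has_room()]
    if not candidates:
        raise RuntimeError("no worker has spare capacity")
    # Sort key: cache hits first (False sorts before True), then lowest load ratio.
    return min(
        candidates,
        key=lambda w: (prefix_key not in w.cached_prefixes, w.active / w.capacity),
    )


def dispatch(workers: list[Worker], prefix_key: str) -> str:
    """Assign the request and record that the worker now caches this prefix."""
    worker = pick_worker(workers, prefix_key)
    worker.active += 1
    worker.cached_prefixes.add(prefix_key)
    return worker.name


if __name__ == "__main__":
    pool = [
        Worker("a", capacity=2),
        Worker("b", capacity=2, active=1, cached_prefixes={"sys-prompt-1"}),
    ]
    print(dispatch(pool, "sys-prompt-1"))  # cache hit on the busier worker
    print(dispatch(pool, "sys-prompt-1"))  # that worker is now full
```

The trade-off shown here is that a cache hit is favoured over a lower load, since reusing cached state avoids recomputation, but only while the caching worker still has room.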
'BIG-IP Next for Kubernetes accelerated with NVIDIA BlueField-3 DPUs gives enterprises and service providers a single point of control for efficiently routing traffic to AI factories to optimize GPU efficiency and to accelerate AI traffic for data ingestion, model training, inference, RAG, and agentic AI', said Ash Bhalgat, Senior Director of AI Networking and Security Solutions, Ecosystem and Marketing at NVIDIA. 'In addition, F5's support for multi-tenancy and enhanced programmability with iRules continue to provide a platform that is well-suited for continued integration and feature additions such as support for NVIDIA Dynamo Distributed KV Cache Manager'.
Improved Protection for MCP Servers with F5 and NVIDIA
Model Context Protocol (MCP) is an open protocol developed by Anthropic that standardizes how applications provide context to LLMs. Deploying the combined F5 and NVIDIA solution in front of MCP servers allows F5 technology to serve as a reverse proxy, bolstering security capabilities for MCP solutions and the LLMs they support. In addition, the full data programmability enabled by F5 iRules promotes rapid adaptation and resilience for fast-evolving AI protocol requirements, as well as additional protection against emerging cybersecurity risks.
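To make the reverse proxy role concrete, here is a small Python sketch of the kind of request screening a proxy could apply before forwarding traffic to an MCP server. It is not F5 iRules code (iRules are written in Tcl) and not part of the F5 product. MCP messages are JSON-RPC 2.0, and the method names below are defined by the protocol, but the allowlist policy, the size limit, and the `screen_request` function are assumptions for illustration.

```python
import json

# Assumed policy values; a real deployment would tune these.
MAX_BODY_BYTES = 64 * 1024
ALLOWED_METHODS = {"initialize", "ping", "tools/list", "tools/call"}


def screen_request(raw: bytes) -> tuple[bool, str]:
    """Return (allowed, reason) for a raw request body sent to an MCP server."""
    if len(raw) > MAX_BODY_BYTES:
        return False, "body too large"
    try:
        msg = json.loads(raw)
    except ValueError:
        # Covers both malformed JSON and undecodable bytes.
        return False, "invalid JSON"
    if not isinstance(msg, dict) or msg.get("jsonrpc") != "2.0":
        return False, "not a JSON-RPC 2.0 message"
    method = msg.get("method")
    if not isinstance(method, str) or method not in ALLOWED_METHODS:
        return False, f"method not allowed: {method!r}"
    return True, "ok"


if __name__ == "__main__":
    good = json.dumps({"jsonrpc": "2.0", "id": 1, "method": "tools/list"}).encode()
    bad = json.dumps({"jsonrpc": "2.0", "id": 2, "method": "resources/read"}).encode()
    print(screen_request(good))
    print(screen_request(bad))
```

Screening at the proxy means a malformed or unexpected request is rejected before it reaches the MCP server or the LLM behind it.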
'Organisations implementing agentic AI are increasingly relying on MCP deployments to improve the security and performance of LLMs', said Greg Schoeny, SVP, Global Service Provider at World Wide Technology. 'By bringing advanced traffic management and security to extensive Kubernetes environments, F5 and NVIDIA are delivering integrated AI feature sets—along with programmability and automation capabilities—that we aren't seeing elsewhere in the industry right now'.
F5 BIG-IP Next for Kubernetes deployed on NVIDIA BlueField-3 DPUs is generally available now. For additional technology details and deployment benefits, go to www.f5.com and visit the companies at NVIDIA GTC Paris, part of this week's VivaTech 2025 event. Further details can also be found in a companion blog from F5.
Image Credit: F5