
Red Hat Optimizes Red Hat AI to Speed Enterprise AI Deployments Across Models, AI Accelerators and Clouds
Red Hat AI Inference Server, validated models and integration of Llama Stack and Model Context Protocol help users deliver higher-performing, more consistent AI applications and agents
Red Hat, the world's leading provider of open source solutions, today announced Red Hat AI Inference Server, Red Hat AI third-party validated models and the integration of Llama Stack and Model Context Protocol (MCP) APIs, along with significant updates across the Red Hat AI portfolio, continuing to expand customer choice in enterprise AI. With these developments, Red Hat intends to further advance the capabilities organizations need to accelerate AI adoption while providing greater customer choice and confidence in generative AI (gen AI) production deployments across the hybrid cloud.
According to Forrester, open source software will be the spark for accelerating enterprise AI efforts.1 As the AI landscape grows more complex and dynamic, Red Hat AI Inference Server and third-party validated models provide efficient model inference and a tested collection of AI models optimized for performance on the Red Hat AI platform. Coupled with the integration of new APIs for gen AI agent development, including Llama Stack and MCP, Red Hat is working to tackle deployment complexity, empowering IT leaders, data scientists and developers to accelerate AI initiatives with greater control and efficiency.
Efficient inference across the hybrid cloud with Red Hat AI Inference Server:
The Red Hat AI portfolio now includes the new Red Hat AI Inference Server, providing faster, more consistent and cost-effective inference at scale across hybrid cloud environments. This key addition is integrated into the latest releases of Red Hat OpenShift AI and Red Hat Enterprise Linux AI, and is also available as a standalone offering, enabling organizations to deploy intelligent applications with greater efficiency, flexibility and performance.
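To make this concrete, here is a minimal sketch of how an application might query such a deployment, assuming the server exposes the OpenAI-compatible HTTP API common to vLLM-based inference servers; the endpoint URL, API key and model name below are illustrative placeholders, not Red Hat-documented values.

```python
# Minimal sketch: querying an OpenAI-compatible inference endpoint.
# Assumes a server is already running and reachable; the base_url,
# api_key and model id are hypothetical placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # illustrative local deployment
    api_key="EMPTY",                      # many self-hosted servers ignore the key
)

response = client.chat.completions.create(
    model="example/granite-model",        # illustrative model id
    messages=[{"role": "user",
               "content": "Summarize hybrid cloud inference in one sentence."}],
    max_tokens=128,
)
print(response.choices[0].message.content)
```

Because the interface is OpenAI-compatible, existing client code can typically be pointed at a self-hosted endpoint by changing only the base URL and model name.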
Tested and optimized models with Red Hat AI third-party validated models
Red Hat AI third-party validated models, available on Hugging Face, make it easier for enterprises to find the right models for their specific needs. Red Hat AI offers a collection of validated models, as well as deployment guidance to enhance customer confidence in model performance and outcome reproducibility. Select models are also optimized by Red Hat, leveraging model compression techniques to reduce size and increase inference speed, helping to minimize resource consumption and operating costs. Additionally, the ongoing model validation process helps Red Hat AI customers stay at the forefront of optimized gen AI innovation.
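As a rough sketch of how a team might pull one of these models for local evaluation, the snippet below uses the huggingface_hub library; the repository id is a hypothetical placeholder, not a confirmed catalog entry.

```python
# Minimal sketch: downloading a validated, compressed model from Hugging Face.
# The repo_id is an illustrative placeholder, not a confirmed catalog entry.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="RedHatAI/example-llm-quantized",  # hypothetical repository name
)
print(f"Model files downloaded to: {local_dir}")
```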
Standardized APIs for AI application and agent development with Llama Stack and MCP
Red Hat AI is integrating Llama Stack, initially developed by Meta, along with Anthropic's MCP, to provide users with standardized APIs for building and deploying AI applications and agents. Currently available in developer preview in Red Hat AI, Llama Stack provides a unified API to access inference with vLLM, retrieval-augmented generation (RAG), model evaluation, guardrails and agents, across any gen AI model. MCP enables models to integrate with external tools by providing a standardized interface for connecting APIs, plugins and data sources in agent workflows.
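For a sense of what a unified API means in practice, here is a minimal sketch using the upstream llama-stack-client Python library; the base URL and model id are illustrative, and the exact surface of Red Hat's developer preview may differ from upstream.

```python
# Minimal sketch: calling a Llama Stack server's unified inference API.
# Uses the upstream llama-stack-client library; base_url and model_id are
# illustrative, and Red Hat's developer preview may differ from upstream.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:8321")  # hypothetical endpoint

response = client.inference.chat_completion(
    model_id="example/llama-model",  # illustrative model id
    messages=[{"role": "user", "content": "What tools can you call?"}],
)
print(response.completion_message.content)
```

The same client surface covers RAG, evaluation, guardrails and agents, which is the point of standardizing on one API rather than wiring each capability separately.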
The latest release of Red Hat OpenShift AI (v2.20) delivers additional enhancements for building, training, deploying and monitoring both gen AI and predictive AI models at scale. These include:
An optimized model catalog (technology preview) that provides easy access to validated Red Hat and third-party models, enables the deployment of those models on Red Hat OpenShift AI clusters through the web console interface and manages their lifecycle with Red Hat OpenShift AI's integrated registry.
Distributed training through the Kubeflow Training Operator enables the scheduling and execution of InstructLab model tuning and other PyTorch-based training and tuning workloads distributed across multiple Red Hat OpenShift nodes and GPUs, and includes support for distributed RDMA networking acceleration and optimized GPU utilization to reduce costs (a job-submission sketch follows this list).
A feature store (technology preview), based on the upstream Kubeflow Feast project, provides a centralized repository for managing and serving data for both model training and inference, streamlining data workflows to improve model accuracy and reusability (a feature-definition sketch also follows below).
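To illustrate what scheduling such a distributed workload looks like, the sketch below submits a PyTorchJob custom resource, using the upstream Kubeflow Training Operator's v1 API, through the Kubernetes Python client; the namespace, container image and replica counts are placeholders, not an InstructLab-specific recipe.

```python
# Minimal sketch: submitting a distributed PyTorchJob to the Kubeflow
# Training Operator via the Kubernetes Python client. The namespace,
# container image and resource counts are illustrative placeholders.
from kubernetes import client, config

config.load_kube_config()  # or load_incluster_config() inside a cluster

pod_template = {"spec": {
    "restartPolicy": "OnFailure",
    "containers": [{
        "name": "pytorch",                      # required container name
        "image": "example.io/tuning:latest",    # hypothetical training image
        "resources": {"limits": {"nvidia.com/gpu": 1}},
    }],
}}

pytorch_job = {
    "apiVersion": "kubeflow.org/v1",
    "kind": "PyTorchJob",
    "metadata": {"name": "example-tuning-job", "namespace": "demo"},
    "spec": {
        "pytorchReplicaSpecs": {
            "Master": {"replicas": 1, "template": pod_template},
            "Worker": {"replicas": 3, "template": pod_template},  # multi-node/GPU
        }
    },
}

client.CustomObjectsApi().create_namespaced_custom_object(
    group="kubeflow.org", version="v1", namespace="demo",
    plural="pytorchjobs", body=pytorch_job,
)
```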
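And for a flavor of the feature store item above, here is a minimal sketch using the upstream Feast Python API; the entity, feature view and data source names are illustrative.

```python
# Minimal sketch: defining and reading a feature with the upstream Feast API.
# Entity, feature view and data source names are illustrative placeholders.
from feast import Entity, FeatureStore, FeatureView, Field, FileSource
from feast.types import Float32

customer = Entity(name="customer", join_keys=["customer_id"])

source = FileSource(
    path="data/customer_stats.parquet",   # hypothetical offline data
    timestamp_field="event_timestamp",
)

customer_stats = FeatureView(
    name="customer_stats",
    entities=[customer],
    schema=[Field(name="avg_order_value", dtype=Float32)],
    source=source,
)

# At serving time, the same definitions back low-latency online lookups,
# assuming a configured feature repository in the working directory:
store = FeatureStore(repo_path=".")
features = store.get_online_features(
    features=["customer_stats:avg_order_value"],
    entity_rows=[{"customer_id": 1001}],
).to_dict()
print(features)
```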
Red Hat Enterprise Linux AI 1.5 brings new updates to Red Hat's foundation model platform for developing, testing and running large language models (LLMs). Key features in version 1.5 include:
Google Cloud Marketplace availability, expanding customer choice for running Red Hat Enterprise Linux AI in public cloud environments, alongside AWS and Azure, to help simplify the deployment and management of AI workloads on Google Cloud.
Enhanced multi-language capabilities for Spanish, German, French and Italian via InstructLab, allowing for model customization using native scripts and unlocking new possibilities for multilingual AI applications. Users can also bring their own teacher models for greater control over model customization and testing for specific use cases and languages, with future support planned for Japanese, Hindi and Korean.
The Red Hat AI InstructLab on IBM Cloud service is also now generally available. This new cloud service further streamlines the model customization process, improving scalability and the user experience and empowering enterprises to use their unique data with greater ease and control.
Red Hat's vision: Any model, any accelerator, any cloud.
The future of AI must be defined by limitless opportunity, not constrained by infrastructure silos. Red Hat sees a horizon where organizations can deploy any model, on any accelerator, across any cloud, delivering an exceptional, more consistent user experience without exorbitant costs. To unlock the true potential of gen AI investments, enterprises require a universal inference platform: a standard for more seamless, high-performance AI innovation, both today and in the years to come.
Red Hat Summit:
Join the Red Hat Summit keynotes to hear the latest from Red Hat executives, customers and partners: Modernized infrastructure meets enterprise-ready AI — Tuesday, May 20, 8-10 a.m. EDT (YouTube)
Hybrid cloud evolves to deliver enterprise innovation — Wednesday, May 21, 8-9:30 a.m. EDT (YouTube)
Supporting Quotes:
Joe Fernandes, vice president and general manager, AI Business Unit, Red Hat
'Faster, more efficient inference is emerging as the newest decision point for gen AI innovation. Red Hat AI, with enhanced inference capabilities through Red Hat AI Inference Server and a new collection of validated third-party models, helps equip organizations to deploy intelligent applications where they need to, how they need to and with the components that best meet their unique needs.'
Michele Rosen, research manager, IDC
'Organizations are moving beyond initial AI explorations and are focused on practical deployments. The key to their continued success lies in the ability to be adaptable with their AI strategies to fit various environments and needs. The future of AI not only demands powerful models, but models that can be deployed with agility and cost-effectiveness. Enterprises seeking to scale their AI initiatives and deliver business value will find this flexibility absolutely essential.'
About Red Hat:
Red Hat is the open hybrid cloud technology leader, delivering a trusted, consistent and comprehensive foundation for transformative IT innovation and AI applications. Its portfolio of cloud, developer, AI, Linux, automation and application platform technologies enables any application, anywhere—from the datacenter to the edge. As the world's leading provider of enterprise open source software solutions, Red Hat invests in open ecosystems and communities to solve tomorrow's IT challenges. Collaborating with partners and customers, Red Hat helps them build, connect, automate, secure and manage their IT environments, supported by consulting services and award-winning training and certification offerings.
Forward-Looking Statements:
Except for the historical information and discussions contained herein, statements contained in this press release may constitute forward-looking statements within the meaning of the Private Securities Litigation Reform Act of 1995. Forward-looking statements are based on the company's current assumptions regarding future business and financial performance. These statements involve a number of risks, uncertainties and other factors that could cause actual results to differ materially. Any forward-looking statement in this press release speaks only as of the date on which it is made. Except as required by law, the company assumes no obligation to update or revise any forward-looking statements.