logo
Alibaba Cloud Releases Qwen2.5-Omni-7B: An End-to-end Multimodal AI Model

Alibaba Cloud Releases Qwen2.5-Omni-7B: An End-to-end Multimodal AI Model

Mid East Info10-04-2025
Alibaba Cloud has launched Qwen2.5-Omni-7B, a unified end-to-end multimodal model in the Qwen series. Uniquely designed for comprehensive multimodal perception, it can process diverse inputs, including text, images, audio, and videos, while generating real-time text and natural speech responses. This sets a new standard for optimal deployable multimodal AI for edge devices like mobile phones and laptops.
Despite its compact 7B-parameter design, Qwen2.5-Omni-7B delivers uncompromised performance and powerful multimodal capabilities. This unique combination makes it the perfect foundation for developing agile, cost-effective AI agents that deliver tangible value – especially intelligent voice applications. For example, the model could be leveraged to transform lives by helping visually impaired users navigate environments through real-time audio descriptions, offer step-by-step cooking guidance by analyzing video ingredients, or power intelligent customer service dialogues that really understand customer needs.
The model is now open-sourced on Hugging Face and GitHub, with additional access via Qwen Chat and Alibaba Cloud's open-source community ModelScope. Over the past years, Alibaba Cloud has made over 200 generative AI models open-source.
High Performance Driven by Innovative Architecture:
Qwen2.5-Omni-7B delivers remarkable performance across all modalities, rivaling specialized single-modality models of comparable size. Notably, it sets a new benchmark in real-time voice interaction, natural and robust speech generation, and end-to-end speech instruction following.
Its efficiency and high performance stem from its innovative architecture, including Thinker-Talker Architecture, which separates text generation (through Thinker) and speech synthesis (through Talker) to minimize interference among different modalities for high-quality output; TMRoPE (Time-aligned Multimodal RoPE), a position embedding technique to better synchronize the video inputs with audio for coherent content generation; and Block-wise Streaming Processing, which enables low-latency audio responses for seamless voice interactions.
Outstanding Performance Despite Compact Size:
Qwen2.5-Omni-7B was pre-trained on a vast, diverse dataset, including image-text, video-text, video-audio, audio-text, and text data, ensuring robust performance across tasks.
With the innovative architecture and high-quality pre-trained dataset, the model excels in following voice command, achieving performance levels comparable to pure text input. For tasks that involve integrating multiple modalities, such as those evaluated in OmniBench – a benchmark that assesses models' ability to recognize, interpret, and reason across visual, acoustic, and textual inputs – Qwen2.5-Omni achieves state-of-the-art performance.
Qwen2.5-Omni-7B also demonstrates high performance on robust speech understanding and generation capabilities through in-context learning (ICL). Additionally, after reinforcement learning (RL) optimization, Qwen2.5-Omni-7B showed significant improvements in generation stability, with marked reductions in attention misalignment, pronunciation errors, and inappropriate pauses during speech response.
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Alibaba Cloud Collaborates with University of Birmingham Dubai to Enhance Digital Skills Across the MEA Region
Alibaba Cloud Collaborates with University of Birmingham Dubai to Enhance Digital Skills Across the MEA Region

Mid East Info

time18 hours ago

  • Mid East Info

Alibaba Cloud Collaborates with University of Birmingham Dubai to Enhance Digital Skills Across the MEA Region

Inaugural Internship Programme in the UAE Region Aimed at Advancing Digital Innovation and Developing Industry-Ready Talent Dubai, United Arab Emirates, August 2025 — Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, today announced a collaboration with the University of Birmingham Dubai through a newly signed Memorandum of Understanding (MoU). The partnership will introduce advanced curriculum in cloud computing and artificial intelligence (AI) for students and educators at the University of Birmingham Dubai campus. As part of this initiative, Alibaba Cloud also launched its first internship programme in the UAE region, with the University of Birmingham in Dubai being the pioneer university for the program. Through this collaboration, students and faculty from the University of Birmingham Dubai will gain deeper insights into cloud and AI technologies by participating in advanced technology workshops, extensive training sessions available both online and offline, and various certification courses provided by Alibaba Cloud. Additionally, Alibaba Cloud introduced its 'Future Star' internship initiative in the United Arab Emirates, with the University of Birmingham Dubai as its inaugural academic partner. This programme offers a vibrant platform for talent development, delivering interns comprehensive training, hands-on experience, and exclusive mentorship spanning three to six months. Designed to provide a thorough understanding of multiple roles within Alibaba Cloud, 'Future Star' ensures that interns are well-equipped to contribute effectively to the cloud computing industry in the digital age. ''We are thrilled to partner with the University of Birmingham Dubai to nurture the next-generation digital skills in the region. The University of Birmingham Dubai, renowned for its dedication to research and education in technology and digital fields, brings a wealth of expertise and visionary thinking to the collaboration. Through this partnership, we are committed to equipping their students and faculties to acquire crucial digital insights, allowing them to engage more deeply with cutting-edge technologies and methodologies. Together, we are poised to unlock numerous opportunities in our rapidly digitalizing world, ensuring that students, educators, and industry professionals alike can thrive in the evolving digital landscape.' said Eric Wan, General Manager of the Middle East, Turkey and Africa, Alibaba Cloud Intelligence. Professor Yusra Mouzughi, Provost of the University of Birmingham Dubai, said: 'We are proud to partner with Alibaba Cloud to provide our students with direct access to real-world learning experiences and cutting-edge technologies in cloud computing and AI. This collaboration reflects our commitment to equipping future graduates with the digital skills and global industry exposure needed to thrive in an increasingly data-driven world. The internship and academic enrichment opportunities offered through this partnership will play a pivotal role in shaping industry-ready talent from our campus here in Dubai to join leading institutions globally.' The freshly inked partnership is yet another demonstration of Alibaba Cloud's long-term commitment to creating a sustainable pool of digital talent for the Middle East region. Alibaba Cloud has been a leading provider of cloud services to local private companies and public institutions in the UAE and the wider Middle East region since 2016. It has a robust local ecosystem in the MEA region with partners covering banking, financial services and insurance, media, public sectors, and customers locally across different sectors. About Alibaba Cloud: Established in 2009, Alibaba Cloud ( is the digital technology and intelligence backbone of Alibaba Group. It offers a complete suite of cloud services to customers worldwide, including elastic computing, database, storage, network virtualization services, large-scale computing, security, big data analytics, machine learning and artificial intelligence (AI) services. Alibaba has been named the leading IaaS provider in Asia Pacific by revenue in U.S. dollars since 2018, according to Gartner. It has also maintained its position as one of the world's leading public cloud IaaS service providers since 2018, according to IDC. About University of Birmingham Dubai: The University of Birmingham is ranked amongst the world's top 100 institutions, its work brings people from across the world to Birmingham, including researchers and teachers and more than 8,000 international students from over 150 countries. The University of Birmingham was established by Queen Victoria in 1900 as Great Britain's first civic university, where students from all religions and backgrounds were accepted on an equal basis. The University is renowned for its research excellence and its researchers have received 10 Nobel Prizes. From pioneering organ transplants, discovering gravitational waves and furthering understanding of Shakespeare, to developing cures for cancer, advances in robotics and revealing the structure of DNA, the University has been at the forefront of some of the most ground-breaking discoveries of the last 100 years. The University of Birmingham Dubai opened a purpose-built campus in Dubai International Academic City in 2022, which demonstrates the University of Birmingham's long-term commitment to contributing to UAE society – through in-country partnership in education and areas of research strength that support the National Agenda.

Alibaba Cloud to Power First Youth Olympic Games in Africa to Boost Efficiency and Engagement - Middle East Business News and Information
Alibaba Cloud to Power First Youth Olympic Games in Africa to Boost Efficiency and Engagement - Middle East Business News and Information

Mid East Info

time4 days ago

  • Mid East Info

Alibaba Cloud to Power First Youth Olympic Games in Africa to Boost Efficiency and Engagement - Middle East Business News and Information

Dakar 2026's core digital services to be deployed on Alibaba Cloud's digital infrastructure Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, today announced to power the Summer Youth Olympic Games Dakar 2026 ('Dakar 2026') – the first Olympic sporting event to be held in Africa – with its proven cloud and AI technologies. Dakar 2026 as the 4th Summer Youth Olympic Games will take place in Senegal, the first country on the African continent to be awarded the honor of hosting an Olympic sports event. As the official cloud service provider of the IOC, Alibaba Cloud is committed to supporting the digital transformation of the Olympic Games, delivering a future-ready digital foundation that enhances the operational efficiency and fan engagement of the Youth Olympic Games. The initiative was announced during a signing ceremony in Hangzhou, China among Alibaba Cloud, the Dakar 2026 Youth Olympic Games Organizing Committee (YOGOC) and the International Olympic Committee (IOC). Antoine Azokly, Head of Youth Olympic Games Technology & Energy at International Olympic Committee said: 'As a result of our ongoing partnership, the integration of Alibaba Cloud's proven AI and cloud technology into Dakar 2026 exemplifies our shared commitment to making the Olympic events more efficient, sustainable and engaging. This collaboration will not only benefit the Youth Olympic Games but also leave a lasting digital legacy for sport in Africa.' 'As we prepare to host Africa's first Olympic event, partnering with Alibaba Cloud marks a crucial step in our journey to deliver a technologically advanced and seamlessly operated Youth Olympic Games,' said Ibrahima Wade, General Coordinator of Dakar 2026 Youth Olympic Games Organizing Committee, 'The implementation of Alibaba Cloud's digital technologies across our core services will not only ensure efficient Games operations but also create a lasting technological legacy that will benefit Senegal and the African sporting community long after the Games conclude.' Under the partnership, the Dakar 2026 Organizing Committee will deploy Alibaba Cloud's Apsara Stack, a comprehensive private cloud solution to build a secure, scalable and high-performance infrastructure for the Summer Youth Olympic Games Dakar 2026. The platform will serve as the core IT infrastructure, hosting digital applications and services required for the planning, operation, logistics and post-event activities for the Dakar 2026, enabling enhanced fan experiences and streamlined event logistics Notably, Alibaba Cloud will support Dakar 2026 with digital flame services, leveraging its latest AI and cloud technologies to create dynamic, immersive visual experiences for global audiences and sports fans. The Summer Youth Olympic Games Dakar 2026 will be held in Senegal from October 31 to November 13, across three host cities: Dakar, Diamniadio, and Saly. The Games will bring together 2700 world's best young athletes, with a maximum age of 17. A total of 35 sports will feature on the YOG programme, including 25 competition sports and 10 engagement sports. About Alibaba Cloud: Established in 2009, Alibaba Cloud is the digital technology and intelligence backbone of Alibaba Group. It offers a complete suite of cloud services to customers worldwide, including elastic computing, database, storage, network virtualization services, large-scale computing, security, management and application services, big data analytics, a machine learning platform and IoT services. Alibaba maintained its position as the third leading public cloud IaaS service provider globally since 2018, according to IDC. Alibaba is the world's third leading and Asia Pacific's leading IaaS provider by revenue in U.S. dollars since 2018, according to Gartner.

Alibaba Cloud Named a Leader in Serverless Development Platforms Report - Middle East Business News and Information
Alibaba Cloud Named a Leader in Serverless Development Platforms Report - Middle East Business News and Information

Mid East Info

time6 days ago

  • Mid East Info

Alibaba Cloud Named a Leader in Serverless Development Platforms Report - Middle East Business News and Information

Demonstrates strategic strength in serverless development through sustained innovation Hangzhou, China, August, 2025 – Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, has been named a Leader in The Forrester Wave™: Serverless Development Platforms, Q2 2025 report. The report assessed 11 vendors over six months, evaluating them across 21 criteria, including developer experience, partner ecosystem, API and event-driven integration, AI application development, and vision. Alibaba Cloud, with its Function Compute and Serverless App Engine product capabilities, achieved the highest score possible (5 points) in 9 out of 21 criteria including initialization and deployment, workload flexibility, observability, AI application development, innovation, among others. Jiangwei Jiang, Vice President, General Manager of Infrastructure Products, Alibaba Cloud, said: 'For us, this recognition from Forrester reflects Alibaba Cloud's continued focus on advancing serverless development. The development of AI applications remains a priority in our serverless solutions, as we strive to combine innovative cloud technologies with reliable customer support—helping businesses of all sizes adopt the latest advancements for their growth.' The report stated that Alibaba Cloud's platforms, including Function Compute and Serverless App Engine, deliver scalable event-driven computing with strong integration across the Alibaba ecosystem. As a market front-runner in China and the broader APAC region, Alibaba Cloud combines localized innovation with broad enterprise adoption. According to the report, Alibaba Cloud demonstrates strategic strength in serverless development through sustained innovation, including a commitment to open source and reinforcement of ecosystem growth. In terms of capabilities, Alibaba Cloud offers one of the most comprehensive serverless platforms in the market, with strong capabilities across initialization, deployment, and runtime flexibility. AI application development was a key focus area, with native support for model deployment and event-driven inference workflows. Constant Innovation in Serverless Solutions: Launched in 2017, Alibaba Cloud's Function Compute is a fully managed, event-driven compute service which alleviates users from managing their own infrastructure. Its secure and stable, pay-as-you-go platform is designed to simplify the computing experience to enable faster development and iteration of business logic and core code. Function Compute powers Alibaba Cloud's generative AI development platform Model Studio and open-source ModelScope Community with model inference and training support, as well as elastic invocation capabilities for Agent and MCP services. Alibaba Cloud Serverless App Engine (SAE) is the industry's first application-oriented serverless PaaS, providing a cost-effective and highly efficient one-stop application hosting solution. This Kubernetes-based cloud product combines the serverless architecture & the microservice model, allowing users to deploy an online application in any programming language to SAE within seconds by using source code, a code package, or a Docker image. Alibaba Cloud serverless solutions has supported over 10,000 enterprises worldwide across sectors including e-commerce, manufacturing, education, media and entertainment, internet, gaming, among others. About Alibaba Cloud: Established in 2009, Alibaba Cloud is the digital technology and intelligence backbone of Alibaba Group. It offers a complete suite of cloud services to customers worldwide, including elastic computing, database, storage, network virtualization services, large-scale computing, security, big data analytics, machine learning and artificial intelligence (AI) services. Alibaba has been named the leading IaaS provider in Asia Pacific by revenue in U.S. dollars since 2018, according to Gartner. It has also maintained its position as one of the world's leading public cloud IaaS service providers since 2018, according to IDC.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store