Latest news with #Wan2.1


Mid East Info
15-05-2025
- Business
- Mid East Info
Alibaba Introduces Open-Source Model for Video Creation and Editing - Middle East Business News and Information
All-in-one AI model, Wan2.1-VACE, designed to transform the video creation industry Alibaba has unveiled Wan 2.1-VACE (Video All-in-one Creation and Editing), its latest open-source model for video creation and editing. This innovative tool combines multiple video processing functions into a single model, to streamline the video creation process, boosting efficiency and productivity. As part of Alibaba's video generation large model – the Wan2.1 series – VACE is the first open-source model in the industry to provide a unified solution for various video generation and editing tasks. Wan2.1-VACE supports video generation with multi-modal inputs spanning text, image, and video while offering creators comprehensive video editing capabilities. These editing features include referencing images or frames, video repainting, modifying selected areas of the video and spatio-temporal extension, all of which enable the flexible combination of various tasks to enhance creativity. With this advanced tool, users can generate video containing specific interacting subjects based on image samples and bring static images to life by adding natural movement effects. They can also enjoy advanced video repainting functions such as pose transfer, motion control, depth control, and recolorization. The model also supports adding, modification or deletion to selective specific areas of a video without affecting the surroundings. It also allows for the extension of video boundaries while intelligently filling in content to enrich the visual experience. As an all-in-one AI model, Wan2.1-VACE delivers unparalleled versatility, enabling users to seamlessly combine multiple functions and unlock innovative potential. Users can turn a static image into video while controlling the movement of objects by specifying the motion trajectory. They can seamlessly replace characters or objects with specified references, animate referenced characters, control poses, and expand a vertical image horizontally to create a horizontal video while adding new elements through referencing. Innovative Technologies: Wan2.1-VACE leverages several innovative technologies, to take into account the needs of different video editing tasks during construction and design. Its unified interface, called Video Condition Unit (VCU), supports unified processing of multimodal inputs such as text, images, video, and masks. The model employs a Context Adapter structure that injects various task concepts using formalized representations of temporal and spatial dimensions. This innovative design enables it to flexibly manage a wide range of video synthesis tasks. Thanks to advancements in model architecture, Wan2.1-VACE can be widely applied in the rapid production of social media short videos, content creation for advertising and marketing, post-production and special effects processing in film and television, and for educational training video generation. Training video foundation models requires immense computing resources and vast amounts of high-quality training data. Open access helps lower the barrier for more businesses to leverage AI, enabling them to create high-quality visual content tailored to their needs, quickly and cost-effectively. Alibaba is open-sourcing the Wan2.1-VACE model in two versions; a 14-billion(B)-parameter and a 1.3-billion(B)-parameter. The models are available to download for free on Hugging Face and GitHub, as well as Alibaba Cloud's open-source community, ModelScope. As one of the earliest major global tech companies to open source its self-developed large-scale AI models, Alibaba open sourced four Wan2.1 models in February 2025 and, last month, a video generation model that supports video creation with start and end frames. To date, the models have attracted over 3.3 million downloads on Hugging Face and ModelScope. About Alibaba Group: Alibaba Group's mission is to make it easy to do business anywhere. The company aims to build the future infrastructure of commerce. It envisions that its customers will meet, work and live at Alibaba, and that it will be a good company that lasts for 102 years.


Zawya
15-05-2025
- Business
- Zawya
Alibaba introduces open-source model for video creation and editing
Alibaba has unveiled Wan 2.1-VACE (Video All-in-one Creation and Editing), its latest open-source model for video creation and editing. This innovative tool combines multiple video processing functions into a single model, to streamline the video creation process, boosting efficiency and productivity. As part of Alibaba's video generation large model – the Wan2.1 series – VACE is the first open-source model in the industry to provide a unified solution for various video generation and editing tasks. Wan2.1-VACE supports video generation with multi-modal inputs spanning text, image, and video while offering creators comprehensive video editing capabilities. These editing features include referencing images or frames, video repainting, modifying selected areas of the video and spatio-temporal extension, all of which enable the flexible combination of various tasks to enhance creativity. With this advanced tool, users can generate video containing specific interacting subjects based on image samples and bring static images to life by adding natural movement effects. They can also enjoy advanced video repainting functions such as pose transfer, motion control, depth control, and recolorization. The model also supports adding, modification or deletion to selective specific areas of a video without affecting the surroundings. It also allows for the extension of video boundaries while intelligently filling in content to enrich the visual experience. As an all-in-one AI model, Wan2.1-VACE delivers unparalleled versatility, enabling users to seamlessly combine multiple functions and unlock innovative potential. Users can turn a static image into video while controlling the movement of objects by specifying the motion trajectory. They can seamlessly replace characters or objects with specified references, animate referenced characters, control poses, and expand a vertical image horizontally to create a horizontal video while adding new elements through referencing. Innovative Technologies Wan2.1-VACE leverages several innovative technologies, to take into account the needs of different video editing tasks during construction and design. Its unified interface, called Video Condition Unit (VCU), supports unified processing of multimodal inputs such as text, images, video, and masks. The model employs a Context Adapter structure that injects various task concepts using formalized representations of temporal and spatial dimensions. This innovative design enables it to flexibly manage a wide range of video synthesis tasks. Thanks to advancements in model architecture, Wan2.1-VACE can be widely applied in the rapid production of social media short videos, content creation for advertising and marketing, post-production and special effects processing in film and television, and for educational training video generation. Training video foundation models requires immense computing resources and vast amounts of high-quality training data. Open access helps lower the barrier for more businesses to leverage AI, enabling them to create high-quality visual content tailored to their needs, quickly and cost-effectively. Alibaba is open-sourcing the Wan2.1-VACE model in two versions; a 14-billion(B)-parameter and a 1.3-billion(B)-parameter. The models are available to download for free on Hugging Face and GitHub, as well as Alibaba Cloud's open-source community, ModelScope. As one of the earliest major global tech companies to open source its self-developed large-scale AI models, Alibaba open sourced four Wan2.1 models in February 2025 and, last month, a video generation model that supports video creation with start and end frames. To date, the models have attracted over 3.3 million downloads on Hugging Face and ModelScope. About Alibaba Group Alibaba Group's mission is to make it easy to do business anywhere. The company aims to build the future infrastructure of commerce. It envisions that its customers will meet, work and live at Alibaba, and that it will be a good company that lasts for 102 years.


Techday NZ
15-05-2025
- Business
- Techday NZ
Alibaba releases VACE, an open-source AI video editor
Alibaba has introduced Wan2.1-VACE (Video All-in-one Creation and Editing), an open-source artificial intelligence model that integrates video generation and editing capabilities within a single multimodal framework. Wan2.1-VACE is part of Alibaba's Wan2.1 series and is the first open-source model reported to provide unified video generation and editing solutions for a variety of content creation tasks. The system is designed to support inputs from text, images and video, enabling creators to transform different forms of media into video content rapidly. The model's comprehensive editing tools include referencing still images or selected video frames, repainting video sequences, modifying chosen regions within a video, and spatio-temporal extensions, which collectively enable more flexible editing workflows. These editing functionalities are applicable across multiple sectors, such as short-form social media content, advertising and marketing production, post-production effects for film and television, and educational training resources. According to Alibaba, Wan2.1-VACE enables users to generate videos featuring specific interactions between subjects based on image samples. Static images can be converted into moving video sequences with realistic motion effects, providing creators with options for pose transfer, motion control, depth simulation, and recolouring. The system's selective editing functions allow additional content to be added, altered or removed from designated video regions without impacting other segments, while the video boundary extension feature can expand a video's spatial dimension and autonomously generate complementary content. "As an all-in-one AI model, Wan2.1-VACE delivers unparalleled versatility, enabling users to seamlessly combine multiple functions and unlock innovative potential. Users can turn a static image into video while controlling the movement of objects by specifying the motion trajectory. They can seamlessly replace characters or objects with specified references, animate referenced characters, control poses, and expand a vertical image horizontally to create a horizontal video while adding new elements through referencing," the company stated. The technical architecture of Wan2.1-VACE is designed around several new concepts, including the Video Condition Unit (VCU), which serves as a unified interface supporting the processing of different input modalities—text, images, video footage, and masks. The model also incorporates a Context Adapter structure, using representations of time and space to inject task-specific information across a range of video editing and synthesis applications. "Wan2.1-VACE leverages several innovative technologies, to take into account the needs of different video editing tasks during construction and design. Its unified interface, called Video Condition Unit (VCU), supports unified processing of multimodal inputs such as text, images, video, and masks," said the company. "The model employs a Context Adapter structure that injects various task concepts using formalised representations of temporal and spatial dimensions. This innovative design enables it to flexibly manage a wide range of video synthesis tasks." Alibaba stated that advances in the model architecture support quick and efficient content creation for social media, marketing, and entertainment, as well as training and education. The company also highlighted the resource intensity of training video foundation models, noting, "Training video foundation models requires immense computing resources and vast amounts of high-quality training data. Open access helps lower the barrier for more businesses to leverage AI, enabling them to create high-quality visual content tailored to their needs, quickly and cost-effectively." The Wan2.1-VACE model is being released in two versions—a 14-billion-parameter model and a 1.3-billion-parameter version. Both will be available for free download on platforms such as Hugging Face, GitHub, and Alibaba Cloud's ModelScope open-source community. Earlier this year, Alibaba released four other Wan2.1 models, followed in April by a video generation system supporting start and end frame creation. Collectively, these models have accumulated more than 3.3 million downloads on platforms including Hugging Face and ModelScope, highlighting growing interest in AI-driven video production tools.


Bloomberg
14-05-2025
- Business
- Bloomberg
Alibaba Sustains AI Frenzy With Second Video Upgrade in Weeks
Alibaba Group Holding Ltd. updated its video-generating model for the second time in one month as Chinese firms look to better compete against each other and American rivals in the white-hot AI race. Wan2.1-VACE supports video creation from inputs of various formats, including texts, images and videos, and allows users to edit the generated content, according to a company statement. Alibaba is offering VACE, or Video All-in-one Creation and Editing, in two open-source versions: a 14-billion-parameter iteration and one with 1.3 billion parameters. The Hangzhou-based company previously upgraded its video model based on the same Wan2.1 series in mid-April.


South China Morning Post
03-03-2025
- Business
- South China Morning Post
Alibaba's open-source Sora-like AI video model tops third-party rankings
Advertisement The cloud computing unit of Alibaba last week unveiled Wan 2.1, the latest iteration of its Sora-like video model, boasting performance that rivals other leading offerings. As of Monday, it ranked as the top video-generation model on the VBench Leaderboard, a benchmark test for video generation, and was the only open-source model among the top five. Alibaba, owner of the South China Morning Post, is doubling down on its AI strategy, investing significantly in models and computing power. The Chinese tech giant has budgeted at least 380 billion yuan (US$52.4 billion) for computing resources and AI infrastructure over the next three years, marking the largest-ever allocation by a private Chinese entity. Its shares rose 3.5 per cent in Hong Kong on Monday morning. Advertisement Alibaba Cloud has open-sourced four variants of the new model series, making them freely available for academics, researchers and commercial institutions worldwide to use and modify.