A new contender has entered the AI video generation arena, and it’s making waves. Alibaba has open-sourced Wan2.1, a powerful AI model capable of generating videos from both text and images. Boasting impressive performance and accessibility, Wan2.1 is poised to empower developers and researchers alike.
Alibaba’s Wan2.1 is an AI video generation model that stands out for its robust visual generation capabilities. It supports both text-to-video and image-to-video tasks, catering to a wide range of creative needs. The model comes in two sizes:
- Professional Version (14B parameters): Designed for complex motion generation and physics modeling, this version delivers exceptional performance.
- Express Version (1.3B parameters): Runs on consumer-grade graphics cards thanks to its low memory requirements, making it well suited to secondary development and academic research.
The Wan2.1 model is built on a causal 3D VAE and video Diffusion Transformer architecture, enabling efficient spatiotemporal compression and long-range dependency modeling. The 14B version achieved a remarkable score of 86.22% on the VBench benchmark, surpassing other leading models such as Sora, Luma, and Pika.
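For intuition, here is a minimal sketch of what that spatiotemporal compression buys: the causal VAE shrinks a raw video into a much smaller latent grid before the Diffusion Transformer ever sees it. The 4x temporal and 8x spatial strides and the 16 latent channels below are assumptions based on common video-VAE designs, not confirmed Wan2.1 internals.

```python
# Illustrative only: how a causal 3D VAE's spatiotemporal compression reduces
# the grid the Diffusion Transformer must denoise.
# The 4x temporal / 8x spatial strides and 16 latent channels are assumptions,
# not confirmed Wan2.1 internals.

def latent_shape(num_frames, height, width,
                 t_stride=4, s_stride=8, latent_channels=16):
    """Map an RGB video of (num_frames, height, width) to an approximate latent shape.

    The first frame is encoded on its own (causal), then every t_stride
    subsequent frames share one latent step.
    """
    latent_frames = 1 + (num_frames - 1) // t_stride
    return (latent_channels, latent_frames, height // s_stride, width // s_stride)

# An 81-frame 480x832 clip shrinks to a 21x60x104 latent grid.
print(latent_shape(81, 480, 832))   # -> (16, 21, 60, 104)
```

Denoising a compact latent grid like this, rather than 81 full-resolution frames, is what makes long-range temporal modeling tractable.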
Key Features of Wan2.1
- Text-to-Video: Generates videos from text descriptions, supporting long-form prompts in both English and Chinese, and accurately recreates scene transitions and character interactions (see the usage sketch after this list).
- Image-to-Video: Animates a still image into video, giving creators finer control and making it a natural fit for expanding static images into dynamic clips.
- Complex Motion Generation: Accurately displays complex movements of people or objects, such as rotations, jumps, and turns, with support for advanced camera movement control.
- Physics Simulation: Precisely recreates realistic physical scenarios such as collisions, rebounds, and cuts, generating video content that adheres to physical laws.
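To make the text-to-video feature concrete, here is a minimal sketch using the Hugging Face Diffusers integration. The WanPipeline and AutoencoderKLWan class names, the Wan-AI/Wan2.1-T2V-1.3B-Diffusers checkpoint id, and the generation settings are assumptions drawn from the public integration; check the official model card before running.

```python
import torch
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.utils import export_to_video

# Checkpoint id and settings are assumptions taken from the public integration;
# verify them against the Wan-AI organization on Hugging Face before running.
model_id = "Wan-AI/Wan2.1-T2V-1.3B-Diffusers"

# Load the VAE in float32 for numerical stability; run the transformer in bf16.
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload()  # helps the 1.3B model fit on consumer GPUs

prompt = "A cat in a spacesuit drifting through a neon-lit space station, cinematic lighting"
frames = pipe(
    prompt=prompt,
    height=480,
    width=832,
    num_frames=81,
    guidance_scale=5.0,
).frames[0]

export_to_video(frames, "t2v_demo.mp4", fps=15)
```

Image-to-video follows the same pattern, with a conditioning image passed alongside the prompt through the integration's dedicated image-to-video pipeline; again, confirm the exact class and checkpoint names against the official repositories.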
Open Source and Accessible
Wan2.1 is open-sourced under the Apache 2.0 license and supports multiple mainstream frameworks. It is available on GitHub, Hugging Face, and the ModelScope community, making it easy for developers to use and deploy.
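As a quick-start sketch, the weights can be pulled straight from Hugging Face with huggingface_hub (ModelScope ships an equivalent snapshot_download in its own SDK). The repository id below is an assumption based on the Wan-AI organization's naming; browse the organization page to pick the exact checkpoint you need.

```python
from huggingface_hub import snapshot_download

# Repository id is an assumption based on the Wan-AI organization's naming;
# see https://huggingface.co/Wan-AI for the full list of released checkpoints.
local_dir = snapshot_download(
    repo_id="Wan-AI/Wan2.1-T2V-1.3B",
    local_dir="./Wan2.1-T2V-1.3B",
)
print(f"Model weights downloaded to {local_dir}")
```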
Conclusion
Alibaba’s open-sourcing of Wan2.1 marks a significant step forward in the field of AI video generation. Its impressive performance, coupled with its accessibility and open-source nature, positions it as a valuable tool for researchers, developers, and creators looking to explore the possibilities of AI-generated video content. As the field continues to evolve, Wan2.1 is sure to play a key role in shaping the future of video creation.