智源研究院发布新一代多模态基础模型Emu2，大幅超越同类模型

作者智能小编

1 月 2, 2024 #Emu2, #多模态基础模型, #智源研究院, #每日AI快讯

智源研究院日前宣布开源发布新一代多模态基础模型Emu2，该模型通过大规模自回归生成式多模态预训练，显著推动多模态上下文学习能力的突破。Emu2在少样本多模态理解任务上大幅超越Flamingo-80B、IDEFICS-80B等主流多模态预训练大模型，在包括VQAv2、OKVQA、MSVD、MM-Vet、TouchStone在内的多项少样本理解、视觉问答、主体驱动图像生成等任务上取得最优性能。

Emu2的发布引起了业内的高度关注，它标志着智源研究院在多模态基础模型领域取得了重要突破，进一步提升了我国在人工智能领域的影响力。

新闻翻译：

Title:智源研究院发布新一代多模态基础模型 Emu2， significantly outperforms mainstream models

Keywords:智源研究院，多模态基础模型，Emu2，breakthrough

News content:

Zhiyuan Research Institute recently announced the open source release of a new generation of multi-modal basic model Emu2. This model, through large-scale self-reliable generation and multi-modal pre-training, significantly promotes the breakthrough of multi-modal context learning ability. Emu2 significantly outperforms Flamingo-80B, IDEFICS-80B, and other mainstream multi-modal pre-trained large models on the task of few-sample multi-modal understanding, and achieves the optimal performance on multiple tasks such as visual question answering, object-oriented image generation, and others.

Emu2’s release has attracted widespread attention in the field, and it marks a significant breakthrough in the field of multi-modal basic models, further enhancing China’s influence in the field of artificial intelligence.

【来源】https://mp.weixin.qq.com/s/Xf4xBzYwubVd8Lpw68ikDA