Apple's research team has announced MM1, a multimodal large model with up to 30 billion parameters built on a Mixture of Experts (MoE) architecture, showcasing the company's latest progress in artificial intelligence. The work is described in a multi-author paper, "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training". MM1 achieves state-of-the-art (SOTA) results during pre-training and, after supervised fine-tuning, remains highly competitive on a range of established multimodal benchmarks. The release marks another significant step in Apple's AI research and opens new possibilities for future intelligent applications and interaction.
English title: Apple Releases 30B Parameter Multimodal Model MM1, Pushing the Boundaries of AI
English keywords: Apple AI Research, Multimodal Models, Mixture of Experts
English news content: Apple's research team has unveiled MM1, a multimodal model with up to 30 billion parameters and a Mixture of Experts (MoE) architecture, marking a significant step forward in AI research. The model's capabilities are detailed in a paper titled "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training", authored by a team of researchers. MM1 has demonstrated state-of-the-art performance in pre-training and competitive results on several multimodal benchmarks after fine-tuning, signaling new possibilities for intelligent applications and user interaction.
Source: https://mp.weixin.qq.com/s/i9bx6M32uk4Jq2KSRhv4ng
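The announcement highlights the Mixture of Experts (MoE) architecture without explaining it. As a rough illustration only, and not Apple's MM1 implementation, the NumPy sketch below shows the core MoE idea: a learned router scores a set of expert networks for each token and only the top-k experts are run, so total parameter count can grow without a proportional rise in per-token compute. All names and sizes here (D_MODEL, N_EXPERTS, TOP_K, the two-layer experts) are toy assumptions for the sketch.

```python
# Illustrative MoE sketch only -- not from the MM1 paper; all sizes are toy values.
import numpy as np

rng = np.random.default_rng(0)

D_MODEL = 8      # token embedding size (assumed toy value)
D_HIDDEN = 16    # expert hidden size (assumed toy value)
N_EXPERTS = 4    # number of experts
TOP_K = 2        # experts activated per token

# Router: a single linear map producing one score per expert.
W_router = rng.normal(scale=0.1, size=(D_MODEL, N_EXPERTS))

# Experts: each is a small two-layer MLP with its own weights.
experts = [
    (rng.normal(scale=0.1, size=(D_MODEL, D_HIDDEN)),
     rng.normal(scale=0.1, size=(D_HIDDEN, D_MODEL)))
    for _ in range(N_EXPERTS)
]

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def moe_layer(tokens):
    """tokens: (n_tokens, D_MODEL) -> (n_tokens, D_MODEL)."""
    scores = softmax(tokens @ W_router)          # router probabilities per expert
    out = np.zeros_like(tokens)
    for t, (x, p) in enumerate(zip(tokens, scores)):
        top = np.argsort(p)[-TOP_K:]             # indices of the top-k experts
        weights = p[top] / p[top].sum()          # renormalize their gate values
        for w, e_idx in zip(weights, top):
            w1, w2 = experts[e_idx]
            h = np.maximum(x @ w1, 0.0)          # ReLU MLP expert
            out[t] += w * (h @ w2)               # weighted sum of expert outputs
    return out

tokens = rng.normal(size=(3, D_MODEL))           # three toy "tokens"
print(moe_layer(tokens).shape)                   # -> (3, 8)
```

Because only TOP_K of the N_EXPERTS experts run per token, the per-token compute stays close to that of a dense layer of one expert's size, which is the general motivation for using MoE in large models such as the one described above.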