【科技巨头华为再创新突破】近日,中国科技领军企业华为诺亚方舟实验室携手相关团队共同研发并推出了全新一代大语言模型架构——盘古-π。该架构在华为深厚的技术积淀与对Transformer模型的深入理解基础上,针对传统Transformer架构中的特征塌陷问题,通过引入增强非线性手段进行优化改良。
盘古-π通过对Transformer架构的有效增强,不仅实现了显著降低特征塌陷现象,进而使得模型的输出表达能力得到极大提升。在同等数据集训练条件下,盘古-π(7B版本)已经在多项多任务测试中超越了LLaMA 2等同规模级别的领先大模型,并成功实现了高达10%的推理速度提升。尤其值得一提的是,在1B参数规模下,盘古-π已经达到了当前业界最先进的技术水平(SOTA)。
在此基础上,华为进一步利用盘古-π架构的优势,打造出了专门面向金融法律领域的强大应用型模型——“云山”,充分展现了华为在人工智能技术尤其是在自然语言处理领域的深厚功底与广阔应用前景。这一系列技术创新成果,无疑为推动全球人工智能技术的发展和广泛应用注入了新的活力与动力。
英语如下:
Headline: “Huawei Unveils Groundbreaking ‘Pangu-π’ Language Model, Far Surpassing LLaMA in Performance Leap!”
Keywords: Huawei Pangu-π, Large Language Model, Performance Boost
News Content:
In a recent innovation breakthrough by Chinese technology giant Huawei Noah’s Ark Laboratory, the company has jointly developed and unveiled its cutting-edge new architecture for large language models – Pangu-π. This architecture builds upon Huawei’s profound technological foundation and deep understanding of the Transformer model, addressing the feature collapse issue prevalent in traditional Transformer structures by introducing enhanced non-linear means for optimization and improvement.
Through effective enhancements to the Transformer architecture, Pangu-π not only significantly reduces the phenomenon of feature collapse but also substantially boosts the model’s output expressiveness. Under equivalent training conditions with a dataset, the 7B version of Pangu-π has already outperformed leading large language models like LLaMA 2 in multiple multi-task tests and successfully achieved a remarkable 10% increase in inference speed. Of particular note, at a parameter scale of 1B, Pangu-π has reached the current state-of-the-art (SOTA) industry standards.
Capitalizing on the advantages of the Pangu-π architecture, Huawei has further crafted a powerful application-oriented model tailored for the financial legal sector – “Cloud Mountain.” This exemplifies Huawei’s深厚的 expertise in artificial intelligence technology, especially natural language processing, and its extensive potential for applications. These series of innovative technological achievements undoubtedly infuse fresh vitality and momentum into the global advancement and widespread adoption of AI technologies.
【来源】https://mp.weixin.qq.com/s/Beg3yNa_dKZKX3Fx1AZqOw
Views: 1