News Title: Mistral AI Publishes Mixtral 8x7B Paper, Outperforming GPT-3.5 and LLaMA 2 70B on MMLU

Keywords: Mixtral 8x7B, MMLU benchmark, GPT-3.5, LLaMA 2 70B

News Content:

Mistral AI has published a paper on Mixtral 8x7B, the model it released in mid-December. The paper describes the model's architecture in detail and includes extensive benchmarks comparing it with LLaMA 2 70B and GPT-3.5. On the MMLU benchmark, Mixtral outperforms both of those models, and it reaches roughly 85% to 90% of the level of much larger models such as Gemini Ultra or GPT-4, depending on the prompting method.

The publication of the Mixtral 8x7B paper marks a notable milestone for Mistral AI. MMLU is a widely used benchmark of language-model capability, and Mixtral's strong results on it demonstrate its ability in language understanding and generation. By outperforming GPT-3.5 and LLaMA 2 70B on MMLU, Mixtral 8x7B raises the bar for openly available language models.

Mistral AI will continue to focus on research in artificial intelligence, aiming to provide users with more efficient and intelligent services. The publication of the Mixtral 8x7B paper is not only a recognition of Mistral AI's research results but also lends strong support to the development of the field.
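
For readers who want to try the model themselves, the sketch below shows one way to pose an MMLU-style multiple-choice question to the openly released Mixtral 8x7B instruct weights using the Hugging Face `transformers` library. The model identifier, prompt format, and generation settings are illustrative assumptions rather than details taken from the paper, and running the full model requires substantial GPU memory.

```python
# Minimal sketch, assuming the instruct checkpoint is published on the Hugging Face Hub
# as "mistralai/Mixtral-8x7B-Instruct-v0.1" (an assumption, not a detail from the paper).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"  # assumed Hub identifier

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# An MMLU-style multiple-choice question, posed as a plain text prompt.
prompt = (
    "Answer with a single letter.\n"
    "Question: Which planet is known as the Red Planet?\n"
    "A. Venus  B. Mars  C. Jupiter  D. Mercury\n"
    "Answer:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=5, do_sample=False)

# Decode only the newly generated tokens, not the echoed prompt.
answer = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(answer.strip())
```

Note that standard MMLU harnesses typically score the likelihood of each answer choice rather than relying on free-form generation; the snippet is only meant to illustrate basic access to the released model.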

[Source] https://the-decoder.com/mixtral-8x7b-is-currently-the-best-open-source-llm-surpassing-gpt-3-5/
