**News Title:** XVERSE-V from XVERSE (元象): Multimodal Large Model Opens a New Era, Open-Source and Free for Commercial Use, Outperforms International Giants

**Keywords:** XVERSE-V, Multimodal Large Model, Open-Source, Free Commercial Use

**News Content:**

**XVERSE Launches Revolutionary Multimodal Large Model XVERSE-V, Opening a New Era in Image Processing**

XVERSE (元象) today announced XVERSE-V, its newest multimodal large model. The model's standout feature is support for image input of any aspect ratio, which has drawn wide attention in the industry. According to the company, XVERSE-V performs strongly on mainstream benchmarks and shows a leading technical edge.

Beyond the technical advance, XVERSE-V is fully open-source and may be used commercially for free, without conditions. This is expected to speed the adoption of multimodal technology across a wider range of fields and to give developers and businesses a freer, lower-cost platform for innovation.

In performance tests, XVERSE-V outperformed well-known open-source models, including 01.AI's Yi-VL-34B, ModelBest's OmniLMM-12B, and DeepSeek's DeepSeek-VL-7B, on multiple authoritative multimodal benchmarks. On MMBench, a comprehensive capability benchmark, it also surpassed well-known closed-source models such as Google's Gemini Pro Vision, Alibaba's Qwen-VL-Plus, and Claude-3V Sonnet.

The release points to a new stage for image processing and multimodal analysis, offering more efficient and flexible tools for artificial intelligence and big data analytics. Because XVERSE-V is open-source, it is expected to draw more developers and enterprises into exploring and applying this technology, helping to advance AI as a whole.

**Source:** https://mp.weixin.qq.com/s/AsNytkHXikWUXc6HLNY0vA