智源CMMU评测中文理解能力

作者智能小编

3 月 30, 2024 #GPT-4V, #多模态模型, #智源CMMU, #每日AI快讯

智源研究院近日发布了中文多模态多题型理解及推理评测基准CMMU，旨在评估人工智能在中文理解方面的能力。CMMU v0.1版本包含3603道题目，涵盖单选题、多选题和填空题，这些题目均来自中国教育体系规范下的全国小学、初中和高中考试。智源研究院采用多重评测手段，确保模型不是随机猜对答案。评测结果显示，OpenAI的GPT-4V多模态模型在CMMU上的答题准确率约为30%，表明模型在图像理解和推理能力方面仍有较大提升空间。

英文标题：Zhiyuan CMMU Evaluates Chinese Understanding Abilities
英文关键词：Zhiyuan CMMU, GPT-4V, Multimodal Models
英文新闻内容：
The Zhiyuan Institute has recently released the CMMU (Chinese Multimodal Multi-choice Understanding and Reasoning Evaluation Benchmark), a benchmark designed to assess the capabilities of AI in understanding the Chinese language. CMMU v0.1 consists of 3,603 questions from national exams for primary, middle, and high schools in China, covering multiple-choice, multiple-selection, and fill-in-the-blank question types. The Zhiyuan Institute has employed multiple evaluation methods to ensure that the models are not merely guessing the answers randomly. The results indicate that the GPT-4V multimodal model from OpenAI has an accuracy rate of around 30% on the CMMU, suggesting that there is still significant room for improvement in the model’s image understanding and reasoning abilities.

【来源】https://mp.weixin.qq.com/s/wegZvv4hwLef0BpdIh32-A

智能新闻

发表回复取消回复

洞见天下，智领未来! 👏

AI With Me

一	二	三	四	五	六	日
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

智源CMMU评测中文理解能力

作者智能小编

相关文章

腾讯AI“元宝”杀入微信，13亿用户社交版图重塑？

2025人工智能：颠覆与新生

北大团队突破！单目长视频实时重建高质量3D点云

发表回复取消回复

为您推荐

腾讯AI“元宝”杀入微信，13亿用户社交版图重塑？

2025人工智能：颠覆与新生

北大团队突破！单目长视频实时重建高质量3D点云

Powering Real-Time Engagement Build with Live APIs

作者智能小编

相关文章

发表回复 取消回复

为您推荐

发表回复取消回复