News Title: Alibaba's "Tongyi Tingwu" Upgrades with Long-Form Audio and Video Q&A and Mind Map Generation, Pioneering a New AI Media Experience
Keywords: Tongyi Tingwu upgrade, long-form audio and video Q&A, Xiao Wu assistant
News Content: Today, Alibaba's large-model product "Tongyi Tingwu" (通义听悟) announced a major upgrade that adds six new features aimed at improving user experience and work efficiency. The centerpiece is "Xiao Wu" (小悟), an audio and video Q&A assistant: users ask a simple question and get key information directly from ultra-long audio and video, which makes free-form Q&A over very long media possible.
"Xiao Wu" combines multilingual query processing with long-document text comprehension, an optimized instruction evolution framework, and retrieval-augmented generation algorithms. Together these provide pioneering support for multilingual free-form Q&A over ultra-long audio and video, both within a single record and across records. Users are therefore no longer limited by the length or number of recordings and can search and understand large volumes of information more efficiently.
Tongyi Tingwu also adds a one-click AI rewriting feature that helps users quickly produce varied wordings of a text and write more efficiently. A new mind map generation feature gives users a structured way to organize information, making complex content easier to understand and remember.
The upgrade demonstrates Alibaba's technical strength in artificial intelligence and gives news editors, researchers, and general information consumers smarter, more convenient tools. It may reshape how audio and video content is consumed and processed. Source: ITHome (IT之家).
[Source] https://www.ithome.com/0/756/690.htm