**News Title:** “UC Berkeley Stuns with LargeWorldModel: A Million-Token Context, Video Generation, and the Dawn of Open-Source Era!”
**Keywords:** UC Berkeley, Large World Model, LWM
**News Content:**
**UC Berkeley Unveils “Large World Model”: A Pioneering Open-Source World Model Rivaling Google’s Gemini 1.5**
Today, UC Berkeley dropped a bombshell in the tech sphere with the official launch of its "Large World Model" (LWM), which swiftly topped GitHub's trending list and has emerged as the latest focal point among open-source world models. With the capability to handle a million tokens of context, LWM stands shoulder to shoulder with Google's Gemini 1.5, showcasing formidable prowess in natural language processing.
The model's plain name underscores its direct, powerful purpose. Not only does it handle vast amounts of text with a 1-million-token context window, it also excels at multimodal processing. According to QbitAI, LWM can accurately pinpoint a target piece of information within an ocean of text and, more impressively, can process up to an hour of video in a single pass, a rarity among current AI models.
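Long-context retrieval claims like this are commonly evaluated with a "needle in a haystack" test: a short fact is buried at some depth inside a long run of filler text, and the model is asked to retrieve it. Below is a minimal, illustrative sketch of how such a test prompt can be constructed; the filler sentence and the `make_haystack` helper are assumptions for illustration, not part of LWM's actual evaluation code.

```python
def make_haystack(needle: str, filler: str, n_fillers: int, depth: float) -> str:
    """Build a long context with `needle` inserted at a relative depth in [0, 1]."""
    chunks = [filler] * n_fillers
    pos = int(depth * len(chunks))  # where to bury the needle
    chunks.insert(pos, needle)
    return "\n".join(chunks)

# Bury a fact halfway into 1,000 filler sentences, then ask for it back.
needle = "The secret launch code is 7-alpha-9."
filler = "The sky was clear and the market opened without incident."
context = make_haystack(needle, filler, n_fillers=1000, depth=0.5)
prompt = context + "\n\nQuestion: What is the secret launch code?"
```

A model with robust long-context retrieval should answer correctly regardless of `depth`; sweeping the depth and the total context length is what produces the retrieval-accuracy heat maps commonly shown for such evaluations.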
This innovation from UC Berkeley opens up new possibilities for research and application in artificial intelligence. Its open-source nature gives developers and researchers worldwide access to the technology, potentially propelling advances in natural language understanding and video analysis. The release of LWM marks another solid step toward intelligent systems that can understand and generate complex multimodal information.
**Source:** https://mp.weixin.qq.com/s/52uUGcgcoT6oGhZvi-Dl-w