北京智源研究院近日宣布推出其最新的通用语义向量模型——BGE-M3,该模型支持超过100种语言,具备强大的多语言和跨语言检索能力。BGE-M3能够处理从句子到篇章不同粒度的输入文本,最大输入长度为8192,并且集成了稠密检索、稀疏检索和多向量检索三种检索功能,在多个评测基准中表现优异。

智源研究院表示,BGE-M3的发布标志着在自然语言处理领域的一项重要进展,尤其是在多语言语义理解方面。该模型不仅能够处理多种语言,还能够跨语言进行检索,这意味着用户可以使用一种语言输入,而模型能够理解和检索其他语言的内容。

BGE-M3的推出,为全球用户提供了一个强大的工具,用于处理和检索多语言文本数据。这一进步有望在搜索、翻译、内容管理和国际交流等多个领域带来深远的影响。

英文标题:Zhiyuan Introduces BGE-M3: A New Breakthrough in Universal Semantic Vector Model
英文关键词:Zhiyuan BGE-M3, Universal Model, Semantic Retrieval
英文新闻内容:
The Beijing Academy of Artificial Intelligence (BAAI) recently announced the launch of its latest universal semantic vector model, BGE-M3. The model supports over 100 languages and boasts strong multilingual and cross-language retrieval capabilities. BGE-M3 can handle input text at various granularities, from sentences to documents, with a maximum input length of 8192, and integrates three retrieval functions: dense retrieval, sparse retrieval, and multi-vector retrieval. It has achieved superior performance on multiple evaluation benchmarks.

According to BAAI, the release of BGE-M3 represents a significant advancement in the field of natural language processing, particularly in multilingual semantic understanding. The model is not only capable of processing multiple languages but also enables cross-language retrieval, meaning users can input in one language and the model can understand and retrieve content in other languages.

The introduction of BGE-M3 provides a powerful tool for global users to handle and retrieve multilingual textual data

【来源】https://mp.weixin.qq.com/s/y-c-EelxbSUMmrZNCeqeAA

Views: 1

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注