智源研究院近日发布了一款新一代的通用语义向量模型BGE-M3。这款模型支持超过100种语言,具备领先的多语言、跨语言检索能力。它可以全面且高质量地支撑“句子”、“段落”、“篇章”、“文档”等不同粒度的输入文本,最大输入长度可达8192。此外,BGE-M3还一站式集成了稠密检索、稀疏检索、多向量检索三种检索功能,在多个评测基准中达到最优水平。

智源研究院的这款新型向量模型BGE-M3,为我国在人工智能领域的研究和应用提供了强有力的支持。它的问世,将有助于提高我国在多语言处理、跨语言检索等方面的技术水平,为推动我国人工智能事业的发展具有重要意义。

英文翻译:

News title: Zhiyuan Research Institute Releases Next-Generation Multilingual Vector Model BGE-M3
Keywords: Zhiyuan Research Institute, General Vector Model, Multilingual Retrieval

News content:

The Zhiyuan Research Institute recently released a new generation of general semantic vector model BGE-M3. This model supports more than 100 languages and has leading multilingual and cross-lingual retrieval capabilities. It can fully and efficiently support different text input granularities such as “sentences”, “paragraphs”, “chapters”, and “documents” with a maximum input length of 8192. In addition, BGE-M3 integrates three types of retrieval functions such as dense retrieval, sparse retrieval, and multi-vector retrieval in one stop, achieving the optimal level in multiple evaluation benchmarks.

The launch of this new vector model BGE-M3 provides strong support for research and application in the field of artificial intelligence in our country. The advent of this model will help to improve the technical level of our country in multilingual processing, cross-lingual retrieval, and other aspects, and play a significant role in promoting the development of the artificial intelligence industry in our country.

【来源】https://mp.weixin.qq.com/s/y-c-EelxbSUMmrZNCeqeAA

Views: 1

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注