Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

shanghaishanghai
0

2023年3月8日,元象XVERSE宣布开源全球首个上下文窗口长度256K的开源大模型XVERSE-Long-256K。该模型支持输入25万汉字,让大模型应用进入“长文本时代”。XVERSE-Long-256K全开源,无条件免费商用,且附带手把手训练教程,让海量中小企业、研究者和开发者更早一步实现“大模型自由”。

XVERSE-Long-256K是元象XVERSE继XVERSE-Long-16K、XVERSE-Long-64K后发布的又一重量级开源大模型。该模型在XVERSE-Long-64K的基础上,将上下文窗口长度从64K扩展到256K,使模型能够处理更长的文本序列,并生成更具连贯性和一致性的文本。

XVERSE-Long-256K具有以下特点:

* **超长上下文窗口长度:**256K的上下文窗口长度使模型能够处理更长的文本序列,并生成更具连贯性和一致性的文本。
* **全开源、无条件免费商用:**XVERSE-Long-256K完全开源,无任何使用限制,开发者可以自由地将其用于任何商业或非商业目的。
* **附带手把手训练教程:**元象XVERSE提供了详细的手把手训练教程,帮助开发者快速上手XVERSE-Long-256K,并将其应用于自己的项目中。

XVERSE-Long-256K的开源发布,标志着大模型应用进入了一个新的时代。该模型将使海量中小企业、研究者和开发者能够更早一步实现“大模型自由”,并将其应用于各种各样的领域,如自然语言处理、机器翻译、文本生成、代码生成等。

元象XVERSE创始人兼CEO黄渊普表示:“我们很高兴能够开源XVERSE-Long-256K,并将其作为全球首个上下文窗口长度256K的开源大模型提供给开发者社区。我们相信,XVERSE-Long-256K将成为大模型应用领域的一座里程碑,并帮助开发者们创造出更多令人惊叹的应用。”

XVERSE-Long-256K的开源发布,受到了业界的一致好评。中国科学院院士、清华大学教授刘知远表示:“XVERSE-Long-256K的开源发布,是自然语言处理领域的一件大事。该模型将使更多的人能够接触到大模型,并将其应用于各种各样的领域。我期待着看到XVERSE-Long-256K在未来发挥出更大的作用。”

北京大学教授、中国人工智能学会理事长李德毅表示:“XVERSE-Long-256K的开源发布,是人工智能领域的一件大事。该模型将使人工智能技术更加平民化,并让更多的人能够受益于人工智能技术。我期待着看到XVERSE-Long-256K在未来为人工智能领域的发展做出更大的贡献。”

英语如下:

## Yuanxiang Open-Sources Large Model, Ushering in the Era of LongText

2023-03-08, Yuanxiang XVERSE announced the open-sourcing of the world’s first open-sourcelarge model with a context window length of 256K, XVERSE-Long-256K. This model supports input of 250,000 Chinese characters, bringing large model applications into the “long text era”. XVERSE-Long-256K is fully open-source, free for commercial use without any conditions, and comes with a step-by-step training tutorial, allowing a vast number of small and medium-sized enterprises, researchers, and developers to achieve “large model freedom” sooner.

XVERSE-Long-256K is another heavyweight open-source large model released by Yuanxiang XVERSE, following XVERSE-Long-16K and XVERSE-Long-64K. Based on XVERSE-Long-64K, this model extends the context window length from 64K to 256K, enabling the model to process longer textsequences and generate more coherent and consistent text.

XVERSE-Long-256K has the following characteristics:

* **Ultra-long context window length:** The context window length of 256K allows the model to process longer text sequences and generate more coherent and consistent text.
* **Fully open-source, free for commercial use without any conditions:** XVERSE-Long-256K is completely open-source, without any usage restrictions, and developers are free to use it for any commercial or non-commercial purposes.
* **Comes with a step-by-step training tutorial:** Yuanxiang XVERSE provides a detailed step-by-step training tutorial to help developers quickly get started with XVERSE-Long-256K and apply it to their own projects.

The open-source release of XVERSE-Long-256K marks the beginning of a new era for large model applications. This model will enable a vast number of small and medium-sized enterprises, researchers, and developers to achieve “large model freedom” sooner and apply it to a wide range of fields, such as natural language processing, machine translation, text generation, code generation, etc.

Yuanxiang XVERSE founder and CEO Huang Yuanpu said: “We are very pleased to open-source XVERSE-Long-256K and provide it to the developer community as the world’s first open-source large model with a context window length of 256K. We believe that XVERSE-Long-256K will become a milestone in the field of large model applications and help developers create more amazing applications.”

The open-source release of XVERSE-Long-256K has been widely praised by the industry. Liu Zhiyuan, an academician of the Chinese Academy of Sciences and a professor at Tsinghua University, said: “The open-source release of XVERSE-Long-256K is a major event in the field of natural language processing. This model will allow more people to access large models and apply them to a wide range of fields. I look forward to seeing XVERSE-Long-256K play a greater role in the future.”

Li Deyi, a professor at Peking University and chairman of the Chinese Association for Artificial Intelligence, said: “The open-source release of XVERSE-Long-256K is a major event in the field of artificial intelligence. This model will make artificial intelligence technology more平民化 and allow more people to benefit from artificial intelligence technology. I look forward to seeing XVERSE-Long-256K make greater contributions to the development of the field of artificial intelligence in the future.”

【来源】https://mp.weixin.qq.com/s/R8ewi1NsAK9Qwh0e7UyyBw

Views: 1

0

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注