出门问问日前宣布,将向公众开放其超大规模语言模型“序列猴子”的部分训练数据集,命名为“序列猴子开源数据集1.0”。本次开源的“序列猴子数据集1.0”包含了中文通用文本语料、古诗今译语料以及文本生成语料。这一举措标志着出门问问在推动人工智能技术发展上的又一重要进展,同时也为全球的研究者和开发者提供了宝贵的资源。
英文标题:Mobvoi Opens Up First Open-Source Dataset for “Sequence Monkeys”
英文关键词:Language Model, Dataset, Open Source
英文新闻内容:
Mobvoi, a leading artificial intelligence company, has announced the release of the first open-source dataset for “Sequence Monkeys,” its large-scale language model. The dataset, named “Sequence Monkeys Open Source Dataset 1.0,” includes a variety of Chinese language corpora, such as general text, classical poetry translations, and text generation data. This move represents another significant step forward in Mobvoi’s efforts to advance the development of AI technology and provides a valuable resource for researchers and developers worldwide.
【来源】https://mp.weixin.qq.com/s/oSQR3gCCDpJ3Wdu-9iTcbA
Views: 1