法国Kyutai实验室发布实时语音多模态模型Moshi,对标GPT-4o
近日,法国知名开源人工智能研究实验室Kyutai在官网宣布推出了一款全新的实时语音多模态模型Moshi。据悉,该模型具备看、听、说多模态功能,与OpenAI公司在近期展示的GPT-4o模型功能相似。
据了解,Moshi模型能够进行实时语音交互,听取人类提问并进行推理回答。值得一提的是,虽然GPT-4o的语音模式尚未全面开放使用,需要等到秋季,但Kyutai实验室已经向公众提供了Moshi模型的使用权限。这意味着研究人员和开发人员可以立即开始探索这一新兴技术。
Kyutai实验室表示,Moshi模型的发布标志着人工智能领域又向前迈进了一步。该模型的应用前景广泛,包括但不限于智能助手、语音识别和自然语言处理等领域。此外,作为一款开源模型,Moshi的发布也将促进人工智能技术的共享和创新。
目前,人工智能领域的竞争日益激烈,Kyutai实验室的Moshi模型的发布无疑将对行业产生深远影响。我们期待着这一技术的进一步发展和应用。
华尔街见闻消息报道。
英语如下:
News Title: “French Kyutai Lab Unveils Real-time Voice Multimodal Model Moshi, Challenging GPT-4o”
Keywords: 1. Kyutai Lab
News Content: French Kyutai Lab Launches Real-time Voice Multimodal Model Moshi, Pitting It Against GPT-4o
Recently, Kyutai Lab, a renowned open-source artificial intelligence research lab in France, announced on its official website the release of a new real-time voice multimodal model called Moshi. It is reported that this model possesses multi-modal functions of seeing, hearing, and speaking, similar to the GPT-4o model recently showcased by OpenAI.
It is understood that the Moshi model is capable of real-time voice interaction, listening to human questions and answering them through reasoning. Notably, while the voice mode of GPT-4o has not yet been fully opened for use and is expected to be available in the autumn, Kyutai Lab has already provided public access to the Moshi model. This means that researchers and developers can immediately start exploring this emerging technology.
Kyutai Lab indicates that the release of the Moshi model marks another step forward in the field of artificial intelligence. This model has broad application prospects, including but not limited to intelligent assistants, speech recognition, and natural language processing. Furthermore, as an open-source model, the release of Moshi will promote the sharing and innovation of artificial intelligence technology.
Currently, competition in the artificial intelligence field is becoming increasingly fierce, and the release of Kyutai Lab’s Moshi model will undoubtedly have a far-reaching impact on the industry. We look forward to further development and application of this technology.
Reported by Wall Street Journal.
【来源】https://ai-bot.cn/go/?url=aHR0cHM6Ly93YWxsc3RyZWV0Y24uY29tL2FydGljbGVzLzM3MTg3ODY%3D
Views: 1