Stability AI推出Stable Audio Open音乐生成模型：文生图延伸，音频领域的新里程碑

作者智能小编

6 月 7, 2024 #“生成高质量音频”, #“自动编码器模型”, #每日AI快讯

最新消息

重磅！Stability AI推出Stable Audio Open音乐生成模型，进军音频领域

近日，Stability AI公司宣布开源发布Stable Audio Open音乐生成模型，标志着该公司基于Stable Diffusion文生图模型的进一步拓展。Stable Audio Open能够基于用户输入的提示词，生成高质量音频样本，为音频创作领域带来革命性变革。

据悉，Stable Audio Open能够创建最长47秒的音乐，不仅适用于各种类型的音乐创作，还能轻松应对鼓点、乐器旋律、环境音和拟声音效等复杂需求。该模型基于transforms扩散模型（DiT），在自动编码器的潜在空间中操作，显著提高了生成音频的质量和多样性。

业内专家表示，Stable Audio Open的推出将极大地推动音频领域的发展，为音乐创作、音效设计等领域带来更为便捷高效的创作方式。同时，该模型的开源性质也将促进技术交流和进步，推动音频技术的普及和应用。

总的来说，Stability AI的这次创新尝试，有望在音频领域掀起一场技术革命，并为广大音乐爱好者和专业人士带来更多的创作灵感和可能性。

以上是Stability AI推出Stable Audio Open音乐生成模型的报道。

英语如下：

News Title: “Stability AI Launches Stable Audio Open Music Generation Model: Extending from Text-to-Image to a New Benchmark in Audio Domain”

Keywords: “Stable Audio Open”, “Generate High-Quality Audio”, “Autoencoder Model”

News Content:

Stability AI has made a groundbreaking move into the audio domain with the release of its Stable Audio Open music generation model.

Recently, Stability AI announced the open-source launch of the Stable Audio Open music generation model, marking a further expansion based on its popular Stable Diffusion text-to-image model. The Stable Audio Open is capable of generating high-quality audio samples based on user-entered prompts, bringing about a revolutionary change to the audio creation field.

It is reported that Stable Audio Open can create music up to 47 seconds long, suitable for various types of music creation, and can easily handle complex needs such as beats, instrument melodies, ambient sounds, and sound effects. The model, based on the DiT (Diffusion Model with Transforms), operates in the autoencoder’s latent space, significantly improving the quality and diversity of generated audio.

Industry experts indicate that the launch of Stable Audio Open will greatly promote the development of the audio field, bringing more convenient and efficient creation methods to music creation, sound effect design, and other fields. At the same time, the open-source nature of this model will promote technical exchanges and progress, as well as the popularization and application of audio technology.

In general, Stability AI’s innovative attempt is expected to set off a technological revolution in the audio field and bring more creative inspiration and possibilities to music lovers and professionals.

The above is the report on Stability AI’s launch of the Stable Audio Open music generation model.

【来源】https://www.ithome.com/0/773/537.htm