上海宝山炮台湿地公园的蓝天白云上海宝山炮台湿地公园的蓝天白云

Beijing, October 25, 2024 -AI unicorn MiniMax is set to launch its first end-to-end real-time voice dialogue API product in November, aiming to directly compete with OpenAI’s GPT-4o. This new service will enhance real-time multi-modal processing capabilities, delivering lower latency, more natural, and immersive real-time voice conversations. It is designed to cater to a variety of scenarios, including enterprise collaboration, social media, live streaming, and gaming.

Sources close to MiniMax have revealed that the company is currently refining the product and is determined to achieve performance onpar with OpenAI’s GPT-4o upon its release.

OpenAI launched GPT-4o in May, a free-to-use flagship AI model capable of real-time audio, visual, and text reasoning. Itcan respond to audio input in as little as 232 milliseconds, averaging 320 milliseconds, matching human response times in conversation.

In terms of API usage, GPT-4o boasts a 50% price reduction and a 200% speed increase compared to its predecessor, GPT-4-turbo, released in November 2023. OpenAI CEO Sam Altman highlighted GPT-4o as their best model ever, praising its intelligence, speed, native multi-modality, and accessibility to all ChatGPT users, both free and paid GPT-4 subscribers.

This move by MiniMaxcomes as OpenAI has opened its Realtime API to public beta testing, with Agora, a real-time voice technology company and sister company of 声网, listed as a voice API collaborator. Recognizing this opportunity, MiniMax has partnered with Agora, with Agora Founder and CEO Zhao Bin stating at the RTE 2024 Tenth Real-Time Internet Conference that they are developing China’s first Realtime API. This API will power intelligent agents capable of engaging in effortless and smooth real-time voice communication with humans.

MiniMax is not alone in this pursuit. Several Chinese companies, including iFlytek, Zhipu AI,and SenseTime, are developing generative AI dialogue products that rival GPT-4o’s performance. OpenAI has also recently opened up ChatGPT-4o’s dialogue functionality.

According to iResearch Consulting, the market size of conversational AI in 2021 reached 4.5 billion yuan, driving alarger market of 12.6 billion yuan. MiniMax’s upcoming Realtime API, coupled with the growing demand for conversational AI, is poised to significantly impact the industry and shape the future of human-machine interaction.

References:

  • 钛媒体AGI独家|大模型独角兽MiniMax将于11月发布首款对标GPT-4o的端到端实时语音对话API产品
  • OpenAI’s GPT-4o Website
  • RTE 2024 Tenth Real-Time Internet Conference
  • iResearch Consulting Report on Conversational AI Market Size (2021)


>>> Read more <<<

Views: 0

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注