Baidu’s MuseStreamer AI Video Generation Model Takes on Google’s Veo 3 With Native Audio Support: Report

Baidu just dropped a bombshell: MuseStreamer, an AI video generator that doesn’t just create videos but pairs them with natively generated Chinese audio. Move over, Google’s Veo 3, there’s a new contender! Baidu claims MuseStreamer is the world’s first AI video model with native Chinese audio generation.

But wait, there’s more! Baidu also unveiled HuiXiang, a video content creation platform ready to redefine digital storytelling. The catch? For now, both MuseStreamer and HuiXiang are exclusively for the Chinese market. The world will be watching to see if this dynamic duo will eventually make their global debut.

Baidu’s MuseStreamer Can Reportedly Generate Chinese Audio

Two years ago, AI-generated video felt like a clumsy dream. Extra fingers sprouted from digital hands, and physics took a holiday. Now? Video generation models render scenes and motion with striking realism. Yet one frontier remains largely unexplored: AI video with native sound. Most video generators still produce silent clips.

Google I/O 2025 wasn’t just another keynote; it was a seismic shift. With the unveiling of Veo 3, Google didn’t just raise the bar – it launched it into orbit. Forget whispers, the internet exploded. Veo 3 instantly became the “it” tool, eclipsing even OpenAI’s Sora. And Google isn’t holding back. Veo 3 is now available in all 154 countries where Gemini thrives, signaling a global blitzkrieg for AI dominance.

But hold on, the plot thickens! Tech in Asia, citing AI Base, reports that Chinese heavyweight Baidu has thrown its hat in the ring with MuseStreamer. This model doesn’t just generate videos; it pairs them with native Chinese audio, a feat unmatched even by Google’s Veo 3, which reportedly generates audio only in English for now. It’s a bilingual battle brewing in the video AI arena!

Baidu’s MuseStreamer isn’t just adding subtitles; it’s orchestrating entire audio landscapes for your videos. Imagine AI-generated dialogue synced to the action, punctuated by well-placed sound effects and immersive ambient noise. Baidu says MuseStreamer scored 89.38% on the VBench I2V (image-to-video) benchmark, claiming the top spot. The company envisions the model as a tool that lets everyday users become content creators.

Baidu is pairing its AI video model with a playground called HuiXiang. Think of it as a front end for MuseStreamer, letting users conjure videos from simple prompts. HuiXiang reportedly cranks out crisp 1080p videos of up to 10 seconds each. Feeling limited by Veo 3’s eight seconds? Baidu has already upped the game. The only question mark? The default aspect ratio of the videos, and whether users can change it.
