Mission release audio generation model LongCat-AudioDiT

April 3 News.Meituan (Japanese company)It was released yesterdayAudio Generation Model LongCat-AudioDiT and synchronise open source 1B and 3.5B versions。

It was described that LongCat-AudioDiT, which was directly modelled in wave-shaped subspace, only needed a wave-forming decoder (Wav-VAE) and a proliferation Transformer (DiT) to eliminate the accumulation of errors from the root causes of multistage cascades。

Training - Logic Alignment: Force the reset of the hidden variable of the hint area to the real value in each step of the reasoning to resolve the long-standing problem of sound drift。

SELF-ADAPTATION PROJECTORS (APG)REPLACES THE TRADITIONAL NON-CLASSIFIER GUIDE (CFG), DECOMPOSES THE GUIDANCE SIGNAL TO A POSITIVE AND PARALLEL MASS, PRESERVES THE USEFUL, INHIBITS THE POOR, AND AVOIDS THE "SATURATION" OF THE SPECTRUM WHILE INCREASING THE SOUND-COLOR SIMILARITIES。

In the Seed benchmark test, the LongCat-AudioDiT-3.5B speaker-similarity (SIM) reached 0.818 in the Chinese test set (Seed-ZH) and the Chinese hard-word set (Seed-Hard) reached 0.797, exceeding models such as Seed-TTS, CosyVoice 3.5 and MiniMax-Speech to achieve current SOTA performance。

GitHub: https://github.com/meituan-longcat/LongCat-AudioDiT

Hugging Face: https://huggingface.co/meituan-longcat/LongCat-AudioDiT

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

{{userData.name}}Verify

Mission release audio generation model LongCat-AudioDiT

MORGAN CHASE CEO DAMON: AI WILL BRING THREE AND A HALF DAYS OF WORK, AND HUMAN LIFE IS EXPECTED TO BE 100 YEARS OLD

Genre GLM-51 low-key on line, 2.6 minutes from Claude Opus 4.6

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

MORGAN CHASE CEO DAMON: AI WILL BRING THREE AND A HALF DAYS OF WORK, AND HUMAN LIFE IS EXPECTED TO BE 100 YEARS OLD

Genre GLM-51 low-key on line, 2.6 minutes from Claude Opus 4.6

The news said that Meituan "All in AI", Wang Xing, Wang Puzhong both valued

America releases open source LongCat-Video video generation model, which stabilizes 5 minute content

Launch and open-source LongCat-Flash-Omni model: support live video interaction to SOTA level

The group LongCat Big Model Official App release: Supporting online search and enabling voice calls

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow