Online open source Confucius 4-TTS, 3 seconds audio to clone sound

June 25th.NetEaseThere was a announcement yesterday to launch "The Son 4.0."TTS Voice synthesis engine Confucius 4-TTS. Officially, it's the first in the industry to support 14 languages with no accent and no reference to textOpen SourceModel.

Online open source Confucius 4-TTS, 3 seconds audio to clone sound

 

Confucius 4-TTS supports voice cloning of zero samples. 3 seconds of audio material provided by users, without reference to text and prior training, the model completes sound cloning; official nameClone SoundSIMILAR TO THE ORIGINAL ACOUSTIC IS MORE THAN 85% AND THE CLONING MISSION IS AS ACCURATE AS 97%。

THE MODEL SUPPORTS 14 LANGUAGES SUCH AS CHINESE, ENGLISH, SPANISH, FRENCH, GERMAN, KOREAN, THAI AND VIETNAMESE. OFFICIALLY, ITS FOCUS IS ON TRANSLINGUAL PRONUNCIATION: AFTER UPLOADING THE CHINESE AUDIO, AI CAN GENERATE FOREIGN LANGUAGES SUCH AS JAPANESE, ENGLISH, ETC。

GitHub: gethub.com/netease-youudao/Confucus4-TTS

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.
Information

OpenAI and Broadcom publish the initial Jalalapeño reason chip

2026-6-25 14:56:25

Information

Beanbag Visual Understanding Model Stuns: Ranked #2 in the World in First Review

2024-12-23 20:06:57

Search