News, September 19: Yesterday afternoon, ModelBest announced a refresh of its "Small Steel Gun" (MiniCPM) series, introducing VoxCPM, a speech generation base model with 0.5B parameters. VoxCPM was officially released by the Human-Computer Speech Interaction Laboratory at Tsinghua University Shenzhen International Graduate School. At 0.5B parameters, the model reaches industry SOTA levels in speech naturalness, speaker similarity, and prosody.

Performance: RTF ≈ 0.17, with streaming output supported. VoxCPM performs strongly on the Seed-TTS-eval benchmark, achieving a very low word error rate and voice-cloning speaker similarity approaching the level of real recordings. On an NVIDIA RTX 4090 GPU, inference reaches RTF ≈ 0.17, fast enough for high-quality real-time interaction.
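For context, RTF (real-time factor) is wall-clock synthesis time divided by the duration of the audio produced, so RTF ≈ 0.17 means one second of speech takes roughly 0.17 seconds to generate. A minimal measurement sketch in Python (the synthesize callable and the 16 kHz sample rate are illustrative assumptions, not details from the announcement):

```python
import time

def real_time_factor(synthesize, text: str, sample_rate: int = 16000) -> float:
    """RTF = synthesis wall-clock time / duration of generated audio.
    RTF < 1 means synthesis runs faster than real time."""
    start = time.perf_counter()
    wav = synthesize(text)                  # assumed: returns a 1-D sample array
    elapsed = time.perf_counter() - start
    audio_seconds = len(wav) / sample_rate
    return elapsed / audio_seconds
```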
Listening experience: VoxCPM models emotion, accent, and rhythm, automatically choosing a vocal style that fits the text and generating diverse audio scenarios such as weather forecasts, pre-battle speeches, and dialect broadcasters. It supports Chinese-English bilingual voice cloning from only a few samples, and can even read mathematical formulas and symbols aloud (see the usage sketch below).
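To illustrate few-shot voice cloning, here is a hypothetical usage sketch. The voxcpm package name follows the GitHub repository, but the exact class and argument names (VoxCPM.from_pretrained, generate, prompt_wav_path, prompt_text) are assumptions that should be checked against the project README:

```python
import soundfile as sf
from voxcpm import VoxCPM  # assumed package layout from the GitHub repo

# Load the 0.5B base model from Hugging Face.
model = VoxCPM.from_pretrained("openbmb/VoxCPM-0.5B")

# Clone a voice from one short reference clip plus its transcript,
# then synthesize new text (here, a spoken math formula) in that voice.
wav = model.generate(
    text="The integral of x squared from zero to one equals one third.",
    prompt_wav_path="reference_speaker.wav",  # hypothetical local sample
    prompt_text="Transcript of the reference clip.",
)
sf.write("cloned_output.wav", wav, 16000)     # assumed 16 kHz output
```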
Technical architecture: VoxCPM integrates language modeling and diffusion in a single framework. Its core modules are LocEnc, TSLM, RALM, and LocDiT, which together achieve efficient generation and reconstruction of continuous speech features through a VAE decoder (a dataflow sketch follows).
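A purely illustrative dataflow sketch of such a hierarchy, reading the module names as a local encoder (LocEnc), a text-semantic language model (TSLM), a residual acoustic language model (RALM), and a local diffusion Transformer (LocDiT); the stub functions and shapes are assumptions for readability, not the actual VoxCPM implementation:

```python
import numpy as np

def loc_enc(prompt_latents: np.ndarray) -> np.ndarray:
    """LocEnc stub: encode local patches of reference speech latents."""
    return prompt_latents.mean(axis=0, keepdims=True)

def tslm(text_tokens: list[str]) -> np.ndarray:
    """TSLM stub: map text tokens to semantic feature vectors."""
    return np.random.randn(len(text_tokens), 64)

def ralm(semantic: np.ndarray, context: np.ndarray) -> np.ndarray:
    """RALM stub: refine semantics into acoustic features,
    conditioned on the encoded reference context."""
    return semantic + context

def loc_dit(acoustic: np.ndarray) -> np.ndarray:
    """LocDiT stub: a diffusion pass producing continuous speech
    latents (here an identity placeholder)."""
    return acoustic

def vae_decode(latents: np.ndarray, upsample: int = 320) -> np.ndarray:
    """VAE decoder stub: expand each latent frame into waveform samples."""
    return np.repeat(latents.mean(axis=1), upsample)

prompt = np.random.randn(50, 64)   # fake reference-speech latents
latents = loc_dit(ralm(tslm("hello world".split()), loc_enc(prompt)))
waveform = vae_decode(latents)
print(waveform.shape)              # (2 frames * 320 samples,) = (640,)
```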
VoxCPM is now open-sourced on GitHub, Hugging Face, and ModelScope, free for developers to download and try; an online PlayGround and an audio sample page have gone live alongside the release.
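For a quick local copy of the weights, the standard huggingface_hub API works; the destination directory below is an arbitrary choice:

```python
from huggingface_hub import snapshot_download

# Fetch the VoxCPM-0.5B repository from Hugging Face into a local folder.
local_dir = snapshot_download(
    repo_id="openbmb/VoxCPM-0.5B",
    local_dir="./VoxCPM-0.5B",  # hypothetical destination
)
print(f"Model files downloaded to {local_dir}")
```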
Model links:
GitHub: https://github.com/OpenBMB/VoxCPM
Hugging Face: https://huggingface.co/openbmb/VoxCPM-0.5B
ModelScope: https://modelscope.cn/models/OpenBMB/VoxCPM-0.5B
PlayGround experience: https://huggingface.co/spaces/OpenBMB/VoxCPM-Demo
Audio sample page: https://openbmb.github.io/VoxCPM-demopage