Tencent open source hybrid voice digital human model: a picture a piece of audio can make the figure speak and sing

Tencent open source hybrid voice digital human model: a picture of a piece of audio can make the figure speak and sing

May 28 News.TencentThe hybrid public announced in a post todayOpen Sourceorigin of the universeVoice Digital Human Model, with just a picture and a piece of audio, you can make the main character in the picture speak and sing naturally.

Tencent open source hybrid voice digital human model: a picture of a piece of audio can make the figure speak and sing

The released and open source voice digital human model HunyuanVideo-Avatar, jointly developed by Tencent hybrid video model (HunyuanVideo) and Tencent Music Tianqin Labs MuseV technology, supportHead and shoulders, half and full body views,as well asMulti-style, multi-species & two-player scenarios, which is geared towards video creators to provide highly consistent and dynamic video generation capabilities.

Users can upload character images and audio, and the HunyuanVideo-Avatar model will automatically understand the images and audio, such asEmotions embedded in the character's environment, audioetc., allowing the characters in the picture to speak or sing naturally, generating videos that contain natural expressions, lip synchronization, and full-body movements.

HunyuanVideo-Avatar is suitable for short video creation, e-commerce and advertising and other application scenarios. It can generate clips of characters speaking, dialoguing and acting in different scenes, quickly create product introduction videos or multi-person interactive advertisements, and reduce production costs.

The single-subject capability of HunyuanVideo-Avatar has been open-sourced and launched on Tencent's official website. Users can experience it in "Model Square - Hunyuan Raw Video - Digital Human - Speech-driven - HunyuanVideo-Avatar", which supports uploading and downloading.Audio not exceeding 14 secondsVideo generation will be performed, and other capabilities will be gradually brought online and open-sourced in the future.

1AI attaches the relevant links below:

Experience the portal:https://hunyuan.tencent.com/ modelSquare / home / play?modelId=126
Project home page:https://hunyuanvideo-avatar.github.io
Github:https://github.com/Tencent-Hunyuan/HunyuanVideo-Avatar
CNB:https://cnb.cool/tencent/hunyuan/HunyuanVideo-Avatar
Technical report:https://arxiv.org/ pdf/2505.20156

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

{{userData.name}}Verify

Tencent open source hybrid voice digital human model: a picture of a piece of audio can make the figure speak and sing

Opera Launches Neon, a Proxy Browser: AI Writes Code and Creates Websites Directly for You

Japan Introduces First Artificial Intelligence Law: Promoting Technology R&D Applications and Preventing Abuse

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

Opera Launches Neon, a Proxy Browser: AI Writes Code and Creates Websites Directly for You

Japan Introduces First Artificial Intelligence Law: Promoting Technology R&D Applications and Preventing Abuse

Tencent opens source lip-syncing tool AniPortrait to let photos sing and talk

Tencent's Hunyuan Wenshengtu model is open source: equipped with the first Chinese-English bilingual DiT architecture, free for commercial use

Tencent Hunyuan Wenshengtu Large Model Open Source Training Code Release LoRA and ControlNet Plugins

Tencent Launches Hunyuan-Large Large Model: 389B Total Parameters, Industry's Largest Transformer-Based MoE Model Open-Sourced

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow