Linly Talker: a digital human dialog system, an open source digital human framework from the Linly open source community

Linly Talker is an innovative digital human dialog system that combines Large Language Models (LLMs) with visual models to create a novel approach to human-computer interaction. The system integrates technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker generation system, aiming to provide a realistic digital human dialog experience. Linly-Talker lets users upload an image to converse with, and improves interactivity and realism through a multi-round dialog system. The project was developed by Kedreamix and is open-sourced on GitHub for developers and researchers to use and improve.

Linly Talker Features

  1. Multi-Model Integration: Linly-Talker integrates large language models such as Linly, GeminiPro, and Qwen with speech and visual models such as Whisper and SadTalker, enabling high-quality dialog and visual generation.
  2. Multi-Round Dialogue Capability: Using a GPT-driven multi-round dialog system, Linly-Talker can track context across turns and keep the conversation relevant and coherent, which makes the interaction feel more realistic.
  3. Voice Cloning: Using technologies such as GPT-SoVITS, users can upload a one-minute voice sample for fine-tuning, and the system will clone the user's voice so that the digital human can speak in it.
  4. Real-Time Interaction: The system supports real-time speech recognition and video captioning, so users can communicate naturally with the digital human by voice.
  5. Visual Enhancement: Through digital human generation technologies, Linly-Talker can generate realistic digital human images for a more immersive experience.
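
The multi-round dialog idea in feature 2 can be illustrated with a minimal sketch. This is not Linly-Talker's actual code or API: the names `DialogSession`, `reply_fn`, `max_turns`, and `echo_llm` are hypothetical and exist only for illustration. The sketch assumes that "multi-round" means earlier turns are kept and passed to the language model along with each new user message.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# One completed turn: (what the user said, what the system replied).
Turn = Tuple[str, str]


@dataclass
class DialogSession:
    """Hypothetical helper that keeps conversation history across turns."""

    # Any callable that maps (history, new user text) to a reply.
    # In a real system this would wrap a language model call.
    reply_fn: Callable[[List[Turn], str], str]
    history: List[Turn] = field(default_factory=list)
    max_turns: int = 8  # assumed cap so the context does not grow without bound

    def ask(self, user_text: str) -> str:
        # The reply function sees all retained earlier turns.
        reply = self.reply_fn(self.history, user_text)
        self.history.append((user_text, reply))
        # Keep only the most recent turns.
        self.history = self.history[-self.max_turns:]
        return reply


def echo_llm(history: List[Turn], user_text: str) -> str:
    """Stand-in for a real model: reports how many earlier turns it received."""
    return f"(turn {len(history) + 1}) You said: {user_text}"


if __name__ == "__main__":
    session = DialogSession(reply_fn=echo_llm)
    print(session.ask("Hello"))
    print(session.ask("What did I just say?"))
```

In a full pipeline of the kind the article describes, the text passed to `ask` would come from speech recognition and the reply would be handed on to speech synthesis and video generation; those stages are left out here.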

Official website: https://github.com/Kedreamix/Linly-Talker
