The new multi-mode referral system paradigm DiffMM, allowing proliferation models to recommend short videos!

The novel multimodal recommendation system paradigm DiffMM allows the diffusion model to recommend short videos!

Researchers from HKU and Tencent have proposed a new paradigm for multimodal recommender systems -- theDiffMMThe aim is to increaseShort VideoRecommendation accuracy. The system achieves more accurate recommendations by creating a graph containing information about users and videos and utilizing graph diffusion and contrast learning techniques to better understand the relationship between users and videos.

DiffMM ' s model methodology consists of three main components: multi-mode map diffusion model, multi-mode map aggregation and cross-mode comparison enhancement. Among them, the multi-modular diffusion model uses a model to detect the probability of noise diffusion, aligning the user-matter synergetic signal with the multi-modular information and effectively addressing the negative effects of the multi-modular referral system. At the same time, the production and optimization of model sensory images has been achieved through the optimization of the diffusion of the probabilistic proliferation paradigm and model perception。

The novel multimodal recommendation system paradigm DiffMM allows the diffusion model to recommend short videos!

In terms of cross-modal contrast enhancement, DiffMM utilizes modality-aware contrast view and contrast enhancement methods to capture the consistency of user interaction patterns on different item modalities and improve recommender system performance.

Paper:https://arxiv.org/abs/2406.1178

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

{{userData.name}}Verify

The novel multimodal recommendation system paradigm DiffMM allows the diffusion model to recommend short videos!

Baidu Smart Cloud (Wuzhen) AI Data Industry Base is launched, which will realize the full implementation of local AI native applications

Alibaba Tongyi's audio generation model FunAudioLLM is open source and supports scenarios such as emotional voice dialogue and audiobooks

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

Baidu Smart Cloud (Wuzhen) AI Data Industry Base is launched, which will realize the full implementation of local AI native applications

Alibaba Tongyi's audio generation model FunAudioLLM is open source and supports scenarios such as emotional voice dialogue and audiobooks

AI face-changing Ukrainian beauty makes money in China: software costs only $72 per month

Developers share short videos generated by OpenAI Sora: leaf elephants, rainbow waterfalls, etc.

Sora hasn’t made any money yet, so “AI resurrection” is here to reap the profits

Beijing's first "AI face-changing" software infringement case was sentenced: the Chinese style blogger's short video was "face-changed" and made into a paid template

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow