{"id":36546,"date":"2025-05-31T11:34:46","date_gmt":"2025-05-31T03:34:46","guid":{"rendered":"https:\/\/www.1ai.net\/?p=36546"},"modified":"2025-05-31T11:34:46","modified_gmt":"2025-05-31T03:34:46","slug":"yandex-%e5%8f%91%e5%b8%83%e6%9c%80%e5%a4%a7%e9%9f%b3%e4%b9%90%e6%8e%a8%e8%8d%90%e5%bc%80%e6%ba%90%e6%95%b0%e6%8d%ae%e9%9b%86-yambda","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/36546.html","title":{"rendered":"Yandex releases Yambda, the largest open source dataset for music recommendations"},"content":{"rendered":"<p>Russian search engine giant <a href=\"https:\/\/www.1ai.net\/en\/tag\/yandex\" title=\"_Other Organiser\" target=\"_blank\" >Yandex<\/a> Released on May 30th, the world's<strong>The largest music recommendation system open source<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%95%b0%e6%8d%ae%e9%9b%86\" title=\"[See articles with [data set] labels]\" target=\"_blank\" >Dataset<\/a> <a href=\"https:\/\/www.1ai.net\/en\/tag\/yambda\" title=\"_Other Organiser\" target=\"_blank\" >Yambda<\/a>Contains<\/strong>\u00a04.79 billion pieces of anonymized user interaction data designed to help developers create smart music services.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-36547\" title=\"ffcd603fj00sx3wkh003bd000sg0068p\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/05\/ffcd603fj00sx3wkh003bd000sg0068p.jpg\" alt=\"ffcd603fj00sx3wkh003bd000sg0068p\" width=\"1024\" height=\"224\" \/><\/p>\n<p>Yandex collected in ten months<strong>Nearly 28 million monthly Yandex Music subscribers<\/strong>The data, specifically the user<strong>4.79 billion interactions with 9.39 million songs<\/strong>, the dataset includes key feedback from listeners on the goodness or badness of the song, and all interactions are<strong>With timestamp<\/strong>to improve accuracy.<\/p>\n<p>Yambda at Hugging Face\u00a0<strong>Three sizes of data sets are available<\/strong>For free download: Yambda-5B (behavior from 1 million users), Yambda-500M (100,000 users) and Yambda-50M (10,000 users). Among the sizes<strong>largest<\/strong>\u00a0<strong>5B dataset requires a minimum of 85 GB of storage space<\/strong>.<\/p>\n<p>The dataset contains information about the preferences of music listeners, the<strong>Stored in Apache Parquet format<\/strong>The company's music recommendations can be used for research purposes or to develop AI music recommendation features similar to those offered by streaming services such as Spotify.<\/p>\n<p>Streaming services such as Spotify and Tidal don't typically release the code or models for their music recommendation algorithms because the ability to recommend a listener's favorite song is seen as a trade secret to their success, 1AI understands.<\/p>","protected":false},"excerpt":{"rendered":"<p>Russian search engine giant Yandex on May 30 released Yambda, the world's largest open source dataset for music recommendation systems, containing 4.79 billion anonymized user interactions designed to help developers create smart music services. Yandex collected data from nearly 28 million monthly Yandex Music users over a ten month period - 4.79 billion interactions with 9.39 million songs - and the dataset includes key feedback from listeners on what they like and dislike about songs, with all interactions timestamped to improve accuracy. Yambda offers three sizes of datasets for free download at Hugging Face: Yambda-5B (behavior from 1 million users)<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[6801,3022,3355],"collection":[],"class_list":["post-36546","post","type-post","status-publish","format-standard","hentry","category-news","tag-yambda","tag-yandex","tag-3355"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/36546","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=36546"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/36546\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=36546"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=36546"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=36546"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=36546"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}