{"id":43084,"date":"2025-09-15T11:24:53","date_gmt":"2025-09-15T03:24:53","guid":{"rendered":"https:\/\/www.1ai.net\/?p=43084"},"modified":"2025-09-15T11:24:53","modified_gmt":"2025-09-15T03:24:53","slug":"stable-audio-2-5-%e4%bc%81%e4%b8%9a%e7%ba%a7%e9%9f%b3%e9%a2%91%e7%94%9f%e6%88%90-ai-%e6%a8%a1%e5%9e%8b%e5%8f%91%e5%b8%83%ef%bc%8c%e5%8f%b7%e7%a7%b03-%e5%88%86%e9%92%9f%e6%9b%b2%e7%9b%ae-2","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/43084.html","title":{"rendered":"Stable Audio 2.5 Enterprise Audio Generation AI Model Release, called \"3 Minute Track 2 seconds completed\""},"content":{"rendered":"<p>September 14th.<a href=\"https:\/\/www.1ai.net\/en\/tag\/stability-ai\" title=\"_Other Organiser\" target=\"_blank\" >Stability AI<\/a> The enterprise level has been officially released<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e9%9f%b3%e9%a2%91%e7%94%9f%e6%88%90%e6%a8%a1%e5%9e%8b\" title=\"[Sees articles with tags]\" target=\"_blank\" >Audio Generation Model<\/a> Stable Audio 2.5, increased relative to the previous generation, mainly in terms of audio detail, production speed, with the name \"Only 2 seconds to create 3 minutes<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e9%9f%b3%e9%a2%91%e6%9b%b2%e7%9b%ae\" title=\"[See articles with tags]\" target=\"_blank\" >Audio Track<\/a>\u201d.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-43085\" title=\"527c1eej00t2m1ga00mvd000v90hkp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/09\/527c1eeej00t2m1ga00mvd000v900hkp.jpg\" alt=\"527c1eej00t2m1ga00mvd000v90hkp\" width=\"1125\" height=\"632\" \/><\/p>\n<p>According to a presentation, the core improvements of Stable Audio 2.5 focus on the ability to produce music, which is said to produce results that are more in tune with the actual chorus logic<strong>A full multi-band structure with foreplay, development and end<\/strong>I don't know. At the same time, the new model is more accurate in its understanding of the hints, particularly in terms of emotional description and soundness of the music-style vocabulary, and more responsive to expectations\u3002<\/p>\n<p>In addition, the new version of the model has significantly improved the speed of audio generation, which, according to Stability AI, is largely the result of a post-training approach proposed by the R &amp; D team ARC (Note: Adversarial Relativistic-Contrastive), a technique that accelerates the production of the proliferation model by combining a relativist training and contrastor<strong>A SIGNIFICANT REDUCTION IN THE GPU REASONING TIME-CONSUMING WHILE ENSURING THE QUALITY OF THE TRACK CAN RESULT IN THE GENERATION OF AUDIO CONTENT OF UP TO 3 MINUTES IN 2 SECONDS<\/strong>.<\/p>\n<p>In addition to this, Stable Audio 2.5 has added an audio patch that allows users to import their own audio files and assigns a \"extend position\"<strong>The model allows a \"extend\" to the audio, depending on the content of the sound and the overall curve, which is particularly suitable for a scenario such as a clip<\/strong>.<\/p>\n<p>At present, Stable Audio 2.5 has been tested directly through the StableAudio network, while supporting localization. Officially, however, the audio files uploaded by users should not contain copyrighted content, and the StableAudio website will be tested using its own content identification system to ensure that copyrights are not violated\u3002<\/p>","protected":false},"excerpt":{"rendered":"<p>On the 14th of September, Stability AI has officially released the Enterprise Audio Generation Model Stable Audio 2.5, which, relative to the previous generation, has been upgraded to focus on audio detail and production speed, stating that \u201cit takes only two seconds to create three minutes of audio. It was described that the core improvements of Stable Audio 2.5 focused on the ability to produce music, which was described as producing results that were more accompanied by the actual chorus logic and presented a full multi-part structure of foreplay, development and end. At the same time, the new model is more accurate in its understanding of the hints, particularly in terms of emotional description and soundness of the music-style vocabulary, and more responsive to expectations. In addition, the new version of the model has significantly improved the speed of audio generation, Stability AI<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[718,7593,3681],"collection":[],"class_list":["post-43084","post","type-post","status-publish","format-standard","hentry","category-news","tag-stability-ai","tag-7593","tag-3681"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/43084","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=43084"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/43084\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=43084"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=43084"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=43084"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=43084"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}