{"id":8058,"date":"2024-04-16T09:43:32","date_gmt":"2024-04-16T01:43:32","guid":{"rendered":"https:\/\/www.1ai.net\/?p=8058"},"modified":"2024-04-16T09:43:32","modified_gmt":"2024-04-16T01:43:32","slug":"sora%e5%b9%b3%e6%9b%bf%ef%bc%9f2%e5%88%86%e9%92%9f%e8%b6%85%e9%95%bfai%e8%a7%86%e9%a2%91%e6%a8%a1%e5%9e%8bstreamingt2v%e5%85%8d%e8%b4%b9%e5%bc%80%e6%ba%90-%e8%af%95%e7%8e%a9%e5%9c%b0%e5%9d%80%e5%85%ac","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/8058.html","title":{"rendered":"Sora alternative? StreamingT2V, a free and open-source AI model for 2-minute videos, releases trial links"},"content":{"rendered":"<p>Picsart AI Research and other teams recently released an <a href=\"https:\/\/www.1ai.net\/en\/tag\/ai%e8%a7%86%e9%a2%91%e6%a8%a1%e5%9e%8b\" title=\"[See articles with the [AI video model] label]\" target=\"_blank\" >AI video model<\/a> called <a href=\"https:\/\/www.1ai.net\/en\/tag\/streamingt2v\" title=\"[See articles with the [StreamingT2V] label]\" target=\"_blank\" >StreamingT2V<\/a>. The model can generate videos of up to 1,200 frames (2 minutes), surpassing the highly regarded Sora model in video length.
The release of StreamingT2V is not only a breakthrough in video length; it is also a <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%85%8d%e8%b4%b9%e5%bc%80%e6%ba%90\" title=\"[See articles with the [free open source] label]\" target=\"_blank\" >free and open-source<\/a> project that is compatible with models such as SVD and AnimateDiff, which matters for the development of the open-source ecosystem.<\/p>\n<p class=\"article-content__img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8059\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/6384877391847396472713764.png\" alt=\"\" width=\"323\" height=\"160\" \/><\/p>\n<p>Before Sora, video generation models on the market, such as Pika, Runway, and Stable Video Diffusion (SVD), could usually generate only a few seconds to a little over ten seconds of video. Sora set a new industry benchmark with its 60-second video generation capability. StreamingT2V not only goes beyond that duration but can, in theory, generate videos of unlimited length, which opens up more possibilities for video generation.<\/p>\n<p>StreamingT2V&#039;s architecture uses an autoregressive approach to create long videos with rich motion dynamics while maintaining temporal consistency and high frame-level image quality. Existing text-to-video diffusion models usually focus on high-quality short video generation and often suffer from quality degradation, stiff motion, or stagnation when extended to long videos.
StreamingT2V addresses these problems by introducing a conditional attention module (CAM) and an appearance preservation module (APM), together with a randomized blending approach.<\/p>\n<p>CAM acts as a short-term memory block: it conditions the generation of the current video chunk on the previous chunk through an attention mechanism, which yields consistent chunk transitions. APM acts as a long-term memory block: it extracts <span class=\"spamTxt\">high-level<\/span> scene and object features from the <span class=\"spamTxt\">first<\/span> video chunk to prevent the model from forgetting the initial scene. In addition, StreamingT2V uses a high-resolution text-to-video model to autoregressively enhance the generated videos, improving their quality and resolution.<\/p>\n<p>StreamingT2V is now open source on GitHub and available for free trial on Hugging Face. Although the server load may be high, users can try generating videos from text and image prompts. Hugging Face also shows some example results that demonstrate StreamingT2V&#039;s video generation capability.<\/p>\n<p>The release of StreamingT2V not only brings a technical advance to video generation but also gives the open-source community a powerful tool that helps promote the development and application of related technologies.
In the future, we may see more innovative applications built on such technologies in film production, game development, virtual world construction, and other fields.<\/p>\n<p>Paper address: https:\/\/arxiv.org\/pdf\/2403.14773.pdf<\/p>\n<p>Trial address 1: https:\/\/huggingface.co\/spaces\/PAIR\/StreamingT2V<\/p>\n<p>Trial address 2: https:\/\/replicate.com\/camenduru\/streaming-t2v<\/p>","protected":false},"excerpt":{"rendered":"<p>Recently, Picsart AI Research and other teams jointly released an AI video model called StreamingT2V, which is capable of generating videos of up to 1,200 frames and 2 minutes in length, surpassing the much-anticipated Sora model in video length. The release of StreamingT2V is not only a breakthrough in video length; it is also a free and open-source project that is compatible with models such as SVD and AnimateDiff, which is significant for the development of the open-source ecosystem. Before Sora, video generation models on the market such as Pika, Runway, Stable Video Diffusion (SVD), etc., could usually only generate a few seconds to a dozen
seconds<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[868,2242,2243],"collection":[],"class_list":["post-8058","post","type-post","status-publish","format-standard","hentry","category-news","tag-ai","tag-streamingt2v","tag-2243"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/8058","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=8058"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/8058\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=8058"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=8058"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=8058"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=8058"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}