{"id":5811,"date":"2024-03-20T09:26:42","date_gmt":"2024-03-20T01:26:42","guid":{"rendered":"https:\/\/www.1ai.net\/?p=5811"},"modified":"2024-03-20T09:26:42","modified_gmt":"2024-03-20T01:26:42","slug":"%e9%98%bf%e9%87%8c%e5%a4%a7%e6%a8%a1%e5%9e%8b%e4%ba%a7%e5%93%81%e9%80%9a%e4%b9%89%e5%90%ac%e6%82%9f%e5%8d%87%e7%ba%a7%ef%bc%9a%e8%b6%85%e9%95%bf%e8%a7%86%e9%a2%91%e8%87%aa%e7%94%b1","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/5811.html","title":{"rendered":"Alibaba&#039;s big model product &quot;Tongyi Tingwu&quot; has been upgraded: ultra-long videos with free questions and mind mapping"},"content":{"rendered":"<p data-vmark=\"7a3c\">March 19,<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e9%98%bf%e9%87%8c\" title=\"[View articles tagged with [Ali]]\" target=\"_blank\" >Ali<\/a>Large Model Products<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e9%80%9a%e4%b9%89%e5%90%ac%e6%82%9f\" title=\"[Sees articles with [broadcast] labels]\" target=\"_blank\" >Listening to the general meaning<\/a>&quot;Released a number of new features, including the online audio and video question-and-answer assistant &quot;Xiaowu&quot;, one-click AI rewriting, mind map generation and other six major functions.<\/p>\n<p data-vmark=\"272e\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-5813\" title=\"08845253-50f5-4112-88c8-0b407b7b030e\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/03\/08845253-50f5-4112-88c8-0b407b7b030e.png\" alt=\"08845253-50f5-4112-88c8-0b407b7b030e\" width=\"1440\" height=\"811\" \/><\/p>\n<p data-vmark=\"b247\">Listen to the common meaning<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%85%a5%e9%80%9a%e4%b9%89%e5%8d%83%e9%97%ae%e5%a4%a7%e6%a8%a1%e5%9e%8b\" title=\"[Sees articles with labels]\" target=\"_blank\" >Iruto Yoshikichi large model<\/a>, integrating more than ten AI functions, including transcription, translation, role separation, full-text summary, chapter overview, speech summary, PPT extraction, etc., and supports marking key points and taking notes.<\/p>\n<p data-vmark=\"ac86\">Tongyi Tingwu has added six new features in this upgrade, the most important of which is the audio and video question-and-answer assistant &quot;Xiaowu&quot;, which allows key information to be &quot;asked&quot; directly. Xiaowu uses multi-language query processing, long-length text understanding, instruction evolution framework optimization, and retrieval enhancement generation algorithms to achieve single-record, cross-record, and multi-language free question-and-answer for ultra-long audio and video for the first time in the industry. The length and number of audio and video files that support content question-and-answer have exceeded the industry&#039;s upper limit.<\/p>\n<p data-vmark=\"0da0\">Users can not only call Xiaowu on a single record page, ask any questions about audio and video up to 6 hours and 6G in size, or directly ask Xiaowu to sort out golden sentences, sort out conclusions, and write meeting minutes; they can also ask questions about all user records on the homepage, supporting one-time scanning and understanding of hundreds of audio and video content; they can also ask questions in Chinese about English videos, and Xiaowu will directly give Chinese answers, eliminating the need for translation. As an AI that &quot;understands you&quot;, Xiaowu can also intelligently recommend questions.<\/p>\n<p data-vmark=\"5a8e\">In response to user needs, Tongyi Tingwu has also launched new capabilities such as one-click AI rewriting and mind map generation. For example, one-click AI rewriting converts spoken language into written expression, which is especially suitable for organizing interviews; mind maps are automatically generated, supporting up to five levels of xmind brain maps, which is suitable for podcast summaries.<\/p>\n<p data-vmark=\"6f23\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-5812\" title=\"99ac1925-a46a-4ae6-a6f8-614d1ba09895\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/03\/99ac1925-a46a-4ae6-a6f8-614d1ba09895.png\" alt=\"99ac1925-a46a-4ae6-a6f8-614d1ba09895\" width=\"1440\" height=\"1001\" \/><\/p>\n<p>\u25b2 Example of a mind map for general understanding<\/p>\n<p data-vmark=\"4d69\">The product details experience has also been further upgraded, including support for one-click insertion of video timestamps and screenshots in notes, and automatic recognition of the language of audio and video files.<\/p>\n<p data-vmark=\"60f9\">In addition, Tongyi Tingwu launched the &quot;University Charity Plan&quot;, all teachers and students of universities in mainland China passed the suffix\u00a0<span class=\"link-text-start-with-http\">edu.cn<\/span>\u00a0After the educational email address is authenticated, everyone can directly receive 500 hours of transcription time and the storage space will be expanded from 20G to 200G.<\/p>\n<p data-vmark=\"6250\">According to official introduction, as the first large-scale model product open to public beta in China, Tongyi Tingwu has accumulated millions of users since its release in June last year, including students, teachers, white-collar workers, reporters, lawyers, financial analysts and other groups. Active users transcribe audio and video more than 3 times a day on average, and the platform processes about 2 billion characters every day.<\/p>","protected":false},"excerpt":{"rendered":"<p>On March 19, Ali's big model product \"Tongyi Listening\" released a number of new features, on-line audio and video Q&amp;A assistant \"Xiaowu\", one-key AI rewriting, mind map generation and other six major functions. Tongyi Listening and Wisdom is connected to the big model of Tongyi Q&amp;A, and integrates more than ten AI functions, including rewriting, translation, role separation, full-text summary, chapter summary, speech summary, PPT extraction, etc., and supports marking the key points and note-taking. The upgrade of Tongyi Listening has six new features, the most important of which is the audio-video Q&amp;A assistant \"Xiaowu\", which can directly \"ask\" for key information. Through multi-language query processing, long chapter text understanding, command evolution framework optimization and retrieval of enhanced generation algorithms, Xiaowu is the first in the industry to realize single-recorded, cross-recorded, multi-language free questioning of ultra-long audio and video.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[1760,335,1759],"collection":[],"class_list":["post-5811","post","type-post","status-publish","format-standard","hentry","category-news","tag-1760","tag-335","tag-1759"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/5811","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=5811"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/5811\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=5811"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=5811"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=5811"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=5811"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}