{"id":4807,"date":"2024-03-03T09:11:55","date_gmt":"2024-03-03T01:11:55","guid":{"rendered":"https:\/\/www.1ai.net\/?p=4807"},"modified":"2024-03-03T09:11:55","modified_gmt":"2024-03-03T01:11:55","slug":"%e5%8a%a9%e8%a7%86%e9%9a%9c%e8%80%85%e7%9c%8b%e8%a7%81%e4%b8%96%e7%95%8c%ef%bc%8c%e5%a4%8d%e6%97%a6%e5%a4%a7%e5%ad%a6%e5%9b%a2%e9%98%9f%e7%a0%94%e5%8f%91%e7%9c%b8%e6%80%9d","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/4807.html","title":{"rendered":"To help the visually impaired &quot;see&quot; the world, the Fudan University team developed the &quot;Mousi&quot; large model and the &quot;Hear the World&quot; App"},"content":{"rendered":"<p data-vmark=\"1393\">according to<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%8d%e6%97%a6%e5%a4%a7%e5%ad%a6\" title=\"[Sees articles with labels]\" target=\"_blank\" >Fudan University<\/a>The official public account is made possible by the efforts of teachers and students of Fudan University Natural Language Processing Laboratory (FudanNLP).<span class=\"accentTextColor\">Based on multimodal<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%a7%e6%a8%a1%e5%9e%8b\" title=\"[View articles tagged with [large models]]\" target=\"_blank\" >Large Model<\/a>&quot;Fudan MouSi&quot; launches the &quot;Hear the World&quot; app tailored for the visually impaired<\/span>.<\/p>\n<p data-vmark=\"d99c\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-4808\" title=\"6de48a77-69fb-4e55-9793-30855e229a6b\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/03\/6de48a77-69fb-4e55-9793-30855e229a6b.png\" alt=\"6de48a77-69fb-4e55-9793-30855e229a6b\" width=\"759\" height=\"764\" \/><\/p>\n<p data-vmark=\"791b\">This system only needs a camera and a pair of headphones to convert images into language, and supports functions such as describing scenes and warning of risks. The &quot;Hear the World&quot; App can design three modes for the daily life needs of the visually impaired.<\/p>\n<ul class=\"list-paddingleft-2\">\n<li>\n<p data-vmark=\"501a\">Street Walking: In this mode,<span class=\"accentTextColor\">&quot;Mosi&quot; can scan road conditions in detail and indicate potential risks<\/span>.<\/p>\n<\/li>\n<li>\n<p data-vmark=\"4ce0\">Free Q&amp;A: It can help the visually impaired walk into museums, art galleries, and parks, capture every detail of the surrounding scenes, and use sound to build rich life scenes. The official demonstration picture shows that<span class=\"accentTextColor\">The app can also realize functions such as retelling TV screen content<\/span>.<\/p>\n<\/li>\n<li>\n<p data-vmark=\"5202\">Object Search: This mode provides the visually impaired with the function of finding everyday objects, and the official calls it a \u201creliable butler.\u201d<\/p>\n<\/li>\n<\/ul>\n<p data-vmark=\"a09e\">It is reported that the &quot;Hear the World&quot; App is expected to complete its first round of testing in March this year, and will simultaneously launch pilot projects in China&#039;s first- and second-tier cities and regions, and promote it based on the computing power deployment situation.<\/p>\n<p data-vmark=\"16a4\">Fudan University Natural Language Processing Laboratory (FudanNLP) previously developed the MOSS large model and announced its official open source release in April 2023.<span class=\"accentTextColor\">Became the first plug-in enhanced open source conversational language model in China<\/span>. Half a year later, the multimodal model &quot;Mosi&quot; was launched.<\/p>","protected":false},"excerpt":{"rendered":"<p>According to Fudan University's official public website, the \"Hear the World\" app for the visually impaired based on the multimodal big model \"FudanNLP\" (MouSi) has been launched through the efforts of students and faculty members of Fudan University's Natural Language Processing Laboratory (FudanNLP). The app is online. The system, which requires only a camera and a pair of headphones, translates images into language and supports functions such as describing scenarios and alerting people to risks. The \"Hear the World\" app can be used in three modes to meet the needs of the visually impaired in their daily lives. Walking on the street: In this mode, Eyes can scan the road in detail and alert potential risks. Free Question and Answer: It can help the visually impaired to enter museums, art galleries and parks, capture every detail of the surrounding scenery, and build rich life scenes with sound, as shown in the official demo.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[1461,216],"collection":[],"class_list":["post-4807","post","type-post","status-publish","format-standard","hentry","category-news","tag-1461","tag-216"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/4807","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=4807"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/4807\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=4807"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=4807"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=4807"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=4807"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}