{"id":24771,"date":"2024-12-09T20:53:27","date_gmt":"2024-12-09T12:53:27","guid":{"rendered":"https:\/\/www.1ai.net\/?p=24771"},"modified":"2024-12-09T20:53:27","modified_gmt":"2024-12-09T12:53:27","slug":"%e4%b8%ad%e5%9b%bd%e7%a7%bb%e5%8a%a8%e8%81%94%e5%90%88%e7%a0%94%e5%8f%91-2d-%e6%95%b0%e5%ad%97%e4%ba%ba%e8%af%b4%e8%af%9d%e9%a9%b1%e5%8a%a8%e7%b3%bb%e7%bb%9f%ef%bc%9a%e5%8f%af%e7%94%9f%e6%88%90-7","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/24771.html","title":{"rendered":"China Mobile Jointly Develops 2D Digital Human Speech Driving System: 7 Emotions Generated for 5G New Calls, AI Customer Service, etc."},"content":{"rendered":"<p><a href=\"https:\/\/www.1ai.net\/en\/tag\/%e4%b8%ad%e5%9b%bd%e7%a7%bb%e5%8a%a8\" title=\"[See articles with [China Moves] labels]\" target=\"_blank\" >China Mobile<\/a> On December 8, it was announced that a joint team from Nanjing University had developed the<strong>High Fidelity 2D <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%95%b0%e5%ad%97%e4%ba%ba\" title=\"[View articles tagged with [digital people]]\" target=\"_blank\" >Digital Human<\/a>Speech drive system<\/strong>.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-24772\" title=\"367cbc8cj00so893w005hd000u000avp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/12\/367cbc8cj00so893w005hd000u000avp.jpg\" alt=\"367cbc8cj00so893w005hd000u000avp\" width=\"1080\" height=\"391\" \/><\/p>\n<p>As a communications operator with the world's largest number of users, China Mobile's annual customer service operating costs are huge. Now widely popularized intelligent voice customer service can complete a certain amount of business automatic response tasks, but still less than artificial customer service face to face, one-to-one star service experience.<\/p>\n<p>Aiming at the pain points of the actual business, China Mobile's JiuTian vision team joined hands with Tai Ying's team from Nanjing University to develop a high-fidelity 2D digital human speech driving system.<strong>Aims to provide users with a digital human broadcasting dialog service with natural expressions, lip-synchronized voice and harmonious head posture<\/strong>, which can be applied to intelligent customer service, education and training, advertising and marketing scenarios.<\/p>\n<p>According to the official introduction of China Mobile, the 2D Digital Human Speaking Driving System realizes the generation of a video stream of the target character's speech synchronized with the audio based on the given target character's photo or video and any piece of audio. The character in the generated video is required to have a high degree of realism, natural expression and gesture, and at the same time, it needs to have a high degree of real-time, and can do with the language model, audio synthesis capabilities of the organic integration, to build up the character of the digital stand-in.<\/p>\n<p>The high-fidelity 2D digital human speech drive system developed by China Mobile's JiuTian vision team in conjunction with Nanjing University has carried out technological attacks and program innovations in the following three areas:<\/p>\n<ul>\n<li>First, the performance is real-time: compared with the previous digital human methods, it has reached the leading level in the academic world in terms of the mouth generation technology for real-time broadcasting.<strong>Supports Chinese and English digital demographic drivers to achieve real-time performance of 30ms\/frame while maintaining effects<\/strong>.<\/li>\n<li>Second, the effect is leading the way: the development of a two-stage learning framework that disassembles the digital human speech drive into:<strong>Two parts: from audio to mouth coefficients and from mouth coefficients to generated portrait<\/strong>, reducing the difficulty of learning and achieving better generation.<\/li>\n<li>Third, emotional control: introduction of an emotionally guided learning module.<strong>Supports the ability to generate 7 mainstream emotions: normal, smile, surprise, anger, fear, sadness, etc.<\/strong>that empowers the generated announcer with the ability to express human emotions.<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-24773\" title=\"2f99fb5bj00so894900bhd000u000ebp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/12\/2f99fb5bj00so894900bhd000u000ebp.jpg\" alt=\"2f99fb5bj00so894900bhd000u000ebp\" width=\"1080\" height=\"515\" \/><\/p>\n<p>1AI was officially informed by China Mobile that the digital human generation technology has realized end-to-end two-stage 30 FPS real-time generation performance.<strong>And supports 512*512 face area generation.<\/strong>It also has the ability to generate 7 mainstream emotions, such as happiness and sadness, at the same time.<\/p>\n<p>In terms of the VoxCeleb metrics, the technology achieved a 4.3 LMD (LandMark Distance) for mouthing accuracy and an 11.1 FID for naturalness of generation.<\/p>\n<p>China Mobile officials said that the application of the research and development results is promising, effectively reducing the threshold of creation and improving the visual quality of the generated characters.<strong>Enabled and upgraded the expansion of the 5G new call, and message secretary brand business<\/strong>.<\/p>","protected":false},"excerpt":{"rendered":"<p>ON DECEMBER 8TH, CHINA MOBILE ANNOUNCED THAT THE UNITED NANJING UNIVERSITY TEAM WAS DEVELOPING A 2D DIGITAL PERSON TALK DRIVE. AS A COMMUNICATIONS OPERATOR WITH THE NUMBER OF FIRST USERS ON A WORLD SCALE, THE ANNUAL RUNNING COSTS OF MOBILE CUSTOMER SERVICES IN CHINA ARE ENORMOUS. THE SMART VOICE SERVICE, WHICH IS NOW WIDELY AVAILABLE, ALTHOUGH ABLE TO PERFORM SOME BUSINESS AUTOMATIC RESPONSE TASKS, STILL FALLS SHORT OF A ONE-ON-ONE, FACE-TO-FACE, ONE-ON-ONE SERVICE EXPERIENCE. IN RESPONSE TO THE PAIN OF THE PHYSICAL PRESENCE OF THE BUSINESS, CHINA MOVED A NINE-DAY VISUAL TEAM TO WORK WITH THE NANJING UNIVERSITY ' S WINNING TEAM TO DEVELOP A 2D DIGITAL PERSON TALK-DRIVEN SYSTEM DESIGNED TO PROVIDE USERS WITH VOICE-BASED, VOICE-SYNCHRONOUS AND HEAD-TO-FACE CHAT SERVICES THAT CAN BE USED IN SMART GUEST SERVICES, EDUCATION TRAINING, ADVERTISING, ETC. ACCORDING TO CHINA MOBILE OFFICIAL, 2D DIGITAL PERSON<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[2072,1252],"collection":[],"class_list":["post-24771","post","type-post","status-publish","format-standard","hentry","category-news","tag-2072","tag-1252"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/24771","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=24771"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/24771\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=24771"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=24771"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=24771"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=24771"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}