{"id":1864,"date":"2023-12-12T09:25:29","date_gmt":"2023-12-12T01:25:29","guid":{"rendered":"https:\/\/www.1ai.net\/?p=1864"},"modified":"2023-12-12T09:25:29","modified_gmt":"2023-12-12T01:25:29","slug":"%e8%b0%b7%e6%ad%8c%e5%89%af%e6%80%bb%e8%a3%81-sissie-hsiao%ef%bc%9agemini-ai-%e6%bc%94%e7%a4%ba%e8%a7%86%e9%a2%91%e3%80%8c%e5%ae%8c%e5%85%a8%e7%9c%9f%e5%ae%9e%e3%80%8d%ef%bc%8c%e5%b0%bd%e7%ae%a1","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/1864.html","title":{"rendered":"Google VP Sissie Hsiao: Gemini AI demo video is &quot;completely real,&quot; even though Google &quot;shortened some parts for brevity&quot;"},"content":{"rendered":"<p>In the increasingly fierce competition in the generative AI market,<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%b0%b7%e6%ad%8c\" title=\"[View articles tagged with [Google]]\" target=\"_blank\" >Google<\/a>Recently launched its large language model\u00a0<strong><a href=\"https:\/\/www.1ai.net\/en\/tag\/gemini\" title=\"[View articles tagged with [Gemini]]\" target=\"_blank\" >Gemini<\/a><\/strong>\u00a0However,<strong>The controversy over the authenticity of the video subsequently attracted widespread attention<\/strong>.<\/p>\n<p class=\"article-content__img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1865\" title=\"202312070835429226_0-1\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2023\/12\/202312070835429226_0-1.jpg\" alt=\"202312070835429226_0-1\" width=\"1000\" height=\"563\" \/><\/p>\n<p>The demo video released by Google shows the multimodal capabilities of the Gemini model, which can cleverly interpret and process information from real-time video and audio. This is a major achievement for Google, especially in the fierce competition with rivals such as OpenAI. However, according to Bloomberg, the demo video was actually<strong>Made using still image frames from a video and text prompts<\/strong>, rather than the real-time voice and video processing that seems to be achieved.<\/p>\n<p>Sissie Hsiao, vice president and general manager of Google Assistant and Bard, discussed the controversial demo video at Fortune\u2019s Brainstorm AI conference in San Francisco, highlighting the standards Gemini has achieved as a model and how it will advance the development of Google\u2019s chatbot, Bard.<strong>\u201cThis video is completely real. All the prompts and model responses are real,\u201d Hsiao said. \u201cWe did shorten some parts for brevity, but that information is also explained in the video.\u201d<\/strong><\/p>\n<p>The demonstration video shows the multimodal capabilities of the new AI model, which recognizes a wavy line, then recognizes the curve of a new line, and finally draws a picture of a duck. Throughout the process, the model continues to recognize each element and provide facts and answers related to the duck in real time.<\/p>\n<p>Hsiao highlighted Gemini\u2019s achievements in a variety of benchmarks, ranging from high school physics to professional legal puzzles and ethical scenarios. According to The Verge, Gemini Ultra beat OpenAI\u2019s GPT-4 in 32 benchmarks, winning 30 tests in total, an achievement worth boasting about, even though Gemini Ultra won\u2019t be released until next year.<strong>Currently, Bard uses the less advanced Gemini Pro, which is roughly equivalent to GPT 3.5.<\/strong><\/p>\n<p>Hsiao said these Gemini models will continue to improve Google Search as well as the Google Bard chatbot, which she said is \u201cthe most advanced chatbot on the market today.\u201d<span class=\"spamTxt\">Most Popular<\/span>Free chatbot for .<\/p>","protected":false},"excerpt":{"rendered":"<p>In the context of the increasing production-oriented AI market competition, Google recently launched its large-language model Gemini ' s advance video. However, the controversy over the authenticity of the video subsequently generated widespread concern. Google's presentation video shows the multi-modular capability of the Gemini model to interpret and process information from real-time video and audio in a skilful manner. This is a major achievement for Google, especially in the fierce competition with competitors like OpenAI. However, according to Bloomberg, the presentation video was actually produced through \"static image frames using video, text tips\" rather than seemingly real-time voice and video processing. In San Francisco at the Brainstorm AI conference, Google<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[436,281],"collection":[],"class_list":["post-1864","post","type-post","status-publish","format-standard","hentry","category-news","tag-gemini","tag-281"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/1864","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=1864"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/1864\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=1864"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=1864"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=1864"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=1864"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}