{"id":17905,"date":"2024-08-14T09:35:05","date_gmt":"2024-08-14T01:35:05","guid":{"rendered":"https:\/\/www.1ai.net\/?p=17905"},"modified":"2024-08-14T09:35:05","modified_gmt":"2024-08-14T01:35:05","slug":"%e8%b0%b7%e6%ad%8c%e5%8f%91%e5%b8%83-gemini-live%ef%bc%9a%e6%94%af%e6%8c%81-ai%e8%af%ad%e9%9f%b3%e8%81%8a%e5%a4%a9%ef%bc%8c%e5%8f%af%e6%a8%a1%e6%8b%9f%e9%9d%a2%e8%af%95%e5%9c%ba%e6%99%af%e3%80%81","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/17905.html","title":{"rendered":"Google releases Gemini Live: AI voice chat that can simulate interview scenarios and suggest presentation skills"},"content":{"rendered":"<p><a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%b0%b7%e6%ad%8c\" title=\"[View articles tagged with [Google]]\" target=\"_blank\" >Google<\/a> used today's Pixel 9 series <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%89%8b%e6%9c%ba%e5%8f%91%e5%b8%83%e4%bc%9a\" title=\"[View articles tagged with [Phone launch event]]\" target=\"_blank\" >phone launch event<\/a> to <strong>announce <a href=\"https:\/\/www.1ai.net\/en\/tag\/gemini-live\" title=\"[View articles tagged with [Gemini Live]]\" target=\"_blank\" >Gemini Live<\/a>. The service is rolling out starting today, first to English-speaking Gemini Advanced subscribers.<\/strong><\/p>\n<p>Promoting natural, fluid conversational exchanges<\/p>\n<p>Google says Gemini Live provides a mobile conversational experience that lets users have free-flowing conversations with Gemini.<\/p>\n<p>Gemini Live is Google's counterpart to the recently launched Advanced Voice Mode in OpenAI's ChatGPT (in limited alpha testing), which uses an enhanced voice engine to enable more coherent, emotionally expressive, and realistic multi-turn conversations.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-17906\" title=\"c8a9ff4bj00si6pos000ad000ms00crm\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/08\/c8a9ff4bj00si6pos000ad000ms00crm.jpg\" alt=\"c8a9ff4bj00si6pos000ad000ms00crm\" width=\"820\" 
height=\"459\" \/><\/p>\n<p>Google says users can interrupt the chatbot while it's talking to ask follow-up questions, and the chatbot will adapt to the user's speech patterns in real time.<\/p>\n<p>An excerpt from Google's blog post (translated):<\/p>\n<blockquote>\n<ul>\n<li>With Gemini Live [in the Gemini app], users can talk to Gemini and choose from [10 new] natural-sounding voices for its responses.<\/li>\n<li>Users can even speak at their own pace, or interrupt mid-answer and ask clarifying questions, just as they would in a human conversation.<\/li>\n<\/ul>\n<\/blockquote>\n<p>Google demonstrated a Gemini Live scenario in which the user rehearses a conversation with a hiring manager (played here by the AI), and Gemini recommends presentation skills and suggests improvements.<\/p>\n<p>A Google spokesperson said:<\/p>\n<blockquote>\n<ul>\n<li>Live uses our Gemini Advanced model, which we've tweaked to make it more conversational. The model's large context window is used when users are having long conversations with Live.<\/li>\n<\/ul>\n<\/blockquote>\n<p>Multimodal input is not yet supported<\/p>\n<p>Gemini Live doesn't yet have one of the features Google showed off at I\/O: multimodal input.<\/p>\n<p>Google released a prerecorded video this past May showing Gemini Live seeing and reacting to a user's surroundings through photos and videos captured by a phone's camera, such as naming the parts of a broken bike or explaining what a section of code on a computer screen does.<\/p>\n<p>Google said multimodal input will be available \"later this year,\" but declined to give specifics.<\/p>","protected":false},"excerpt":{"rendered":"<p>At its launch event for the Pixel 9 line of phones today, Google announced the Gemini Live service, which will be available first to English-speaking Gemini Advanced subscribers starting today. 
Promoting natural, fluid conversational exchanges Google says Gemini Live provides a mobile conversational experience that lets users engage in free-flowing conversations with Gemini. Gemini Live can be seen as a counterpart to the recently launched Advanced Voice Mode in OpenAI's ChatGPT (in limited alpha testing), which uses an enhanced voice engine to enable more coherent, emotionally expressive, and immersive multi-turn conversations. Google<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[148,146],"tags":[3734,3997,3998,281],"collection":[],"class_list":["post-17905","post","type-post","status-publish","format-standard","hentry","category-headline","category-news","tag-ai","tag-gemini-live","tag-3998","tag-281"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/17905","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=17905"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/17905\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=17905"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=17905"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=17905"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=17905"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}