{"id":16910,"date":"2024-08-01T09:13:53","date_gmt":"2024-08-01T01:13:53","guid":{"rendered":"https:\/\/www.1ai.net\/?p=16910"},"modified":"2024-08-01T09:13:53","modified_gmt":"2024-08-01T01:13:53","slug":"gpt-4o%e8%af%ad%e9%9f%b3%e5%8a%9f%e8%83%bd%e5%bc%80%e5%90%af%e7%81%b0%e5%ba%a6%e6%b5%8b%e8%af%95-%e4%b8%8d%e4%bb%85%e8%83%bd%e8%ae%b2%e7%ac%91%e8%af%9d%e3%80%81%e5%ad%a6%e7%8c%ab%e5%8f%ab%e8%bf%98","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/16910.html","title":{"rendered":"GPT-4o voice function has been turned on for grayscale testing. It can not only tell jokes and imitate cat meows, but also help practice oral English"},"content":{"rendered":"<p data-pm-slice=\"0 0 []\">Scenarios from the sci-fi movie Her seem to be coming into reality.<a href=\"https:\/\/www.1ai.net\/en\/tag\/gpt-4o\" title=\"[View articles tagged with [GPT-4o]]\" target=\"_blank\" >GPT-4o<\/a>of<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%af%ad%e9%9f%b3%e5%8a%9f%e8%83%bd\" title=\"[Sees articles with [voice function] labels]\" target=\"_blank\" >Voice Function<\/a>It's finally on.<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e7%81%b0%e5%ba%a6%e6%b5%8b%e8%af%95\" title=\"[Sees articles with [grey test] labels]\" target=\"_blank\" >Grayscale test<\/a>Some ChatGPT Plus users have experienced this exciting new feature first. OpenAI\u2019s innovation not only allows AI to talk jokes, learn cat barks, but also helps to practice speech as a \u201cdiverse coach\u201d\u3002<\/p>\n<p data-track=\"48\">The GPT-4o voice mode brings a more natural, real-time conversation experience. Users can interrupt AI at will, and it can even sense and respond to user emotions. It is expected that this will be available to all ChatGPT Plus users this fall. It is more expected that video and screen sharing will also be rolled out in the near future, when users can \u201cface-to-face\u201d communication with ChatGPT\u3002<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-16911\" title=\"get-6\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/08\/get-6.jpg\" alt=\"get-6\" width=\"719\" height=\"395\" \/><\/div>\n<p data-track=\"49\">The output capacity of GPT-4o has also been dramatically increased. The number of output tokens for the new model has skyrocketed from 4,000 to 64,000, which means that the equivalent of four full-length movie scripts can be obtained at once.OpenAI quietly rolled out this beta version of the new model, gpt-4o-64k-output-alpha, in its official webpage.<\/p>\n<p data-track=\"50\">To ensure security and quality, OpenAI has been rigorously testing the GPT-4o voice feature for the past few months. With more than 100 red teamers, they tested 45 languages and trained the model to speak using only four preset voices to protect user privacy. In addition, content filtering was essential, and the team took steps to block the generation of violent and copyright-related content.<\/p>\n<p data-track=\"51\">Netizens have tested the GPT-4o Voice Mode with impressive results. Some people found that it can answer questions quickly and with almost no delay; some people used it to imitate different voices and accents; others let it act as a soccer match commentator and even tell stories vividly in Chinese. These cases demonstrate the power of GPT-4o in speech recognition and generation.<\/p>\n<p data-track=\"52\">It is worth noting that although OpenAI claims that video and screen sharing will be available later, some netizens have experienced these in advance. For example, one of the netizens showed ChatGPT his little nest for the new pet cat, and ChatGPT looked at the post-evaluation \"must be very comfortable\" and asked about cats with interest\u3002<\/p>\n<p data-track=\"53\">In addition, the long output feature of GPT-4o has quietly gone live.OpenAI officially announced the availability of the GPT-4o Alpha version to beta testers, which supports up to 64K tokens per request, equivalent to a 200-page novel. This feature was introduced based on user demand for longer output content.<\/p>\n<p data-track=\"54\">However, longer outputs also imply higher computation and price.GPT-4o Long Output is priced at $6 per million input tokens and $18 per million output tokens, which is an increase compared to the previous model. Nonetheless, some researchers believe that Long Output is mainly used for use cases such as data conversion and is very helpful in scenarios such as writing code and improving writing.<\/p>\n<p data-track=\"55\">Overall, GPT-4o's voice function and long output capability will undoubtedly bring users a richer and more convenient interactive experience. We have reason to believe that with the continuous progress of technology, AI will show its unique value in more fields.<\/p>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>The scene in science fiction film Her seems to be moving into reality. The GPT-4o voice function finally started the greyscale test, and some ChatGPT Plus users have experienced this exciting new feature first. OpenAI\u2019s innovation not only allows AI to talk jokes, learn cat barks, but also helps to practice speech as a \u201cdiverse coach\u201d. The GPT-4o voice mode brings a more natural, real-time conversation experience. Users can interrupt AI at will, and it can even sense and respond to user emotions. It is expected that this will be available to all ChatGPT Plus users this fall. More than expected, video and screen sharing will be rolled out in the near future, when users can work with ChatGP<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[2582,705,3733],"collection":[],"class_list":["post-16910","post","type-post","status-publish","format-standard","hentry","category-news","tag-gpt-4o","tag-705","tag-3733"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16910","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=16910"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16910\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=16910"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=16910"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=16910"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=16910"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}