{"id":22138,"date":"2024-10-28T09:44:31","date_gmt":"2024-10-28T01:44:31","guid":{"rendered":"https:\/\/www.1ai.net\/?p=22138"},"modified":"2024-10-28T09:44:31","modified_gmt":"2024-10-28T01:44:31","slug":"openai-%e8%af%ad%e9%9f%b3%e8%bd%ac%e5%86%99%e5%b7%a5%e5%85%b7-whisper-%e8%a2%ab%e6%9b%9d%e5%ad%98%e5%9c%a8%e9%87%8d%e5%a4%a7%e7%bc%ba%e9%99%b7%ef%bc%9a%e4%bc%9a%e5%87%ad%e7%a9%ba%e7%94%9f%e6%88%90","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/22138.html","title":{"rendered":"OpenAI speech transcription tool Whisper revealed to have a major flaw: generates large pieces of fake content out of thin air"},"content":{"rendered":"<p>On the 27th of local time, according to the Associated Press, more than a dozen software engineers, developers and academic researchers claimed that the<a href=\"https:\/\/www.1ai.net\/en\/tag\/openai\" title=\"[View articles tagged with [OpenAI]]\" target=\"_blank\" >OpenAI<\/a> of<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%af%ad%e9%9f%b3%e8%bd%ac%e5%86%99%e5%b7%a5%e5%85%b7\" title=\"[Sees articles with tags]\" target=\"_blank\" >speech transcription tool<\/a> <a href=\"https:\/\/www.1ai.net\/en\/tag\/whisper\" title=\"_Other Organiser\" target=\"_blank\" >Whisper<\/a> There is a major flaw: sometimes it is generated out of thin air<strong>Large paragraphs or even whole sentences of falsehoods<\/strong>.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-22139\" title=\"736c7812j00sm1m5200f0d000v900pzp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/10\/736c7812j00sm1m5200f0d000v900pzp.jpg\" alt=\"736c7812j00sm1m5200f0d000v900pzp\" width=\"1125\" height=\"935\" \/><\/p>\n<p>These experts noted that these generated texts (note: commonly referred to in the industry as \"AI hallucinations\") could involve<strong>Racial remarks, violent language, even fabricated medical advice<\/strong>.<\/p>\n<p>Experts found this issue to be particularly worrisome because Whisper is already being used in a wide range of industries around the world, including for translating and transcribing interviews, generating common consumer tech text, and creating video subtitles.<\/p>\n<p>Even more risky is the fact that while OpenAI<strong>\u00a0<\/strong><strong>Alerted<\/strong>The tool should not be used in \"high-risk areas\", but some health-care organizations<strong>Still in a hurry to adopt<\/strong>Whisper-based tool to document physician-patient consultations.<\/p>\n<p>According to the report, researchers and engineers often encounter Whisper's \"hallucinations\" in their work, and the full scale of the problem is unclear. For example, a researcher at the University of Michigan studying public meetings found that before attempting to improve the model, he examined the<strong>Ten audio transcriptions<\/strong>Middle.<strong>Eight contain fiction<\/strong>.<\/p>\n<p>A machine learning engineer revealed that in the more than 100 hours of Whisper transcripts he initially analyzed, he found that<strong>about half<\/strong>One developer further noted that of the 26,000 transcripts he had generated with Whisper, the content was \"hallucinatory\". One developer further noted that of the 26,000 transcripts he generated using Whisper<strong>Nearly every<\/strong>All with fictional content.<\/p>\n<p>Even brief audio samples with good sound quality are not immune to these problems. Recent research by computer scientists has shown that they review the\u00a0<strong>13,000+ segments<\/strong>In the clear audio clip, the<strong>There are 187 paragraphs<\/strong>The phenomenon of \"hallucinations\".<\/p>\n<p>The researchers believe that this trend means that tens of thousands of incorrect transcriptions will likely occur in millions of recordings.<\/p>\n<p>A spokesperson for OpenAI said the company continues to research ways to reduce hallucinations and thanked the researchers for their findings and will incorporate feedback in model updates.<\/p>","protected":false},"excerpt":{"rendered":"<p>More than a dozen software engineers, developers, and academic researchers say OpenAI's voice transcription tool Whisper has a major flaw: it sometimes generates large paragraphs or even entire sentences of false content out of thin air, according to a report by the Associated Press on 27 May. The experts note that the generated text, commonly referred to in the industry as \"AI hallucinations,\" can involve racial remarks, violent language, and even fabricated medical advice. This is particularly worrisome, according to the experts, because Whisper is already being used in a variety of industries around the world, including for translating and transcribing interviews, generating common consumer tech text, and creating video subtitles. Even more risky is the fact that, despite OpenAI's warnings that the tool should not be used in \"high-risk areas,\" some<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[190,871,4765],"collection":[],"class_list":["post-22138","post","type-post","status-publish","format-standard","hentry","category-news","tag-openai","tag-whisper","tag-4765"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/22138","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=22138"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/22138\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=22138"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=22138"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=22138"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=22138"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}