{"id":7417,"date":"2024-04-08T09:48:58","date_gmt":"2024-04-08T01:48:58","guid":{"rendered":"https:\/\/www.1ai.net\/?p=7417"},"modified":"2024-04-08T09:48:58","modified_gmt":"2024-04-08T01:48:58","slug":"%e7%ba%bd%e7%ba%a6%e6%97%b6%e6%8a%a5%e6%8c%87%e8%b4%a3openai%e3%80%81%e8%b0%b7%e6%ad%8c%e5%92%8cmeta%e7%bb%95%e8%bf%87%e6%b3%95%e5%be%8b%e8%be%b9%e7%95%8c%e8%bf%9b%e8%a1%8cai%e8%ae%ad%e7%bb%83","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/7417.html","title":{"rendered":"The New York Times accuses OpenAI, Google, and Meta of skirting legal boundaries for AI training data"},"content":{"rendered":"<p>According to a report by <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e7%ba%bd%e7%ba%a6%e6%97%b6%e6%8a%a5\" title=\"[Sees articles with labels of the New York Times]\" target=\"_blank\" >The New York Times<\/a>, <a href=\"https:\/\/www.1ai.net\/en\/tag\/openai\" title=\"[View articles tagged with [OpenAI]]\" target=\"_blank\" >OpenAI<\/a>, <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%b0%b7%e6%ad%8c\" title=\"[View articles tagged with [Google]]\" target=\"_blank\" >Google<\/a>, and <a href=\"https:\/\/www.1ai.net\/en\/tag\/meta\" title=\"[View articles tagged with [Meta]]\" target=\"_blank\" >Meta<\/a> have been accused of improper behavior in training <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e4%ba%ba%e5%b7%a5%e6%99%ba%e8%83%bd%e6%a8%a1%e5%9e%8b\" title=\"_Other Organiser\" target=\"_blank\" >artificial intelligence models<\/a>.<\/p>\n<p>The New York Times report states that OpenAI used a speech recognition tool called Whisper to transcribe audio from YouTube videos, and that OpenAI employees allegedly discussed how doing so might violate the video site&#039;s rules. 
OpenAI ultimately transcribed more than 1 million hours of YouTube videos, with help from OpenAI President Greg Brockman, and the transcripts were used to train the GPT-4 model.<\/p>\n<p class=\"article-content__img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-7418\" title=\"202311231146402911_4\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/202311231146402911_4.jpg\" alt=\"202311231146402911_4\" width=\"1000\" height=\"752\" \/><\/p>\n<p>Source Note: The image was generated by AI and is licensed from Midjourney.<\/p>\n<p>The report also said that Meta had considered acquiring publisher Simon &amp; Schuster to obtain long-form works for training AI. Meta also discussed &quot;collecting copyrighted data from the Internet, even if it might face litigation&quot;, and believed that &quot;negotiating licenses with publishers, artists, musicians and the news industry would take too long.&quot; Google was accused of transcribing YouTube videos to obtain text for AI model training, which the New York Times said &quot;probably&quot; violated the copyright of the videos. The report also said that Google modified its terms to allow scraping of publicly available Google Docs, restaurant reviews on Google Maps, and other online content for training AI.<\/p>\n<p>The New York Times seems to be trying to paint a dire picture of mass infringement, but generally avoids saying so directly. The discussions it describes are reasonable ones that any company developing AI should have in order to treat others well and comply with the law. AI companies are doing exactly that, using data fairly, which is at the heart of OpenAI\u2019s defense against the New York Times lawsuit. 
The story did not disclose that the New York Times is suing OpenAI until 17 paragraphs in, which makes the article read like an attack on a company the paper considers an adversary.<\/p>\n<p>The New York Times report has sparked discussion about the legality and ethics of how AI companies obtain training data, and has highlighted the challenges and controversies the AI industry faces in data acquisition.<\/p>","protected":false},"excerpt":{"rendered":"<p>According to the New York Times, OpenAI, Google and Meta have been accused of improper behavior in training artificial intelligence models. The NYT report states that OpenAI used a speech recognition tool called Whisper to transcribe audio from YouTube videos, and that OpenAI employees allegedly discussed the possibility that this behavior might violate the video site's rules. OpenAI ultimately transcribed more than 1 million hours of YouTube video, assisted by OpenAI President Greg Brockman, and these transcriptions were used to train the GPT-4 model. 
Source Note: Image generated by AI, image licensed from provider Midjourney The report also said that Meta had considered acquiring<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[297,190,599,794,281],"collection":[],"class_list":["post-7417","post","type-post","status-publish","format-standard","hentry","category-news","tag-meta","tag-openai","tag-599","tag-794","tag-281"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/7417","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=7417"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/7417\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=7417"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=7417"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=7417"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=7417"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}