{"id":16802,"date":"2024-07-31T09:34:27","date_gmt":"2024-07-31T01:34:27","guid":{"rendered":"https:\/\/www.1ai.net\/?p=16802"},"modified":"2024-07-31T09:34:27","modified_gmt":"2024-07-31T01:34:27","slug":"24-%e5%b0%8f%e6%97%b6%e6%8a%93%e5%8f%96%e7%99%be%e4%b8%87%e6%ac%a1%ef%bc%8canthropic-ai-%e5%85%ac%e5%8f%b8%e8%a2%ab%e6%8c%87%e8%bf%87%e5%ba%a6%e6%8a%93%e5%8f%96%e7%bd%91%e7%ab%99%e6%95%b0%e6%8d%ae","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/16802.html","title":{"rendered":"AI company Anthropic accused of excessively crawling website data, a million times in 24 hours"},"content":{"rendered":"<p data-track=\"1\" data-pm-slice=\"0 0 []\">On July 31, the Financial Times (FT) published a blog post stating that although the <a href=\"https:\/\/www.1ai.net\/en\/tag\/ai%e5%85%ac%e5%8f%b8\" title=\"[SEES ARTICLES WITH [AI] LABELS]\" target=\"_blank\" >AI company<\/a> <a href=\"https:\/\/www.1ai.net\/en\/tag\/anthropic\" title=\"[View articles tagged with [Anthropic]]\" target=\"_blank\" >Anthropic<\/a> claims to &quot;develop AI responsibly&quot;, <strong>its ClaudeBot <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%9c%ba%e5%99%a8%e4%ba%ba\" title=\"[Sees articles with [robots] labels]\" target=\"_blank\" >bot<\/a> has been excessively scraping <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e7%bd%91%e7%ab%99%e6%95%b0%e6%8d%ae\" title=\"_Other Organiser\" target=\"_blank\" >website data<\/a> to train the Claude large language model.<\/strong><\/p>\n<p data-track=\"2\">Although using web crawlers to scrape data is a common practice in the artificial intelligence industry, Anthropic has been criticized for how aggressively it does so.<\/p>\n<p data-track=\"3\">Freelancer, a freelance website, also said that ClaudeBot visited its site 3.5 million times in four hours, forcing it to block the crawler. 
Critics said that Anthropic ignored websites&#039; robots.txt directives and took the data anyway, contradicting its stated commitment to &quot;responsible AI&quot;.<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-16803\" title=\"get-945\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/get-945.jpg\" alt=\"get-945\" width=\"598\" height=\"1047\" \/><\/div>\n<p data-track=\"4\">Kyle Wiens, CEO of the repair site iFixit, posted the following tweet on July 24 (as translated by IT Home):<\/p>\n<blockquote>\n<p data-track=\"5\">@AnthropicAI, I know you&#039;re hungry for data, and Claude is a very smart model, but is it really necessary to hit our servers 1 million times in 24 hours?<\/p>\n<p data-track=\"6\">You are not paying us for this traffic, and it ties up our development resources, which is really unfair.<\/p>\n<p data-track=\"7\">Our Terms of Service clearly prohibit using our content in this way, but you, @AnthropicAI, quietly did it anyway.<\/p>\n<p data-track=\"8\">If @AnthropicAI wants to discuss a commercial license for our content, we\u2019re open to talking.<\/p>\n<\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>On July 31, the Financial Times (FT) published a blog post stating that AI company Anthropic, while claiming to \"develop AI responsibly,\" is excessively crawling website data with its ClaudeBot bot to train the Claude large language model. While using web crawlers to scrape data is a common practice in the AI industry, Anthropic has been criticized for how aggressively it does so. Freelancer, a freelance website, also said that it was forced to block ClaudeBot after the crawler visited its site 3.5 million times in four hours. 
Critics pointed out that Anthropic ignored the site's robots.txt protocol and forced access to data, in contrast to its claims that \"<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[155,320,909,3790],"collection":[],"class_list":["post-16802","post","type-post","status-publish","format-standard","hentry","category-news","tag-ai","tag-anthropic","tag-909","tag-3790"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16802","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=16802"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16802\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=16802"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=16802"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=16802"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=16802"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}