{"id":35442,"date":"2025-05-18T12:54:02","date_gmt":"2025-05-18T04:54:02","guid":{"rendered":"https:\/\/www.1ai.net\/?p=35442"},"modified":"2025-05-18T12:54:02","modified_gmt":"2025-05-18T04:54:02","slug":"%e6%96%b0%e7%a0%94%e7%a9%b6%e5%8f%91%e7%8e%b0-ai-%e6%97%a0%e6%b3%95%e8%af%bb%e6%87%82%e6%a8%a1%e6%8b%9f%e6%97%b6%e9%92%9f%ef%bc%8c%e8%bf%98%e4%b8%8d%e8%83%bd%e5%91%8a%e8%af%89%e4%bd%a0%e6%9f%90","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/35442.html","title":{"rendered":"New study finds AI can't read analog clocks, and can't tell you what day of the week it is"},"content":{"rendered":"<p>May 17, 2011 - According to a report today in the foreign media LiveScience, there are some tasks that humans can easily accomplish that<a href=\"https:\/\/www.1ai.net\/en\/tag\/ai\" title=\"[View articles tagged with [AI]]\" target=\"_blank\" >AI<\/a> But it's not up to the task. For example, AI can program, draw realistic images, generate text close to human tone, and even score well on some exams, but in everyday life the most basic<strong>\"Watch the clock,\" \"count the days.\"<\/strong>But there have been frequent mistakes in such matters--\u00a0<strong>Either you can't read the pointer or you can't figure out the day of the week.<\/strong>.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-35443\" title=\"eb2ebc10j00swfxkf00ksd000i200a6p\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/05\/eb2ebc10j00swfxkf00ksd000i200a6p.jpg\" alt=\"eb2ebc10j00swfxkf00ksd000i200a6p\" width=\"650\" height=\"366\" \/><\/p>\n<p>The researchers presented the findings at the International Conference on Learning Representations (ICLR) 2025, and the paper, which has been published on arXiv, has not yet been peer-reviewed.<\/p>\n<p>Rohit Saxena, a researcher at the University of Edinburgh and author of the paper, said, \"Humans have been able to grasp the concept of time and calendars from a young age, and AI's shortcomings in this area are a<strong>Signs to be wary of<\/strong>.\" He noted that to apply AI to real-life, time-sensitive scenarios such as<strong>Scheduling, automated processes or assistive technology<\/strong>, this type of basic competency deficiency must be addressed.<\/p>\n<p>The research team fed several large language models with graphic processing capabilities a set of<strong>customized<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%97%b6%e9%92%9f\" title=\"[See articles with [Clock] labels]\" target=\"_blank\" >clocks<\/a>With calendar images<\/strong>The models tested include<strong>\u00a0<\/strong><strong>Meta's Llama 3.2-Vision, Anthropic's Claude-3.5 Sonnet, Google's Gemini 2.0 and <a href=\"https:\/\/www.1ai.net\/en\/tag\/openai\" title=\"[View articles tagged with [OpenAI]]\" target=\"_blank\" >OpenAI<\/a> GPT-4o<\/strong>. Tests showed that none of these models were more than half correct on the tasks of determining clock time or extrapolating the day of the week of a date.<\/p>\n<p>Saxena said, \"AI training in the past has relied on<strong>Numerous examples with labels<\/strong>while reading the clock requires<strong>spatial reasoning<\/strong>. The model not only recognizes whether the pointers overlap or not, but also<strong>Understanding angles, distinguishing various styles of dials<\/strong>, such as Roman numerals or artistic designs. It's far more complicated than just recognizing 'this is a clock'.\"<\/p>\n<p>Calendar problems are also difficult for AI. for example, in \"<strong>What day of the week is the 153rd day of the year?<\/strong>\"Error rates remain high on such questions. Studies have shown that<strong>The AI reads the clock correctly at only 38.7% and judges the calendar even less accurately at 26.3%<\/strong>.<\/p>\n<p>Saxena explains, \"Arithmetic is a breeze for traditional computers, but not for big models. ai doesn't execute algorithms, but rather<strong>Relying on patterns learned from training data<\/strong>to predict the answer.\" He noted that while AI can sometimes answer questions correctly, its<strong>Lack of consistency in the reasoning process<\/strong>, nor is it based on fixed rules, which is precisely the gap revealed by the study.<\/p>\n<p>The study also revealed another problem, which is that AIs tend to perform worse when their training samples lack a certain type of phenomenon, such as leap years or complex calendar rules, Saxena said, \"Even if the models understand the concept of a 'leap year,' that doesn't mean that they can correctly apply this knowledge to specific visual judgments.\"<\/p>\n<p>1AI learned from the report that the study emphasized two areas of improvement:<strong>One is that the training data should contain more representative examples; the other is that how AI integrates logical reasoning and spatial perception should be revisited<\/strong>, especially when dealing with infrequently encountered tasks.<\/p>","protected":false},"excerpt":{"rendered":"<p>On May 17th, according to LiveScience today, there are humans who can easily perform tasks that AI cannot do. For example, AI can program, draw real images, produce text that is close to human speech, and even achieve good results in some of the tests, but often makes mistakes in things like the most basic \u201cwatch clock\u201d in everyday life \u2014 either not to read the pointer or not to count a few weeks. This discovery was presented by researchers at the International Conference on Learning Forms (ICLR) in 2025, and the papers have been published in arXiv and have not yet been peer-reviewed. Rohit Saxona, a researcher and author of the thesis at the University of Edinburgh, said: \u201cHumans have been growing since childhood<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[411,190,6634],"collection":[],"class_list":["post-35442","post","type-post","status-publish","format-standard","hentry","category-news","tag-ai","tag-openai","tag-6634"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/35442","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=35442"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/35442\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=35442"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=35442"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=35442"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=35442"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}