{"id":47977,"date":"2025-12-29T10:41:10","date_gmt":"2025-12-29T02:41:10","guid":{"rendered":"https:\/\/www.1ai.net\/?p=47977"},"modified":"2025-12-29T10:41:10","modified_gmt":"2025-12-29T02:41:10","slug":"ai-pk-%e5%8c%97%e5%a4%a7%e5%8c%96%e5%ad%a6%e5%ad%a6%e7%94%9f%ef%bc%9a%e9%a1%b6%e5%b0%96%e6%a8%a1%e5%9e%8b%e4%bb%85%e4%b8%8e%e4%bd%8e%e5%b9%b4%e7%ba%a7%e6%9c%ac%e7%a7%91%e7%94%9f%e7%9a%84%e5%b9%b3","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/47977.html","title":{"rendered":"AI PK NORTH CHEMISTRY: TOP MODEL IS JUST THE SAME AS THE AVERAGE FOR JUNIOR UNDERGRADUATES"},"content":{"rendered":"<p>December 29, according to Xinhua<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%8c%97%e4%ba%ac%e5%a4%a7%e5%ad%a6\" title=\"Look at the article with the label\" target=\"_blank\" >Peking University<\/a>The recent results of the multi-modular in-depth reasoning assessment in the field of chemistry, SUPERChem, were released by a team from the Great North Computing Centre, the Computer Academy and the Yumpe Institute\u3002<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-47978\" title=\"ed18d28cj00t80ff600atd000iym\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/12\/ed18d28cj00t80ff600atd000u000iym.jpg\" alt=\"ed18d28cj00t80ff600atd000iym\" width=\"1080\" height=\"682\" \/><\/p>\n<p>And in the near future, they're using this \"Northern Test Paper\" as a yardstick, so they're trying to measure it <a href=\"https:\/\/www.1ai.net\/en\/tag\/ai\" title=\"[View articles tagged with [AI]]\" target=\"_blank\" >AI<\/a> The true boundaries of scientific reasoning\u3002<\/p>\n<p>According to the information received, the examination was attended by two junior students from the North Great Chemical and Molecular Engineering Institute, in addition to the GPT, Gemini, DeepSeek and Qwen, among others\u3002<\/p>\n<p>According to the report, the SUPERChem library is made up of 500 deep adaptations of difficult questions and front-line professional literature, not from a web-based public repository. The library is also designed to create a set of topics that AI \u201cnot seen\u201d must rely on hard-power reasoning\u3002<\/p>\n<p>IN THIS CAREFULLY DESIGNED EXAMINATION, HUMANS HAVE SHOWN COMPLEX SCIENTIFIC INSTINCTS. AS A BASELINE, UNDERGRADUATE STUDENTS AT THE NORTH GREAT CHEMICAL INSTITUTE WHO PARTICIPATED IN THE TESTING ACHIEVED AN AVERAGE ACCURACY RATE OF 40.31 TP3T\u3002<\/p>\n<p>ON THE OTHER HAND, AI IS DOING WELL:<\/p>\n<p>Even the top-of-the-post model tested has only the same level of achievement as the average for undergraduate students in the lower grades. According to the list, the highest GPT-5 (High) is the correct rate of 39.61 TP3T, which is below human level\u3002<\/p>\n<p>Not only is the correct rate \"unusual\", but in some areas, modeling is confusing for the team:<\/p>\n<p>THE LANGUAGE OF CHEMISTRY IS GRAPHIC, AND THE MOLECULAR STRUCTURE, THE RESPONSE MACHINE, CONTAINS KEY INFORMATION. FOR SOME MODELS, HOWEVER, THE ACCURACY RATE IS NOT REVERSED WHEN IMAGE INFORMATION IS INTRODUCED. THIS SUGGESTS THAT THE CURRENT AI STILL HAS SIGNIFICANT SENSORY BOTTLENECKS IN TRANSLATING VISUAL INFORMATION INTO CHEMICAL SYNTAX\u3002<\/p>\n<p>EVEN IF THE RIGHT ANSWER IS CHOSEN, IT MAY BE DIFFICULT TO SOLVE THE PROBLEM. THE TEAM FOUND THAT THE AI CHAIN OF REASONING TENDED TO BREAK UP HIGH-LEVEL TASKS SUCH AS PRODUCT STRUCTURE PREDICTION, RESPONSE MACHINE RECOGNITION AND STRUCTURE RELATIONSHIP ANALYSIS. THE CURRENT TOP-OF-THE-ART MODEL, WITH ITS VAST KNOWLEDGE RESERVES, IS STILL ILL-EQUIPPED TO DEAL WITH HARD NUCLEAR CHEMISTRY, WHICH REQUIRES CAREFUL LOGIC AND DEEP UNDERSTANDING\u3002<\/p>\n<p>According to the report, the team released this result not to prove AI ' s short board, but to push it further. SUPERChem is like a signpost. It reminds us:<\/p>\n<p>There is still a long way to go from a general chat robot to a professional scientific assistant who can understand the structure of the relationship, and who can drive the response machine. It's from \"Remember Knowledge\" to \"Understanding Physical World.\"\u3002<\/p>","protected":false},"excerpt":{"rendered":"<p>On 29 December, according to Xinhua, the Beijing University School of Chemical and Molecular Engineering, a team from the North Great Computing Centre, the Computer Institute, and the Institute of Yuantuan, recently released the latest results of the multi-modular in-depth reasoning assessment of chemistry, SUPERChem. In the recent past, they have used this \"Northern Test Paper\" as a yardstick for measuring AI's true boundaries in scientific reasoning. According to the information received, the examination was attended by two junior students from the Great North Institute of Chemical and Molecular Engineering, in addition to GPT, Gemini, DeepSeek and Qwen, among others. According to the report, this SUPERChem collection is made up of 500 deep adaptations of difficult questions and front-line professional literature, not the title<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[411,5220],"collection":[],"class_list":["post-47977","post","type-post","status-publish","format-standard","hentry","category-news","tag-ai","tag-5220"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/47977","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=47977"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/47977\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=47977"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=47977"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=47977"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=47977"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}