{"id":30432,"date":"2025-03-11T10:05:18","date_gmt":"2025-03-11T02:05:18","guid":{"rendered":"https:\/\/www.1ai.net\/?p=30432"},"modified":"2025-03-11T10:05:18","modified_gmt":"2025-03-11T02:05:18","slug":"openai-%e5%8f%91%e5%b8%83-cot-%e6%80%9d%e7%bb%b4%e9%93%be%e7%a0%94%e7%a9%b6%e6%88%90%e6%9e%9c%ef%bc%8c%e7%9b%91%e6%8e%a7%e9%98%bb%e6%ad%a2%e5%a4%a7%e6%a8%a1%e5%9e%8b%e6%81%b6%e6%84%8f%e8%a1%8c","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/30432.html","title":{"rendered":"OpenAI Releases CoT Civic Chain Research Results, Monitoring to Stop Big Model Malicious Behavior"},"content":{"rendered":"<p>March 11th.<a href=\"https:\/\/www.1ai.net\/en\/tag\/openai\" title=\"[View articles tagged with [OpenAI]]\" target=\"_blank\" >OpenAI<\/a> today announced its latest research, which attempts to use CoT (Chain of Thought) monitoring to monitor the \"thoughts\" of reasoning models, thereby preventing AI from <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%a7%e6%a8%a1%e5%9e%8b\" title=\"[View articles tagged with [large models]]\" target=\"_blank\" >Large Model<\/a>behaviors such as babbling, hiding true intentions, etc., and it is also one of the effective tools for monitoring supermodels.<\/p>\n<p>Note: AI occasionally spouts nonsense, which we generally call AI hallucinations. In addition to this, AI can be lazy or deliberately bad, such as tricking the user or simply giving up when a problem is too difficult.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-30433\" title=\"6e46df10j00ssxsfd003cd000gg00gfp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/03\/6e46df10j00ssxsfd003cd000gg00gfp.jpg\" alt=\"6e46df10j00ssxsfd003cd000gg00gfp\" width=\"592\" height=\"591\" \/><\/p>\n<p>OpenAI used the newly released cutting-edge model o3-mini as the monitored object and the weaker GPT-4o model as the monitor. The test environment was a coding task that required the AI to implement functionality in the codebase to pass unit tests.<\/p>\n<p>The results show that CoT monitors are excellent at detecting systematic \"reward hacking\" behavior, with a recall rate of 951 TP3T, far exceeding the 601 TP3T for monitoring behavior alone.<\/p>","protected":false},"excerpt":{"rendered":"<p>March 11, OpenAI today announced its latest research, trying to use CoT (chain of thought) monitoring way to monitor the reasoning model's \"ideas\", so as to stop the AI big model nonsense, hide the real intention and other behaviors, at the same time, this is also one of the effective tools to supervise the super model. Note: AI will occasionally talk nonsense, which we generally call AI hallucinations. In addition to this, AI can be lazy or deliberately bad, such as tricking the user or simply giving up when a problem is too difficult. OpenAI used the newly released cutting-edge model o3-mini as the monitored object and the weaker GPT-4o model as the monitor. The test environment was a coding task that required the AI to implement functionality in the codebase to pass unit tests. Conclusion<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[190,216],"collection":[],"class_list":["post-30432","post","type-post","status-publish","format-standard","hentry","category-news","tag-openai","tag-216"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/30432","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=30432"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/30432\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=30432"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=30432"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=30432"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=30432"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}