{"id":20346,"date":"2024-09-21T10:45:21","date_gmt":"2024-09-21T02:45:21","guid":{"rendered":"https:\/\/www.1ai.net\/?p=20346"},"modified":"2024-09-21T10:45:21","modified_gmt":"2024-09-21T02:45:21","slug":"openai-%e7%9a%84%e6%96%b0-ai%e6%a8%a1%e5%9e%8b-o1-preview-%e5%92%8c-o1-mini-%e5%9c%a8%e8%81%8a%e5%a4%a9%e6%9c%ba%e5%99%a8%e4%ba%ba%e6%8e%92%e5%90%8d%e4%b8%ad%e5%8f%96%e5%be%97%e6%9c%80%e9%ab%98","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/20346.html","title":{"rendered":"OpenAI's new AI models o1-preview and o1-mini achieve top scores in chatbot rankings"},"content":{"rendered":"<p>Tech media outlet The Decoder published a blog post reporting that at the Chatbot Arena, the<a href=\"https:\/\/www.1ai.net\/en\/tag\/openai\" title=\"[View articles tagged with [OpenAI]]\" target=\"_blank\" >OpenAI<\/a> The new AI models o1-preview and o1-mini topped the list.<\/p>\n<p><strong>Introduction to Chatbot Arena<\/strong><\/p>\n<p>Chatbot Arena, a platform for comparing AI models, evaluated the new OpenAI system using more than 6,000 community ratings.<\/p>\n<p><strong>result<\/strong><\/p>\n<p>The results show that o1-preview and o1-mini especially<strong>Excels in math tasks, complex prompts, and programming.<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-20347\" title=\"f57c9178j00sk567g007hd000lb00flp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/09\/f57c9178j00sk567g007hd000lb00flp.jpg\" alt=\"f57c9178j00sk567g007hd000lb00flp\" width=\"767\" height=\"561\" \/><\/p>\n<p>The mathematical model dominance charts provided by Lmsys clearly show that o1-preview and o1-mini scored over 1360 points, which is much higher than the performance of the other models.<\/p>\n<p>O1 AIMS TO SET A COMMON NEW STANDARD FOR ARTIFICIAL INTELLIGENCE REASONING, I.E. \u201cTHINKING\u201d LONGER BEFORE ANSWERING\u3002<\/p>\n<p>However, the O1 model is not superior to GPT-4o in all respects. many tasks do not require complex logical reasoning, and sometimes GPT-4o is more responsive.<\/p>\n<p><strong>Precautions<\/strong><\/p>\n<p>the votes of o1-preview and o1-mini were well below those of mature models such as GPT-4o or Anthropic's Claude 3.5, each with less than 3000 comments, so that a small sample might not accurately represent the actual results and limit the meaning of the results\u3002<\/p>","protected":false},"excerpt":{"rendered":"<p>The Decoder published an article reporting that OpenAI's new artificial intelligence model o1-preview and o1-mini is at the top of the Chatbot Arena. The chat robot arena is a platform for more artificial intelligence models that evaluates the new OpenAI system using over 6000 community ratings. Results show that o1-preview and o1-mini performed well, particularly in mathematical tasks, complex tips and programming. The mathematical model advantage diagram provided by Lmsys clearly shows o1-preview and o1-mini<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[190],"collection":[],"class_list":["post-20346","post","type-post","status-publish","format-standard","hentry","category-news","tag-openai"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/20346","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=20346"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/20346\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=20346"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=20346"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=20346"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=20346"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}