{"id":8161,"date":"2024-04-16T10:28:19","date_gmt":"2024-04-16T02:28:19","guid":{"rendered":"https:\/\/www.1ai.net\/?p=8161"},"modified":"2024-04-16T10:28:19","modified_gmt":"2024-04-16T02:28:19","slug":"gpt-4-turbo-%e5%87%bb%e8%b4%a5-claude-3%ef%bc%8c%e9%87%8d%e6%96%b0%e5%a4%ba%e5%9b%9e-%e6%9c%80%e4%bd%b3ai%e6%a8%a1%e5%9e%8b-%e7%a7%b0%e5%8f%b7","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/8161.html","title":{"rendered":"GPT-4 Turbo defeats Claude 3 and regains the title of &quot;Best AI Model&quot;"},"content":{"rendered":"<p><a href=\"https:\/\/www.1ai.net\/en\/tag\/openai\" title=\"[View articles tagged with [OpenAI]]\" target=\"_blank\" >OpenAI<\/a>\u00a0<span class=\"spamTxt\">up to date<\/span>Updated version released <a href=\"https:\/\/www.1ai.net\/en\/tag\/gpt-4\" title=\"[SEE ARTICLES WITH [GPT-4] LABELS]\" target=\"_blank\" >GPT-4<\/a>Turbo became available to developers and paid ChatGPT subscribers last week. When the model was launched, OpenAI said the new GPT-4Turbo offered several improvements over its predecessor, and users have found that to be true.<\/p>\n<p>Starting last Thursday, an updated version of GPT-4Turbo, gpt-4-turbo-2024-04-09, reclaimed the top spot on the Large Model Systems Organization (LMSYS) Chatbot Arena, a crowdsourced open platform where users can evaluate large language models (LLMs).<\/p>\n<p class=\"article-content__img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8162\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/6384885955371181794325483.png\" alt=\"\" width=\"539\" height=\"818\" \/><\/p>\n<p>In Chatbot Arena, users can communicate with two <a href=\"https:\/\/www.1ai.net\/en\/tag\/llms\" title=\"[View articles tagged with [LLMs]]\" target=\"_blank\" >LLMs<\/a> Chat side by side and compare their responses without knowing the identity of each model. These results were used to rank 82 LLMs in Chatbot Arena, including all<span class=\"spamTxt\">Most Popular<\/span>LLMs such as Gemini Pro, Claude3 series LLMs and Mistral-Large-2402.<\/p>\n<p class=\"article-content__img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-8163\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/6384885957904839659397749.jpg\" alt=\"\" width=\"921\" height=\"658\" \/><\/p>\n<p>As of April 13<span class=\"spamTxt\">up to date<\/span>\u00a0Chatbot Arena update, the updated version of GPT-4Turbo is in the lead in the overall, encoding, and English categories. This means that less than a month later, Anthropic&#039;s Claude3Opus was pushed to second place in the overall category, followed by GPT-4-1106-preview, an older version of GPT-4Turbo.<\/p>\n<p>These results may be attributed to gpt-4-turbo-2024-04-09&#039;s improvements in coding, math, logical reasoning, and writing skills, which outperformed a series of benchmarks used to test the proficiency of AI models. Want to compare gpt-4-turbo-2024-04-09&#039;s performance with other LLMs for yourself? You can visit the Chatbot Arena website, click the &quot;Arena (side-by-side)&quot; option, and select the model you want to compare.<\/p>\n<p>It\u2019s worth noting that since you know the identity of the model in side mode, you won\u2019t be able to vote. If you want to vote and have it count towards the leaderboard, you can use the \u201cArena (battle)\u201d option to compare random models. If you want to skip the test and go directly to gpt-4-turbo-2024-04-09 in ChatGPT, you need to be a ChatGPT Plus subscriber, which costs $20 per month.<\/p>","protected":false},"excerpt":{"rendered":"<p>OpenAI's latest update, GPT-4Turbo, was made available to developers and paid ChatGPT subscribers last week. When launching the model, OpenAI said the new GPT-4Turbo made several improvements from its predecessor, and subscribers have found this to be true. As of last Thursday, the updated version of GPT-4Turbo, gpt-4-turbo-2024-04-09, reclaimed the top spot in the Large Model Systems Organization (LMSYS) Chatbot Arena, a crowdsourced, open-access platform where subscribers can evaluate large language Models (LLM)<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[510,1646,190],"collection":[],"class_list":["post-8161","post","type-post","status-publish","format-standard","hentry","category-news","tag-gpt-4","tag-llms","tag-openai"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/8161","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=8161"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/8161\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=8161"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=8161"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=8161"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=8161"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}