{"id":45756,"date":"2025-11-07T11:22:47","date_gmt":"2025-11-07T03:22:47","guid":{"rendered":"https:\/\/www.1ai.net\/?p=45756"},"modified":"2025-11-07T11:22:47","modified_gmt":"2025-11-07T03:22:47","slug":"kimi-%e8%bf%84%e4%bb%8a%e8%83%bd%e5%8a%9b%e6%9c%80%e5%bc%ba%e5%bc%80%e6%ba%90%e6%80%9d%e8%80%83%e6%a8%a1%e5%9e%8b%ef%bc%8c%e6%9c%88%e4%b9%8b%e6%9a%97%e9%9d%a2-kimi-k2-thinking-%e5%8f%91%e5%b8%83","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/45756.html","title":{"rendered":"Kimi The most powerful open-source thinking model so far, the dark side of the moon Kimi K2 Thinking"},"content":{"rendered":"<p>November 7th, the dark side of the moon <a href=\"https:\/\/www.1ai.net\/en\/tag\/kimi\" title=\"[View articles tagged with [Kimi]]\" target=\"_blank\" >Kimi<\/a> The most powerful so far<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >Open Source<\/a>Thinking model - Kimi K2 Thinking\u3002<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-45757\" title=\"6e4a1926j00t5c6oq003qd000u00gwp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/11\/6e4a1926j00t5c6oq003qd000u000gwp.jpg\" alt=\"6e4a1926j00t5c6oq003qd000u00gwp\" width=\"1080\" height=\"608\" \/><\/p>\n<p>The model was described as the dark side of the moon based on \"The Model is <a href=\"https:\/\/www.1ai.net\/en\/tag\/agent\" title=\"[View articles tagged with [Agent]]\" target=\"_blank\" >Agent<\/a>\"Thinking Agent, the new generation of conceptual training, has the ability to think and use tools.\" Performance in many benchmark tests such as Humanity's Last Exam, Autonomous Web Browser Capability (BrowseComp), Complex Information Collection Logic (SEAL-0)<strong>REACHING SOTA LEVEL<\/strong>And there has been an overall improvement in the capacity of Agenic search, Agenic programming, writing and integrated reasoning\u3002<\/p>\n<p>Without human intervention, the model can autonomously achieve up to 300 rounds of tools to mobilize and continuously stabilize multiple rounds of thinking, thus helping users to solve more complex problems\u3002<\/p>\n<p>1AI with links to Huging Face, ModelScop deployment as follows:<\/p>\n<ul>\n<li>Hugging Face: https:\/\/huggingface.co\/moonshotai<\/li>\n<li>ModelScope: https:\/\/www.modelscope.cn\/organization\/moonshotai<\/li>\n<\/ul>\n<p>The Human Last Examination is a final closed academic test covering more than 100 areas of specialization. Kimi K2 Thinking achieved the SOTA performance of 44.9% in this benchmark assessment, where the tools - search, Python, web browsing - are allowed\u3002<\/p>\n<p>In the official example provided, Kimi K2 Thinking, after five rounds of search and reasoning, combined with new information from each round, layered in depth and ultimately deduced the answer:<\/p>\n<p>According to the presentation, the Kimi K2 Thinking model also performed well in complex search and browsing scenarios. BrowneComp is a benchmark test published by OpenAI to assess the ability of AI Agent to browse, which was originally designed to measure AI Agent\u00a0<strong>Persistence and creativity in an information overloading environment<\/strong>In other words, it is possible to \u201cscratch the bottom\u201d like human researchers. On average, humanity can achieve only 29.21 TP3T in this challenging task. Kimi K2 Thinking demonstrated a great ability to drill in this benchmark test<strong>A NEW SOTA MODEL WITH RESULTS FROM 60.2%<\/strong>.<\/p>\n<p>Kimi K2 Thinking, driven by long-range planning and autonomous search capabilities, can use up to<strong>Hundreds of rounds of \"thinking, searching, browse, browsing, browsing\" dynamic cycle<\/strong>, presents and refines assumptions on an ongoing basis, validates evidence, undertakes reasoning and constructs logical answers. This ability to actively search and think on a continuous basis enables Kimi K2 Thinking to break vague and open issues into clear, implementable sub-tasks\u3002<\/p>\n<p>In another example provided by official sources, Kimi K2 Thinking, after two rounds of search and reflection, first found the company that made the speedboat based on known information on stock buy-backs, and then found the stock buy-back announcement information on the United States Securities and Exchange Commission (SEC) online, giving the correct answer:<\/p>\n<p>The coding capacity of the Kimi K2 Thinking model has also been enhanced, with further improvements in performance in benchmarking tests such as the multilingual software engineering benchmark SWE-multilingual, the SWE-bench validation set and the Terminal terminal use\u3002<\/p>\n<p>The dark side of the moon indicates that the common base capacity of Kimi K2 Thinking has also been upgraded:<\/p>\n<ul>\n<li>Creative Writing: Kimi K2 Thinking has significantly improved writing skills, which translates crude inspiration into clear, moving and well-intended narratives that combine rhythm and depth. It can easily manage subtle textual differences and vague structures and maintain consistency in style in long speeches. In terms of creative writing, it has a more lively image and a stronger emotional resonance that integrates a precise expression with a rich performance\u3002<\/li>\n<li>Academia and research: Kimi K2 Thinking has significantly improved in terms of analytical depth, accuracy of information and logical structure in academic and professional fields. It analyses complex instructions in an orderly manner and expands thinking in a clear and rigorous manner. This makes it particularly specialized in dealing with academic papers, technical abstracts and long reports that are highly demanding for the integrity of information and the quality of reasoning\u3002<\/li>\n<li>Personal and Emotional: In response to personal or emotional questions, Kimi K2 Thinking's answer is more common and balanced. It is thoughtful and concrete and provides a nuanced perspective and practical follow-up recommendations. It helps users to streamline complex decision-making with clarity and concern, and its tone is both real and relevant and more human\u3002<\/li>\n<\/ul>","protected":false},"excerpt":{"rendered":"<p>On November 7th, the dark side of the moon launched Kimi's most powerful open-source thinking model - Kimi K2 Thinking. The model was described as a new generation of Tinking Agent, trained in the concept of the \"model is Agent\" on the dark side of the moon, where originals mastered the ability to \"think and use tools\". Performance at SOTA level in many benchmark tests such as Humanity's Last Exam, Autonomous Web Browser Capability (BrowseComp), Complex Information Collection Logic (SEAL-0), and achievement of full capacity in Agenic search,Agentic programming, writing and synthesis of reasoning<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[1405,167,1814,219],"collection":[],"class_list":["post-45756","post","type-post","status-publish","format-standard","hentry","category-news","tag-agent","tag-ai","tag-kimi","tag-219"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/45756","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=45756"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/45756\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=45756"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=45756"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=45756"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=45756"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}