Kimi-Researcher deep research model opens for internal testing: generates easily traceable 10,000-word reports

June 21, 2011 - 1AI has learned from theDark Side of the Moon Kimi Publicis has learned that Kimi's first AgentAgentKimi-Researcher Opens on the 20thlocalGrayscale test.

Kimi-Researcher deep research model opens for internal testing: generates easily traceable 10,000-word reports

Kimi-Researcher is a new generation of agent model trained based on end-to-end agentic RL technology, and is also an agent product specially designed for deep research tasks. Later, Dark Side of the Moon will gradually open source Kimi-Researcher's basic pre-trained models and reinforcement learning models.

For each problem, Kimi-Researcher autonomously plans the task execution process and ultimately delivers complete results:

  • Clarification: Proactive rhetorical questioning in understanding the problem and constructing a clearer problem space;
  • reflect in depth: An average of 23 steps of reasoning per task to autonomously sort out and address needs;
  • Proactive searchFor each task, an average of 74 keywords were planned, 206 URLs were found, and the top 3.2% contents with the highest information quality were judged and filtered by the model to eliminate redundant and low-quality information;
  • Invoke the tools and deliver the results: Autonomously invoke the browser, code and other tools to process raw data, automatically generate analysis conclusions, and complete the delivery end-to-end.

To ensure the quality of the output and the coverage of the information, Kimi-Researcher uses theasynchronous executionway, spending more time progressively reasoning, retrieving and writing content.

Kimi-Researcher deep research model opens for internal testing: generates easily traceable 10,000-word reports

Users will eventually receive 2 deliverables.

An informative and traceable in-depth research report

  • The average length of a report is betweenMore than 10,000 words;
  • An average of about 26 high-quality, traceable sources are cited;
  • All citations are embedded in the body of the text, clickable to jump and highlighting the original text for easy verification and traceability.

Kimi-Researcher deep research model opens for internal testing: generates easily traceable 10,000-word reports

An interactive, shareable and dynamic visualization report

  • Structured layout and mind maps make trends, exceptions and other important information visible at a glance;
  • The overall structure and core conclusions can be quickly grasped without having to read the entire text;
  • Support online link generation and sharing for easy display.

It's official, Kimi-Researcher has been recognized as one of the most successful players in the Humanity's Last Exam (HLE), a difficult benchmark designed specifically for AI, and has been recognized as one of the best in the world.Zero structure, no flow designThe scores for the settings are as follows:

  • Pass@1 Accuracy: 26.9%
  • Pass@4 Accuracy: 40.17%

This performance exceeds Claude 4 Opus (10.71 TP3T), Gemini 2.5 Pro (21.61 TP3T), is slightly higher than OpenAI Deep Research (26.61 TP3T), and is tied with Gemini-Pro's Deep Research Agent (26.91 TP3T), which This is one of the highest levels known to date. In the xbench benchmark released by Sequoia China, an AI capability evaluation system aligned with real task scenarios, Kimi-Researcher achieved an average pass rate of 69% in the DeepSearch task, ahead of other models in the list.

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.
Information

Anthropic Warns: Most AI Models, Including Claude, Will Commit 'Blackmail' Behavior

2025-6-21 13:08:38

Information

Google exposed for training AI models with tons of YouTube videos, creators knew nothing about it

2025-6-21 13:11:43

Search