Preview of reasoning model DeepSeek-R1-Lite goes live, claims to rival OpenAI o1-preview

On November 21, DeepSeek announced that the preview version of its newly developed reasoning model, DeepSeek-R1-Lite, is now available.

According to DeepSeek, the R1 series of models is trained with reinforcement learning, and its reasoning process involves extensive reflection and verification, with chains of thought that can run to tens of thousands of words. On math, code, and a variety of complex logical reasoning tasks, the series achieves reasoning results comparable to OpenAI o1-preview, and it shows users the complete thought process, which o1 does not make public.

The DeepSeek-R1-Lite preview model has been evaluated on AIME, the highest difficulty level of the American Mathematics Competitions (AMC), and on Codeforces, one of the world's top programming competitions, among other benchmarks, where it outperforms well-known models such as GPT-4o.

DeepSeek-R1-Lite's reasoning process is long and includes a great deal of reflection and verification. The graph below shows that the model's score on a math competition correlates closely with the length of reasoning the test allows.

1AI notes that DeepSeek-R1-Lite is still in iterative development and currently supports web use only, not API calls. DeepSeek-R1-Lite also uses a smaller base model, which cannot fully unleash the potential of long chains of thought.

According to the company, the official version of the DeepSeek-R1 model will be fully open-sourced. The company will also release a technical report and deploy API services for public use.
