DeepSeek's new model is exposed: MODEL1 code for a new architecture, most likely to be released in February

DeepSeek's new model is exposed: MODEL1 code predicts a new architecture, which is expected to be released in February

On January 21st, the Information broke out early in the month, saying:DeepSeek A new generation of flagships will be launched this year, mid-February, during the new calendar year AI Models — DeepSeek V4, which will have greater ability to write code。

DeepSeek's new model is exposed: MODEL1 code predicts a new architecture, which is expected to be released in February

On 20 January, on the first anniversary of the release of DeepSeek-R1, developers discovered that DeepSeek had updated a series of FlashMLA codes in GitHub, with 28 files across 114 documents mentioning unknown "MODEL1 " large model identifiers。

This identifier is listed or mentioned separately from the known existing model "V32" (i.e. DeepSeek-V3.2). Based on a code context analysis, MODEL 1 is likely to represent a new model that differs from the existing architecture。

THE DEVELOPERS ANALYSED THE DISTINCTION BETWEEN " MODEL1 " AND " V32 " IN KEY TECHNOLOGIES, MAINLY IN TERMS OF THE LAYOUT OF THE KEY VALUE (KV) CACHE, THE THINNESS OF PROCESSING AND THE DECODED SUPPORT FOR THE FP8 DATA FORMAT. THESE DIFFERENCES INDICATE THAT THE NEW ARCHITECTURE HAS BEEN TAILORED TO THE POTENTIAL FOR MEMORY OPTIMIZATION AND COMPUTING EFFICIENCY。

Previously, the DeepSeek research team also published two technical papers on a new training methodology entitled “Optimizing Disability Connections (mHC)” and a biologically inspired “AI Memory Module (Engram)”. This move leads users to speculate that the new model that DeepSeek is developing has the potential to integrate these latest research results. Please look forward。

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

{{userData.name}}Verify

DeepSeek's new model is exposed: MODEL1 code predicts a new architecture, which is expected to be released in February

BEFORE GOOGLE CEO SCHMIDT: EUROPE EITHER INVESTS IN OPEN SOURCE AI OR DEPENDS ON THE CHINESE MODEL

In 2025, our human robot complexes exceeded 140, issuing 330

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

BEFORE GOOGLE CEO SCHMIDT: EUROPE EITHER INVESTS IN OPEN SOURCE AI OR DEPENDS ON THE CHINESE MODEL

In 2025, our human robot complexes exceeded 140, issuing 330

DeepSeek V2 Series of AI Models Wraps Up, Connected Search Goes Live

DeepSeek-R2 AI model to be released on March 17, sources say

As competition in the AI ​​market intensifies, Cohere founder says sales model faces “zero profit” crisis

OpenAI's First Open Source Model? Mysterious Horizon Alpha Emerges, Surpassing Kimi K2 in EQ-Bench Creative Writing Rankings

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow

As competition in the AI market intensifies, Cohere founder says sales model faces “zero profit” crisis