DeepSeek Breaks the AI Training Cost Curse: $12K (1/525 the Cost) and an MT-Bench Score That Rivals GPT-4o

April 8 news: DeepSeek has joined forces with Tsinghua University to launch a new AI alignment technique, SPCT (Self-Principled Critique Tuning), which breaks with the traditional reliance on massive amounts of training data by dynamically optimizing output quality during the inference phase.

According to a paper the research team published on April 4, the technique is built on a recursive "principle synthesis - response generation - critique filtering - principle optimization" architecture that enables models to dynamically correct their outputs as they reason.

The SPCT approach is divided into two phases. The first is rejective fine-tuning, a cold-start phase that lets the GRM (generative reward model) adapt to different input types and generate principles and critiques in the correct format. The second is a rule-based online reinforcement learning phase, which uses rule-based outcome rewards to push the GRM toward better principles and critiques and to improve inference-time scalability.
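The Python sketch below illustrates, at a very high level, how such a two-phase pipeline could be organized. Every name in it (generate_principles_and_critique, format_is_valid, rule_based_reward, and the training-step callbacks) is a hypothetical placeholder for illustration, not DeepSeek's actual implementation or API.

```python
import random

def generate_principles_and_critique(grm, query, responses):
    # Placeholder for one GRM sample: generated principles, a critique,
    # and a score for each candidate response.
    return {"principles": ["helpfulness", "accuracy"],
            "critique": "toy critique text",
            "scores": [random.random() for _ in responses]}

def format_is_valid(sample):
    # Phase 1 filter: keep only samples in the expected principle/critique format.
    return all(key in sample for key in ("principles", "critique", "scores"))

def rule_based_reward(sample, best_idx):
    # Phase 2 outcome rule: reward the GRM when its top-scored response
    # matches the response known to be better.
    predicted = max(range(len(sample["scores"])), key=lambda i: sample["scores"][i])
    return 1.0 if predicted == best_idx else -1.0

def spct_training(grm, dataset, finetune_step, rl_step):
    # Phase 1: rejective fine-tuning (cold start) -- sample, discard badly
    # formatted outputs, and fine-tune on the rest.
    for query, responses, best_idx in dataset:
        sample = generate_principles_and_critique(grm, query, responses)
        if format_is_valid(sample):
            finetune_step(grm, query, responses, sample)

    # Phase 2: rule-based online RL -- score each fresh sample with the
    # outcome rule and use it as the reinforcement signal.
    for query, responses, best_idx in dataset:
        sample = generate_principles_and_critique(grm, query, responses)
        rl_step(grm, query, responses, sample, rule_based_reward(sample, best_idx))
```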

Tests of the 27-billion-parameter DeepSeek-GRM model show that, with inference-time computation of 32 samples per query, it reaches the performance level of a 671B-scale model. The hardware-aware design uses a mixture-of-experts (MoE) architecture and supports a 128k-token context window, with a single-query latency of only 1.4 seconds.
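As a rough illustration of what "32 samples per query" means in practice, the sketch below repeatedly samples a reward model and aggregates the scores to pick a response. Here sample_grm_scores is a hypothetical stand-in for the actual GRM call, and aggregation by simple summation is an assumption rather than DeepSeek's documented voting scheme.

```python
import random

def sample_grm_scores(query, responses):
    # Hypothetical stand-in for one GRM call: freshly generated principles and
    # a critique boil down to one score per candidate response.
    return [random.random() for _ in responses]

def rank_responses(query, responses, num_samples=32):
    # Aggregate scores across repeated samples; more samples per query trade
    # extra inference-time compute for a more stable ranking.
    totals = [0.0] * len(responses)
    for _ in range(num_samples):
        for i, score in enumerate(sample_grm_scores(query, responses)):
            totals[i] += score
    return max(range(len(responses)), key=lambda i: totals[i])

best = rank_responses("example query", ["candidate A", "candidate B", "candidate C"])
print("selected candidate index:", best)
```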

The report points out that SPCT significantly lowers the deployment threshold for high-performance models: the DeepSeek-GRM model, for example, cost about $12,000 to train (roughly RMB 87,871 at the current exchange rate) and scored 8.35 on MT-Bench.

Model          Parameters (approx.)    MT-Bench    Estimated training cost
DeepSeek-GRM   27B                     8.35        $12,000
Nemotron-4     340B                    8.41        $1.2 million
GPT-4o         1.8T                    8.72        $6.3 million

For comparison, the 340B-parameter Nemotron-4, which cost $1.2 million to train, scored 8.41, and OpenAI's 1.8T-parameter GPT-4o scored 8.72 but cost as much as $6.3 million (about RMB 46.132 million at the current exchange rate), while DeepSeek-GRM's cost is only about 1/525 of that. The technology also reduces the need for human annotation by 90% and energy consumption by 73% compared to DPO, opening new possibilities for dynamic scenarios such as real-time robot control.
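The "1/525" figure in the headline follows directly from the two cost estimates quoted above:

```python
gpt4o_cost_usd = 6_300_000   # estimated GPT-4o training cost from the report
grm_cost_usd = 12_000        # estimated DeepSeek-GRM training cost
print(grm_cost_usd / gpt4o_cost_usd)   # ~0.0019, i.e. about 1/525 of the cost
print(gpt4o_cost_usd / grm_cost_usd)   # 525.0
```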
