February 18th. DeepSeek today officially announced NSA (Native Sparse Attention), a hardware-aligned and natively trainable sparse attention mechanism for ultra-fast long-context training and inference.

The core components of NSA include:
- Dynamic Hierarchical Sparse Strategy
- Coarse-grained token compression
- Fine-grained token selection
DeepSeek says the mechanism is optimized for modern hardware designs, accelerating inference and reducing pre-training costs without sacrificing performance. On general benchmarks, long-context tasks, and instruction-based reasoning, NSA performs comparably to or better than full-attention models.
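The two-stage hierarchy above can be illustrated with a toy sketch: compress blocks of keys into coarse summary tokens, score those summaries against the query to pick the most relevant blocks, then attend only over the individual tokens of the selected blocks. This is a minimal NumPy illustration of the general idea, not DeepSeek's implementation; all function and parameter names here are hypothetical.

```python
import numpy as np

def sparse_attention(q, K, V, block=4, top_blocks=2):
    """Toy two-stage sparse attention (hypothetical sketch, not NSA's code):
    coarse-grained block compression, then fine-grained token selection."""
    n, d = K.shape
    nb = n // block
    # Coarse stage: compress each block of keys into one summary token (mean pool).
    summaries = K[: nb * block].reshape(nb, block, d).mean(axis=1)  # (nb, d)
    # Score the summaries against the query and keep the top-scoring blocks.
    block_scores = summaries @ q                                    # (nb,)
    keep = np.argsort(block_scores)[-top_blocks:]
    # Fine stage: gather the individual tokens of the selected blocks.
    idx = np.concatenate([np.arange(b * block, (b + 1) * block) for b in keep])
    Ks, Vs = K[idx], V[idx]
    # Standard softmax attention, but only over the selected tokens.
    scores = Ks @ q / np.sqrt(d)
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return w @ Vs

# Usage: attend over 16 keys while computing scores for only 8 of them.
rng = np.random.default_rng(0)
q = rng.standard_normal(8)
K = rng.standard_normal((16, 8))
V = rng.standard_normal((16, 8))
out = sparse_attention(q, K, V)   # same shape as one value vector: (8,)
```

Because the fine-grained softmax runs over `top_blocks * block` tokens instead of all `n`, the per-query cost shrinks roughly in proportion to the fraction of blocks kept, which is where the long-context speedup comes from.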
Attached paper link:
https://arxiv.org/abs/2502.11089