Tsinghua team open-sources large model inference engine "Chitu", halving DeepSeek inference cost and doubling performance

March 14: Professor Zhai Jidong's team at Tsinghua University's Institute of High Performance Computing and the Tsinghua-affiliated startup Qingcheng Jizhi jointly announced today that the large model inference engine "Chitu" is now open source.

According to the announcement, the engine is the first to run FP8-precision models natively on GPUs other than NVIDIA's Hopper architecture and on a range of Chinese domestic chips, halving the cost and doubling the performance of DeepSeek inference. Positioned as a "production-grade large model inference engine", it offers the following features:

Diverse hardware support: covers NVIDIA products from the latest flagships to older series, and also provides optimized support for Chinese domestic chips.
Full-scenario scalability: the Chitu engine scales from CPU-only deployments and single-GPU deployments to large-scale cluster deployments.
Long-term stable operation: suitable for real production environments and stable enough to carry concurrent business traffic.

According to the team, in tests on an A800 cluster running the full DeepSeek-R1-671B model, the open-source Chitu engine achieved a 3.15-fold increase in inference speed while using 50% fewer GPUs, compared with some foreign open-source frameworks.

Open-source repository: https://github.com/thu-pacman/chitu
