June 27th. Tencent's Hunyuan large model family announced a new member today: the Hunyuan-A13B model has been released and open-sourced, claimed to be "the industry's first 13B-level MoE open-source hybrid reasoning model".

Hunyuan-A13B is a large model based on the Mixture of Experts (MoE) architecture, with 80 billion total parameters and 13 billion activated parameters. The company claims it "dramatically reduces inference latency and computational overhead while delivering results comparable to top open-source models".
According to Tencent Hunyuan, this is good news for individual developers and small and medium-sized enterprises (SMEs): in extreme cases the model can be deployed on a single low- to mid-range GPU card. Users can download and use it from GitHub, Hugging Face, and other technical communities, and the model API is available on the Tencent Cloud website.
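For developers who grab the weights from Hugging Face, usage might look roughly like the sketch below with the transformers library; the repository id and generation settings are assumptions for illustration, so check the official model card for the real details.

```python
# Minimal sketch: loading Hunyuan-A13B with Hugging Face transformers.
# The repo id "tencent/Hunyuan-A13B-Instruct" is an assumption for illustration;
# see the official GitHub / Hugging Face pages for the actual model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tencent/Hunyuan-A13B-Instruct"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # reduced precision to fit a single mid-range GPU
    device_map="auto",            # let transformers place layers on available devices
    trust_remote_code=True,
)

messages = [{"role": "user", "content": "Give a one-sentence introduction of yourself."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```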
Through its MoE architecture, the Hunyuan-A13B model selectively activates the relevant model components for each input. It is claimed to be "fast and economical" compared with dense models of the same size, providing a "scalable and efficient alternative" for individual developers and SMEs.
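To make "selectively activates relevant model components" concrete, below is a toy top-k routing layer in the general spirit of MoE models; the expert count, hidden sizes, and top-k value are arbitrary illustrative choices, not Hunyuan-A13B's actual configuration.

```python
# Toy illustration of MoE routing: only the top-k experts run per token,
# so compute scales with activated parameters rather than total parameters.
# All sizes here are made up for illustration; they are not Hunyuan-A13B's.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyMoELayer(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, n_experts=16, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts)  # scores each expert per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                                # x: (tokens, d_model)
        scores = self.router(x)                          # (tokens, n_experts)
        weights, idx = scores.topk(self.k, dim=-1)       # keep only k experts per token
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):                       # run just the selected experts
            for e in idx[:, slot].unique().tolist():
                mask = idx[:, slot] == e
                out[mask] += weights[mask, slot].unsqueeze(-1) * self.experts[e](x[mask])
        return out

tokens = torch.randn(8, 512)
print(ToyMoELayer()(tokens).shape)  # torch.Size([8, 512])
```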
In pre-training, the model used a corpus of 20 trillion high-quality web tokens, raising the upper limit of its reasoning ability. Tencent also says it refined the Scaling Law theory for the MoE architecture, providing quantifiable engineering guidance for MoE architecture design and improving the model's pre-training results.
Users can choose the thinking mode on demand: the fast thinking mode provides concise, efficient output suited to simple tasks where speed and minimal computational overhead matter most, while the slow thinking mode performs deeper, more comprehensive reasoning steps. This optimizes the allocation of computational resources, balancing efficiency and accuracy.
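The announcement does not spell out how the switch is exposed, but hybrid-reasoning models typically surface it through a chat-template flag or a prompt prefix. The sketch below only illustrates that pattern; the `enable_thinking` argument name is an assumption, and the authoritative mechanism should be taken from the model card.

```python
# Hypothetical sketch of switching between fast and slow thinking.
# Hybrid-reasoning models commonly expose this via a chat-template flag or a
# prompt prefix; treat "enable_thinking" below as an assumed parameter name and
# confirm the real mechanism in the Hunyuan-A13B model card.
def build_prompt(tokenizer, question, slow_thinking=True):
    messages = [{"role": "user", "content": question}]
    return tokenizer.apply_chat_template(
        messages,
        add_generation_prompt=True,
        tokenize=False,
        enable_thinking=slow_thinking,  # assumed flag: True = slow/deep, False = fast
    )

# Fast mode for a simple lookup-style task, slow mode for multi-step reasoning:
# fast_prompt = build_prompt(tok, "What is the capital of France?", slow_thinking=False)
# slow_prompt = build_prompt(tok, "Plan a 3-day budget trip to Chengdu.", slow_thinking=True)
```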
Hunyuan has also open-sourced two new datasets to fill gaps in the industry's relevant evaluation standards. ArtifactsBench is mainly used for code evaluation, building a new benchmark of 1,825 tasks; C3-Bench is designed for evaluating models in Agent scenarios, with 1,024 test items intended to uncover shortcomings in model capability.
As a concrete example of mathematical reasoning, given the input "Which is larger, 9.11 or 9.9?", the model accurately completes the decimal comparison and shows its step-by-step reasoning.
For popular Agent applications, the model can call tools to produce complex responses such as travel itineraries and data-file analysis.
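As a rough idea of what such tool calling looks like from an application's side, the snippet below uses the widely adopted OpenAI-compatible chat-completions pattern; the endpoint URL, model name, and tool definition are placeholders rather than Tencent Cloud's documented interface.

```python
# Generic tool-calling sketch against an OpenAI-compatible endpoint.
# The base_url, model name, and tool schema are placeholders for illustration;
# consult Tencent Cloud's documentation for the real API details.
from openai import OpenAI

client = OpenAI(base_url="https://example-endpoint/v1", api_key="YOUR_KEY")  # placeholder

tools = [{
    "type": "function",
    "function": {
        "name": "search_flights",  # hypothetical tool
        "description": "Search flights between two cities on a given date.",
        "parameters": {
            "type": "object",
            "properties": {
                "origin": {"type": "string"},
                "destination": {"type": "string"},
                "date": {"type": "string", "format": "date"},
            },
            "required": ["origin", "destination", "date"],
        },
    },
}]

resp = client.chat.completions.create(
    model="hunyuan-a13b",  # placeholder model name
    messages=[{"role": "user", "content": "Plan a weekend trip from Shenzhen to Chengdu."}],
    tools=tools,
)
# If the model decides a tool is needed, it returns structured arguments
# instead of plain text, which the calling application then executes.
print(resp.choices[0].message.tool_calls)
```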
On the benchmark side, the model demonstrated "leading results" on mathematics, science, and logical-reasoning tasks across multiple public test sets.
1AI attaches the open-source address:
https://github.com/Tencent-Hunyuan/Hunyuan-A13B