Big models help you write novels, Step Star launches Step-2 "Cost-effective Edition" and "Literary Master Edition".

January 21st.Step StarTwo new models in the Step-2 series of language models were launched yesterday -- Step-2 mini, a smaller participant size and more cost-effective model, and Step Literature Master, a model specifically for the content creation field.

Big models help you write novels, Step Star launches Step-2 "Cost-effective Edition" and "Literary Master Edition".

1AI learned from the official introduction that Step-2 mini and Trillion ParametersLarge Model Compared with Step-2, it retains its modeling performance above 80% with a parameter count around 3%.

Meanwhile, Step-2 mini Faster generation speeds and excellent value for moneyThe average initial word latency of Step-2 mini is only 0.17 seconds with 4000 tokens. In the case of inputting 4000 tokens, the average first-word latency of Step-2 mini is only 0.17 seconds. At present, you can already call the API interface of Step-2 mini on the open platform of Step Star. Input 1 yuan/million tokens; Output 2 yuan/million tokens.

Step-2 mini adopts a new attention mechanism architecture independently developed by Step-Star - MFA (Multi-matrix Factorization Attention) and its variant MFA-Key-Reuse, which saves nearly 94% KV cache overhead and significantly reduces inference cost compared with the commonly used MHA (Multi-Head Attention) architecture. Compared with the commonly used MHA (Multi-Head Attention) architecture, it saves nearly 94% of KV cache overhead, has faster inference speed and significantly reduces inference cost.

According to the official introduction, Step-2 Literary Master Edition is a model developed specifically for the creation of textual content, following Step-2's knowledge base, the ability to control the text of powerful details.Featuring more robust content creation capabilitiesStep-2 Literary Masters Edition seeks to solve the problem of over-alignment of language models in the market, which leads to "false and empty" content and a lack of novelty and true feelings.

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.
Information

More Controlled Generation: Wisdom Spectrum's AI Video Generation Tool Gets 2.0 Update, Supports Large Motion of Picture Subjects

2025-1-21 20:37:29

Information

Tencent Hybrid 3D Generation Big Model 2.0 Open Source Release, Simultaneously Launched the "Industry's First One-Stop 3D Content AI Creation Platform"

2025-1-21 20:40:47

Search