News from September 19: Xiaomi today announced the open-sourcing of its first native end-to-end speech large model, achieving ICL-based few-shot generalization in the speech domain for the first time.

According to Xiaomi, five years ago GPT-3 first demonstrated that In-Context Learning (ICL) can be acquired through autoregressive language modeling combined with large-scale unsupervised training. In the speech domain, however, existing large models still rely heavily on large-scale labeled data and struggle to generalize to new tasks the way human intelligence does.
The Xiaomi MiMo-Audio model breaks this bottleneck. Built on an innovative pre-training architecture and roughly 100 million hours of training data, it improves cross-modal alignment in intelligence, expressiveness, and safety, as well as human-like naturalness, emotional expression, and interactivity.
The model's specific innovations are as follows:
- For the first time, scaling lossless-compression-based speech pre-training to 100 million hours was shown to produce "emergent" few-shot learning across tasks (see the sketch after this list).
- For the first time, the objectives of generative speech pre-training are clearly defined, and a complete open-source speech pre-training stack is released, covering the tokenizer, an entirely new model architecture, training methods, and an evaluation system.
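To make the few-shot claim above concrete, here is a minimal, model-agnostic sketch of how speech ICL prompts are typically assembled: a handful of (audio, text) demonstration pairs are interleaved ahead of the query audio, and the model is expected to infer the task from the examples alone. The segment structure, task, and file names below are illustrative assumptions, not MiMo-Audio's actual prompt schema.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Segment:
    kind: str   # "audio" (a waveform file to be tokenized) or "text"
    value: str  # file path for audio segments, plain string for text

def build_few_shot_prompt(
    demos: List[Tuple[str, str]], query_wav: str
) -> List[Segment]:
    """Interleave (audio, label) demonstration pairs, then append the
    unlabeled query audio -- the speech analogue of text-LLM few-shot ICL."""
    prompt: List[Segment] = []
    for wav_path, label in demos:
        prompt.append(Segment("audio", wav_path))
        prompt.append(Segment("text", label))
    prompt.append(Segment("audio", query_wav))  # the model must label this one
    return prompt

# Hypothetical emotion-recognition task defined purely by two examples.
demos = [("clip_happy.wav", "emotion: happy"),
         ("clip_sad.wav", "emotion: sad")]
for seg in build_few_shot_prompt(demos, "clip_query.wav"):
    print(seg.kind, seg.value)
```

The point of the pattern is that no task-specific fine-tuning is involved: swapping the demonstration pairs redefines the task at inference time.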
At present, Xiaomi has open-sourced the model's pre-trained and instruction-tuned checkpoints on the Hugging Face platform, with the accompanying code on GitHub. The 1.2B-parameter tokenizer model, built on a Transformer architecture, supports audio reconstruction and audio-to-text tasks.
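As a hedged illustration, the released checkpoints can presumably be fetched with the standard huggingface_hub client. The repository IDs below are assumptions inferred from the announcement, not verified identifiers; check Xiaomi's official Hugging Face and GitHub pages for the exact names.

```python
from huggingface_hub import snapshot_download

# Repo IDs are assumed names for illustration only.
for repo_id in [
    "XiaomiMiMo/MiMo-Audio-7B-Base",      # pre-trained checkpoint (assumed name)
    "XiaomiMiMo/MiMo-Audio-7B-Instruct",  # instruction-tuned checkpoint (assumed name)
    "XiaomiMiMo/MiMo-Audio-Tokenizer",    # 1.2B Transformer tokenizer (assumed name)
]:
    local_dir = snapshot_download(repo_id=repo_id)
    print(f"downloaded {repo_id} -> {local_dir}")
```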