DeepSeek-VL2 AI visual model open source: support dynamic resolution, processing scientific research charts, parsing various terrier diagrams, etc.

DeepSeek-VL2 AI visual model open source: support for dynamic resolution, processing scientific research charts, parsing various terrain maps, etc.

DeepSeek's official public website published a blog post yesterday (December 13) announcing thatOpen Source The DeepSeek-VL2 model, which achieved highly favorable results in all evaluation metrics, is officially known as theVisual ModelFormally entered the era of Mixture of Experts (MoE).

DeepSeek-VL2 AI visual model open source: support for dynamic resolution, processing scientific research charts, parsing various terrain maps, etc.

Citing the official press release, 1AI attached the DeepSeek-VL2 highlights as follows:

Data: double the quality of training data than the first generation of DeepSeek-VL, introducing new capabilities such as terse map understanding, visual localization, visual story generation, etc.
Architecture: the visual part uses a cut-over strategy to support dynamic resolution images, and the linguistic part adopts the MoE architecture for low-cost and high performance.
Training: Inheriting the three-phase training process of DeepSeek-VL, while adapting to the difficulty of variable number of image slices through load balancing, using different streaming parallelism strategies for image and text data, and introducing expert parallelism for the MoE language model to realize efficient training.

The DeepSeek-VL2 model supports dynamic resolution by using only one SigLIP-SO400M as an image encoder, and by slicing the image into multiple sub-images and a global thumbnail to achieve dynamic resolution image support. This strategy allows DeepSeek-VL2 to support resolutions up to 1152×1152 and extreme aspect ratios of 1:9 or 9:1 for more application scenarios.

The DeepSeek-VL2 model also benefits from the learning of more scientific document data, allowing it to easily understand various scientific charts and generate Python code based on the images through Plot2Code.

Both the model and the paper have been published:

Model Download:https://huggingface.co/deepseek-ai

GitHub homepage:https://github.com/ deepseek-ai/DeepSeek-VL2

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

{{userData.name}}Verify

DeepSeek-VL2 AI visual model open source: support for dynamic resolution, processing scientific research charts, parsing various terrain maps, etc.

Google starts rolling out a new version of its Gemini AI voice assistant to its smart speakers, first batch covers Nest Audio / 2nd generation Mini

New Google NotebookLM feature goes live: users can interact with AI anchors

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

Google starts rolling out a new version of its Gemini AI voice assistant to its smart speakers, first batch covers Nest Audio / 2nd generation Mini

New Google NotebookLM feature goes live: users can interact with AI anchors

Open source AI platform Lightning AI releases AI compiler "Thunder" to accelerate model training

Zhou Hongyi calls himself an "open source believer": Announces that the 360 Brain 7B model will be open source, supporting 500,000-word long text input

Ali Tongyi Qianwen open-sources Qwen2-Audio 7B voice interaction model: free interaction without text input

Melodisco, an open-source AI music player, contains 300,000 AI music tracks

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow