Zhipu Open-Sources Its Next-Generation General-Purpose Visual Language Model

Yesterday, Zhipu officially launched and open-sourced GLM-4.1V-Thinking, a new generation of general-purpose visual language model, which it describes as "the key leap from perception to cognition achieved by the GLM family of visual models."

Specifically, GLM-4.1V-Thinking is a general-purpose reasoning-oriented large model that supports multimodal inputs such as images, videos, and documents, and is designed for complex cognitive tasks. Building on the GLM-4V architecture, it introduces a chain-of-thought (CoT) reasoning mechanism and adopts "Reinforcement Learning with Curriculum Sampling (RLCS)", which is said to systematically improve the model's cross-modal causal reasoning ability and stability.
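
As a rough illustration of what "curriculum sampling" can mean in a reinforcement learning setting, the sketch below favors training samples of intermediate difficulty for the current model. It is a toy example written for this article, not Zhipu's RLCS implementation: the function names, the pass-rate thresholds, and the p * (1 - p) weighting are all assumptions chosen for clarity.

```python
import random

def curriculum_weights(pass_rates, low=0.1, high=0.9):
    """Assign a sampling weight to each training sample (hypothetical scheme).

    pass_rates[i] is the fraction of recent rollouts in which the model solved
    sample i. Samples that are almost always solved (> high) or almost never
    solved (< low) get weight 0; the rest are weighted by p * (1 - p), which
    peaks at a 50% pass rate.
    """
    weights = []
    for p in pass_rates:
        if p < low or p > high:
            weights.append(0.0)
        else:
            weights.append(p * (1.0 - p))
    return weights

def sample_batch(sample_ids, pass_rates, batch_size, seed=0):
    """Draw a training batch (with replacement) according to the weights."""
    rng = random.Random(seed)
    weights = curriculum_weights(pass_rates)
    if sum(weights) == 0:
        # Nothing falls in the useful difficulty band: fall back to uniform.
        return rng.choices(sample_ids, k=batch_size)
    return rng.choices(sample_ids, weights=weights, k=batch_size)

if __name__ == "__main__":
    ids = ["a", "b", "c", "d"]
    rates = [0.0, 0.5, 1.0, 0.2]  # "a" is too hard, "c" is too easy
    print(sample_batch(ids, rates, batch_size=8))
```

In a real training loop the pass rates would be re-estimated as the model improves, so the set of sampled problems shifts over time; that shifting selection is the "curriculum" aspect of this sketch.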

The lightweight GLM-4.1V-9B-Thinking keeps its parameter count at the 10B scale. Across 28 authoritative benchmarks, including MMStar and MMMU-Pro, it achieved the best result among 10B-class models on 23, of which 18 match or surpass Qwen-2.5-VL, a model with as many as 72B parameters.

According to Zhipu, GLM-4.1V-9B-Thinking demonstrates a high degree of versatility and robustness, performing strongly in five major areas: image-text understanding, mathematical and scientific reasoning, video understanding, GUI and web agent tasks, and visual grounding and entity localization.

GLM-4.1V-9B-Thinking is now open-sourced on GitHub, HuggingFace, and the ModelScope community, and the technical paper and API documentation have been published. Two models are available in this release: the base model GLM-4.1V-9B-Base and GLM-4.1V-9B-Thinking.

Paper: https://arxiv.org/abs/2507.01006

GitHub: https://github.com/THUDM/GLM-4.1V-Thinking

HuggingFace: https://huggingface.co/collections/THUDM/glm-41v-thinking-6862bbfc44593a8601c2578d

ModelScope community: https://modelscope.cn/collections/GLM-41V-35d24b6def9f49

API documentation: https://www.bigmodel.cn/dev/api/visual-reasoning-model/glm-4.1v-thinking

Zhipu has also launched a new ecosystem platform, "Agent Application Space."

According to reports, "Agent Application Space" is an AI Agent capability aggregation platform for enterprise customers and developers. It brings together a wide range of Agent applications and model plug-ins (MCP), offers out-of-the-box, flexibly orchestrable component services and Agent applications, and is intended to spare enterprises the need to build their own large-model teams.

In addition, at the Zhipu Open Platform Industry Ecosystem Conference on July 2, it was announced that Pudong Venture Capital Group and Zhangjiang Group had made a strategic investment in Zhipu totaling 1 billion yuan, and that the first closing was completed recently.
