{"id":41141,"date":"2025-08-12T10:29:13","date_gmt":"2025-08-12T02:29:13","guid":{"rendered":"https:\/\/www.1ai.net\/?p=41141"},"modified":"2025-08-12T10:29:13","modified_gmt":"2025-08-12T02:29:13","slug":"%e6%99%ba%e8%b0%b1%e8%a7%86%e8%a7%89%e6%8e%a8%e7%90%86%e6%a8%a1%e5%9e%8b-glm-4-5v-%e4%b8%8a%e7%ba%bf%e5%b9%b6%e5%bc%80%e6%ba%90%ef%bc%8c%e5%8f%b7%e7%a7%b0%e5%85%a8%e7%90%83-100b-%e7%ba%a7","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/41141.html","title":{"rendered":"Smart Spectrum Visual Reasoning Model GLM-4.5V is online and open source, claiming to be \"the world's best at 100B\"."},"content":{"rendered":"<p>August 12 News.<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%99%ba%e8%b0%b1\" title=\"[View articles tagged with [Smart Spectrum]]\" target=\"_blank\" >Zhipu<\/a> AI today launched the world's most effective 100B-class<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >Open Source<\/a><a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%a7%86%e8%a7%89%e6%8e%a8%e7%90%86%e6%a8%a1%e5%9e%8b\" title=\"[Sees articles with visual reasoning labels]\" target=\"_blank\" >visual inference model<\/a> GLM-4.5V (total parameters 106B, activation parameters 12B), and synchronized with Hugging Face open source in the Magic Hitch community. In addition, the API call price is as low as $2\/M tokens for input and $6\/M tokens for output.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-41142\" title=\"21aa5882j00t0v07t006dd000u000a0p\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/08\/21aa5882j00t0v07t006dd000u000a0p.jpg\" alt=\"21aa5882j00t0v07t006dd000u000a0p\" width=\"1080\" height=\"360\" \/><\/p>\n<p>1AI learned from the official introduction that GLM-4.5V is based on GLM-4.5-Air, the flagship text base model of the new generation of Smart Spectrum, and continues the technical route of GLM-4.1V-Thinking, which has reached the performance of the same level of open source model SOTA in the 41 public visual multimodal lists, and covers common tasks such as image, video, document understanding, and GUI Agent. It covers common tasks such as image, video, document understanding, and GUI agent.<\/p>\n<p>Beyond the multimodal list, it places more emphasis on the model's performance and usability in real-world scenarios.GLM-4.5V By efficiently mixing the training<strong>Ability to cover a wide range of visual content<\/strong>, which enables full-scene visual reasoning, including:<\/p>\n<ul>\n<li>Image Reasoning (Scene Understanding, Complex Multi-graph Analysis, Location Recognition)<\/li>\n<li>Video understanding (long video split analysis, event recognition)<\/li>\n<li>GUI tasks (screen reading, icon recognition, desktop operation assistance)<\/li>\n<li>Complex charts and long documents parsing (research paper analysis, information extraction)<\/li>\n<li>Grounding capabilities (pinpointing visual elements)<\/li>\n<\/ul>\n<p>at the same time,<strong>The model has a new \"Thinking Mode\" switch, which allows users to flexibly choose between fast response and deep reasoning.<\/strong>The GLM-4.5V is a desktop assistant application that balances efficiency and effectiveness. In order to help developers intuitively experience the modeling capabilities of GLM-4.5V and create their own multimodal applications, Smart Spectrum AI has synchronously open-sourced a desktop assistant application.<\/p>\n<p><strong>The desktop application can take screenshots and record screenshots in real time to get screen information<\/strong>GLM-4.5V can handle a variety of visual reasoning tasks, daily processing such as code assistance, video content analysis, game answers, document interpretation and other types of visual tasks, and become a partner that can look at the screen and work and play with you. We also hope that through the model open source and API services, we can empower more developers with ideas to utilize their creativity and imagination based on the multimodal base model, and turn the scenes in the past sci-fi movies into reality.<\/p>","protected":false},"excerpt":{"rendered":"<p>August 12 news, Wisdom Spectrum AI today launched the world's 100B level of the best open source visual inference model GLM-4.5V (total parameters 106B, activation parameters 12B), and synchronized in the magic ride community and Hugging Face open source. In addition, the API call price is as low as $2\/M tokens for input and $6\/M tokens for output. 1AI learned from the official introduction that GLM-4.5V is based on the new generation of flagship text base model GLM-4.5-Air of Wisdom Spectrum, which is a continuation of the GLM-4.1V-Thinking technology route, and has achieved the same SOTA performance of open source models with the same level of performance on the list of 41 publicly available visual multimodal lists. level open source model SOTA performance.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[219,2680,5299],"collection":[],"class_list":["post-41141","post","type-post","status-publish","format-standard","hentry","category-news","tag-219","tag-2680","tag-5299"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/41141","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=41141"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/41141\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=41141"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=41141"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=41141"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=41141"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}