{"id":27649,"date":"2025-01-23T14:33:51","date_gmt":"2025-01-23T06:33:51","guid":{"rendered":"https:\/\/www.1ai.net\/?p=27649"},"modified":"2025-01-23T14:33:51","modified_gmt":"2025-01-23T06:33:51","slug":"%e8%87%aa%e4%b8%bb%e6%93%8d%e4%bd%9c%e7%94%b5%e8%84%91%e7%9a%84%e5%a4%9a%e6%a8%a1%e6%80%81-agent-%e5%8d%87%e7%ba%a7%ef%bc%8c%e6%99%ba%e8%b0%b1-glm-pc-%e5%bc%80%e6%94%be%e4%bd%93%e9%aa%8c","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/27649.html","title":{"rendered":"Multimodal Agent Upgrade for Autonomous Operating Computers, Smart Spectrum GLM-PC Open Experience"},"content":{"rendered":"<p>January 23rd, Beijing<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%99%ba%e8%b0%b1\" title=\"[View articles tagged with [Smart Spectrum]]\" target=\"_blank\" >Zhipu<\/a>Wachovia Technologies Ltd. today issued a letter announcing that<strong>Its Smart Spectrum GLM-PC Open Experience<\/strong>The claim that \"autonomous operation of computers<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%9a%e6%a8%a1%e6%80%81\" title=\"[View articles tagged with [multimodal]]\" target=\"_blank\" >Multimodality<\/a> <a href=\"https:\/\/www.1ai.net\/en\/tag\/agent\" title=\"[View articles tagged with [Agent]]\" target=\"_blank\" >Agent<\/a> Re-Escalation\".<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-27650\" title=\"cb8c9999j00sqj3ii005bd000u000g6p\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2025\/01\/cb8c9999j00sqj3ii005bd000u000g6p.jpg\" alt=\"cb8c9999j00sqj3ii005bd000u000g6p\" width=\"1080\" height=\"582\" \/><\/p>\n<p>It is reported that GLM-PC is based on the Chi-Spectrum Multi-Modal Large Model CogAgent.<strong>The world's first publicly accessible, turnkey computerized intelligence (agent).<\/strong>GLM-PC v1.0 was released in open beta on November 29, 2024. GLM-PC v1.0 was released on November 29, 2024, and is currently in open beta, with a new \"Deep Thinking\" mode, additional features dedicated to logical reasoning and code generation, and support for Windows.<\/p>\n<p>1AI learned from the official information of Smart Spectrum that GLM-PC has the following capabilities:<\/p>\n<h3 data-vmark=\"b050\">Code Generation and Logic Execution<\/h3>\n<blockquote>\n<ul class=\"small-size list-paddingleft-2\">\n<li>\n<p data-vmark=\"1bcf\">Planning: Supports comprehensive analysis of goals as well as available resources, generates execution roadmaps, and automatically breaks down large tasks into manageable sub-tasks to build out clear execution paths.<\/p>\n<\/li>\n<li>\n<p data-vmark=\"31d6\">Loop Execution: After the planning phase, the support launches the code generation module to execute a logical loop that progressively advances the task to completion. This looping mechanism ensures precise execution and a high degree of automation of tasks, thus realizing a complete closed loop from input to output without human intervention.<\/p>\n<\/li>\n<li>\n<p data-vmark=\"14eb\">Long-thinking ability: support real-time adjustment, reflection and correction and self-correction to continuously optimize the solution. Specific performance: when the process is interrupted by external factors, the logical path can be reconstructed; when encountering a lack of information, it can take the initiative to interact with the user and improve the task execution plan by asking questions.<\/p>\n<\/li>\n<\/ul>\n<\/blockquote>\n<h3 data-vmark=\"62fd\">Graphics and GUI Cognition<\/h3>\n<blockquote>\n<ul class=\"small-size list-paddingleft-2\">\n<li>\n<p data-vmark=\"55a4\">GUI image understanding: accurately recognize graphical interface elements (e.g., buttons, icons, layouts, etc.) and understand their functions and interaction logic<\/p>\n<\/li>\n<li>\n<p data-vmark=\"3338\">User behavior cognition: Combining the learning of the user interface and the understanding of historical operation information, it provides users with intelligent recommended operations for the current interface<\/p>\n<\/li>\n<li>\n<p data-vmark=\"fdba\">Image Semantic Parsing: In-depth semantic analysis of complex images to extract key information such as text, identifiers, and trends and metrics in data visualization charts and graphs.<\/p>\n<\/li>\n<li>\n<p data-vmark=\"085b\">Multi-modal information fusion: Fusing image and text information to form a comprehensive perceptual result. For example, recognizing both button positions and text labels in the user interface helps the \"left brain\" to make precise operation plans.<\/p>\n<\/li>\n<\/ul>\n<\/blockquote>","protected":false},"excerpt":{"rendered":"<p>January 23 news, Beijing Zhi Spectrum Huazhang Technology Co., Ltd. today announced that its Zhi Spectrum GLM-PC open experience, claiming that \"autonomous operation of the computer's multimodal Agent upgraded\". According to the introduction, GLM-PC is the world's first public-oriented, back to the car that is used computer intelligence (agent), based on the Chi Spectrum multimodal large model CogAgent. GLM-PC v1.0 was released on November 29, 2024 and opened for internal testing, and now it has newly introduced the \"Deep Thinking\" mode, added functions dedicated to logical reasoning and code generation, and provided support for Windows systems. 1A<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[1405,592,2680],"collection":[],"class_list":["post-27649","post","type-post","status-publish","format-standard","hentry","category-news","tag-agent","tag-592","tag-2680"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/27649","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=27649"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/27649\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=27649"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=27649"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=27649"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=27649"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}