{"id":25922,"date":"2024-12-30T14:36:58","date_gmt":"2024-12-30T06:36:58","guid":{"rendered":"https:\/\/www.1ai.net\/?p=25922"},"modified":"2024-12-30T14:36:58","modified_gmt":"2024-12-30T06:36:58","slug":"%e7%81%b5%e5%88%9d%e6%99%ba%e8%83%bd%e5%8f%91%e5%b8%83%e9%a6%96%e4%b8%aa%e5%9f%ba%e4%ba%8e%e5%bc%ba%e5%8c%96%e5%ad%a6%e4%b9%a0%e7%9a%84%e7%ab%af%e5%88%b0%e7%ab%af%e5%85%b7%e8%ba%ab%e6%a8%a1%e5%9e%8b-p","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/25922.html","title":{"rendered":"Psi R0, the First Reinforcement Learning-Based End-to-End Embodiment Model for Dual Dexterous Hands to Perform Complex Operations"},"content":{"rendered":"<p>December 30th.<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e7%81%b5%e5%88%9d%e6%99%ba%e8%83%bd\" title=\"_Other Organiser\" target=\"_blank\" >Spiritual Intelligence<\/a>release<strong>The first end-to-end reinforcement learning (RL) based<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%85%b7%e8%ba%ab%e6%a8%a1%e5%9e%8b\" title=\"[Sees articles with labels]\" target=\"_blank\" >body model<\/a> Psi R0<\/strong>.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-25923\" title=\"8bb2c144j00spanoe00cfd000u000hsp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/12\/8bb2c144j00spanoe00cfd000u000hsp.jpg\" alt=\"8bb2c144j00spanoe00cfd000u000hsp\" width=\"1080\" height=\"640\" \/><\/p>\n<p>1AI has learned that the model supports<strong>Dual dexterity for complex operations<\/strong>The Psi R0 can be used to generate an intelligent body with reasoning ability to accomplish and close the loop of long-range dexterous operation tasks by mixing and training multiple skills in tandem. Moreover, Psi R0 can also generalize across item and scene levels.<\/p>\n<p><a href=\"https:\/\/weibo.com\/tv\/show\/1034:5117237460140056?from=old_pc_videoshow\" target=\"_blank\" rel=\"noopener\"><img id=\"dingyue_15_1735540558089\" \/><\/a><\/p>\n<p>Taking an e-commerce scenario as an example, the packing of goods is a typical long-distance task, requiring tens of thousands of items to be grabbed, scanned, placed, and tied in plastic bags, etc. The Psi R0 is able to complete this series of actions smoothly with a pair of dexterous hands (officially known as<strong>This series of movements can replace a complete workstation at the customer's site.<\/strong>), becoming the first embodied robot trained to perform long-range dexterous manipulation tasks based on reinforcement learning.<\/p>\n<p>Officially, the RL-based Psi R0 model uses massive simulation data to train a two-handed operating intelligence, and connects multiple skills in tandem through a bidirectional training framework, which is the first in the industry to complete long-range tasks in open environments, with strong generalization capabilities and high robustness.<\/p>\n<p>This skill training framework abstracts key information from object spatio-temporal trajectories to construct a generalized objective function, thus solving the problem of difficult reward function design. In the post-training phase, the success rate of long-range tasks is further improved by aligning a small amount of high-quality real-machine data.<\/p>\n<p>In addition, the transfer feasibility function in the bi-directional training framework plays an important role in fine-tuning the skills to improve the success rate and generalization of the tandem, and at the same time gives the model the ability to switch skills autonomously, so that it can quickly adjust its strategy when it encounters operational failures to ensure a high success rate.<\/p>","protected":false},"excerpt":{"rendered":"<p>December 30, 2011 - Lingchu Intelligence released the first end-to-end Reinforcement Learning (RL)-based embodied model, Psi R0. 1AI has learned that the model supports dual dexterous hands to collaborate on complex operations, mixing multiple skills in tandem and generating intelligences with reasoning capabilities to complete and close the loop on long-range dexterous operation tasks. Moreover, Psi R0 can also realize cross-item and cross-scene level generalization. Taking e-commerce scenario as an example, product packing is a typical long-range task, which requires tens of thousands of products to be grasped, scanned, placed, and tied in a plastic bag, etc. Psi R0 is able to use two dexterous hands to smoothly complete this series of actions (the official claim is that this series of actions can replace a complete workstation in the customer's site), and it has become the first intelligent body to complete the long range dexterity operation based on the training of reinforcement learning.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[5342,5341],"collection":[],"class_list":["post-25922","post","type-post","status-publish","format-standard","hentry","category-news","tag-5342","tag-5341"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/25922","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=25922"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/25922\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=25922"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=25922"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=25922"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=25922"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}