{"id":16478,"date":"2024-07-26T09:31:46","date_gmt":"2024-07-26T01:31:46","guid":{"rendered":"https:\/\/www.1ai.net\/?p=16478"},"modified":"2024-07-26T09:31:46","modified_gmt":"2024-07-26T01:31:46","slug":"%e4%b8%8a%e6%b5%b7%e7%a7%91%e6%8a%80%e5%a4%a7%e5%ad%a6%e6%8e%a8%e6%96%b0ai%e6%a8%a1%e5%9e%8bclay%ef%bc%9a%e4%bb%8e%e6%96%87%e5%ad%97%e5%92%8c%e5%9b%be%e5%83%8f%e7%94%9f%e6%88%90%e8%af%a6%e7%bb%863d","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/16478.html","title":{"rendered":"ShanghaiTech University launches new AI model CLAY: Generate detailed 3D objects from text and images"},"content":{"rendered":"<p data-pm-slice=\"0 0 []\"><a href=\"https:\/\/www.1ai.net\/en\/tag\/%e4%b8%8a%e6%b5%b7%e7%a7%91%e6%8a%80%e5%a4%a7%e5%ad%a6\" title=\"[Sees articles with tags from Shanghai University of Technology]\" target=\"_blank\" >ShanghaiTech University<\/a>Scientists have recently developed a<a href=\"https:\/\/www.1ai.net\/en\/tag\/clay\" title=\"_OTHER ORGANISER\" target=\"_blank\" >CLAY<\/a>of<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e4%ba%ba%e5%b7%a5%e6%99%ba%e8%83%bd%e6%a8%a1%e5%9e%8b\" title=\"_Other Organiser\" target=\"_blank\" >Artificial Intelligence Model<\/a>CLAY is a machine learning model that can generate detailed 3D objects from text descriptions or 2D images. Compared with previous technologies, CLAY has achieved significant breakthroughs in the quality and diversity of generated 3D objects.<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-16479\" title=\"get-820\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/get-820.jpg\" alt=\"get-820\" width=\"1643\" height=\"639\" \/><\/div>\n<p data-track=\"61\">The core of the CLAY model consists of a multi-resolution variational autoencoder (VAE) and a diffusion transformer (DiT). The VAE is responsible for encoding 3D geometries at different levels of detail into the latent space, while the DiT is responsible for generating these geometries. Unlike many other systems, CLAY is able to process 3D content directly without converting it to 2D images first.<\/p>\n<p data-track=\"62\">CLAY&#039;s training data exceeds 500,000 3D models, covering a variety of objects from simple everyday objects to complex fantasy creatures. In addition, CLAY has the ability to be controlled by additional inputs, and users can achieve precise control over the generated results by specifying rough shapes (such as voxel structures, point clouds) or bounding boxes. This flexibility allows CLAY to generate entire urban scenes and even reconstruct detailed 3D models from hand-drawn sketches.<\/p>\n<p data-track=\"63\"><strong>When compared with other systems (such as Shap-E, DreamFusion, and Wonder3D), CLAY shows clear advantages.<\/strong>Whether it is text to 3D or image to 3D, CLAY can generate more consistent geometry, smoother surfaces and finer details.<strong>CLAY also generates high-quality 3D assets incredibly quickly, in just about 45 seconds, whereas some comparable systems may take hours to optimize.<\/strong><\/p>\n<p data-track=\"64\">The potential applications of CLAY are wide-ranging, including areas such as game development, filmmaking, and 3D printing. Still, the researchers are aware of the potential risks of AI-generated virtual content, so they plan to add more safety measures to ensure responsible use.<\/p>\n<p data-track=\"65\">In the future, the researchers also plan to further expand the training data, improve the model quality, and integrate geometry generation and material synthesis into a single model to achieve more comprehensive capabilities. A version of CLAY can be accessed through the 3D-Gen service Rodin.<\/p>\n<p data-track=\"66\">Product portal: https:\/\/hyperhuman.deemos.com\/rodin<\/p>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>Scientists at the Shanghai University of Science and Technology have recently developed an artificial intelligence model called CLAY, which can generate detailed 3D objects from textual descriptions or 2D images. Compared to previous techniques, CLAY represents a significant breakthrough in the quality and diversity of 3D objects generated. The core of the CLAY model consists of a multi-resolution Variable Auto-Encoder (VAE) and a Diffusion Transformer (DiT).The VAE is responsible for encoding 3D geometries at different levels of detail into the latent space while the DiT is responsible for generating these geometries. Unlike many other systems, CLAY is able to process 3D content directly without first converting it to 2D images. CLAY's training data exceeds 500,000 3D models, covering everything from simple everyday objects to complex phantom<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[167,3710,3709,599],"collection":[],"class_list":["post-16478","post","type-post","status-publish","format-standard","hentry","category-news","tag-ai","tag-clay","tag-3709","tag-599"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16478","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=16478"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16478\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=16478"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=16478"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=16478"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=16478"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}