{"id":10656,"date":"2024-05-19T11:20:35","date_gmt":"2024-05-19T03:20:35","guid":{"rendered":"https:\/\/www.1ai.net\/?p=10656"},"modified":"2024-05-19T11:20:35","modified_gmt":"2024-05-19T03:20:35","slug":"%e8%85%be%e8%ae%af%e6%b7%b7%e5%85%83%e6%96%87%e7%94%9f%e5%9b%be%e5%a4%a7%e6%a8%a1%e5%9e%8b%e5%bc%80%e6%ba%90%ef%bc%9a%e9%80%82%e5%90%88%e5%9b%bd%e4%ba%ba%e7%9a%84%e6%96%87%e7%94%9f%e5%9b%be%e6%a8%a1","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/10656.html","title":{"rendered":"Tencent Hunyuan Wenshengtu Large Model Open Source: Wenshengtu Model Suitable for Chinese People"},"content":{"rendered":"<p data-pm-slice=\"0 0 []\">First OpenAI released GPT-4o, then Google released Imagen3, and now, the<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e8%85%be%e8%ae%af\" title=\"[View articles tagged with [Tencent]]\" target=\"_blank\" >Tencent<\/a>Delivered his share as well:<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e6%b7%b7%e5%85%83\" title=\"_Other Organiser\" target=\"_blank\" >origin of the universe<\/a>Vincent Figure<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%a7%e6%a8%a1%e5%9e%8b\" title=\"[View articles tagged with [large models]]\" target=\"_blank\" >Large Model<\/a>Fully upgraded and<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >Open Source<\/a>!<\/p>\n<p data-track=\"242\">This is the industry's first Chinese native DiT architecture open source model , and supports bilingual input and understanding .<\/p>\n<p data-track=\"244\">What is DiT architecture? Simply put, Stable diffusion3 and Sora are also using this architecture, but at present Sora is not open to the public, and Stable diffusion3 is not completely open source as previously said, while the hybrid big model is completely open source, from this point of view, I think Tencent hybrid team is still very sincere! (Kudos)<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10657\" title=\"get-352\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-352.jpg\" alt=\"get-352\" width=\"1080\" height=\"608\" \/><\/div>\n<p data-track=\"245\">Note: the official release of the architecture diagram, interested in can look at, do not understand you can let Kimi or GPT4 teach you, the test is valid!<\/p>\n<p data-track=\"247\">So how does the Hybrid-DiT model perform? Allow me to tell you how<\/p>\n<p data-track=\"249\">1\u3001Support Chinese prompt word<\/p>\n<p data-track=\"250\">As mentioned earlier, the Hybrid-DiT model supports both Chinese and English inputs, so it's a relatively big plus for domestic friends who don't have to go through the process of converting Chinese to English.<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10658\" title=\"get-353\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-353.jpg\" alt=\"get-353\" width=\"1080\" height=\"656\" \/><\/div>\n<p data-track=\"251\">Note: Here are a couple of renderings that were released<\/p>\n<p data-track=\"253\">2. Long text comprehension skills<\/p>\n<p data-track=\"254\">Simply put, it is able to analyze and understand the information in long texts and generate corresponding artworks, and this is the official effect image released<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10659\" title=\"get-354\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-354.jpg\" alt=\"get-354\" width=\"1080\" height=\"535\" \/><\/div>\n<p data-track=\"256\">3\u3001Support multi-round dialog<\/p>\n<p data-track=\"257\">This means that it is possible to keep modifying the image through multiple conversations to achieve our requirements, after all, sometimes a single conversation doesn't work well enough to generate a satisfactory image.<\/p>\n<p data-track=\"259\">If you read my previous post introducing the Vincennes video tool Pika, you probably won't be unfamiliar with this feature, as Pika also supports multiple rounds of dialog to modify videos.<\/p>\n<p data-track=\"264\">So how do you experience the Hybrid-DiT model? Unfortunately, at the moment, if at all, I have not found a place where I can experience it online!<\/p>\n<p data-track=\"266\">Although the official website of the Hybrid-DiT model mentions that you are welcome to experience it in the Tencent Hybrid Assistant, I logged in and found that the model in there is not the new open source one (or am I not in the grayscale?). , for three reasons:<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10660\" title=\"get-355\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-355.jpg\" alt=\"get-355\" width=\"1080\" height=\"491\" \/><\/div>\n<p data-track=\"268\">The first is that it's marked at the bottom as being based on Tencent's Hybrid Grand Model V1.7.6, and there's no latest open source news in the message center either<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10661\" title=\"get-356\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-356.jpg\" alt=\"get-356\" width=\"1010\" height=\"249\" \/><\/div>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10663\" title=\"get-358\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-358.jpg\" alt=\"get-358\" width=\"740\" height=\"565\" \/><\/div>\n<p data-track=\"270\">The second is that in the official video put out, the version I saw demoed was actually 2.0<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10662\" title=\"get-357\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-357.jpg\" alt=\"get-357\" width=\"1010\" height=\"539\" \/><\/div>\n<p data-track=\"272\">The third is by feel, in the hybrid assistant generated in the picture obviously feel not as good as the official website put out, and also need to \"generate a picture\" and other prompt words trigger.<\/p>\n<p data-track=\"274\">So if you want to experience it, you can only refer to the instructions on Github to install and experience it, and it just so happens that my computer configuration meets the requirements, which will be followed up by a separate installment of the instruction<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10664\" title=\"get-359\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-359.jpg\" alt=\"get-359\" width=\"1080\" height=\"467\" \/><\/div>\n<p data-track=\"276\">Finally in terms of how it compares to other Venn diagram models, here's a test comparison put up on Github:<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-10665\" title=\"get-360\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/05\/get-360.jpg\" alt=\"get-360\" width=\"1080\" height=\"524\" \/><\/div>\n<p data-track=\"277\">Note: This is the result of a review conducted by more than 50 professional reviewers<\/p>\n<p data-track=\"279\">In general, compared to other open source models is improved, but for some closed source models there is still a gap, I hope to become better under the power of open source!<\/p>\n<p data-track=\"281\"><strong>Related Addresses:<\/strong><\/p>\n<p data-track=\"282\">The official website of Hybrid-DiT:<strong>https:\/\/dit.hunyuan.tencent.com\/<\/strong><\/p>\n<p data-track=\"283\">Hybrid-DiT Github Address:<strong>https:\/\/github.com\/Tencent\/HunyuanDiT<\/strong><\/p>\n<p data-track=\"284\">Hybrid Assistant Address:<a href=\"https:\/\/www.1ai.net\/en\/6765.html\/\">https:\/\/www.1ai.net\/6765.html<b>\u00a0<\/b><\/a><\/p>","protected":false},"excerpt":{"rendered":"<p>OpenAI released GPT-4o, Google released Imagen3, and now, Tencent has also delivered his answer sheet: the mixed element literate diagram model is fully upgraded and open source! This is the industry's first Chinese native DiT architecture open source model, and supports bilingual input and understanding. What is DiT architecture? Simply put, Stable diffusion3 and Sora are also used in this architecture, but at present Sora is not open to the public, and Stable diffusion3 is not completely open source as previously stated, while the hybrid model is completely open source, from this point of view, I think Tencent hybrid team is still very sincere! (Points of praise) Note: the official release of the architecture of the diagram, feeling happy<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[144],"tags":[216,219,2656,323],"collection":[],"class_list":["post-10656","post","type-post","status-publish","format-standard","hentry","category-baike","tag-216","tag-219","tag-2656","tag-323"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/10656","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=10656"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/10656\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=10656"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=10656"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=10656"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=10656"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}