{"id":15111,"date":"2024-07-09T09:22:53","date_gmt":"2024-07-09T01:22:53","guid":{"rendered":"https:\/\/www.1ai.net\/?p=15111"},"modified":"2024-07-09T09:22:53","modified_gmt":"2024-07-09T01:22:53","slug":"%e5%bf%ab%e6%89%8b%e5%bc%80%e6%ba%90%e5%9b%be%e5%83%8f%e7%94%9f%e6%88%90%e6%a8%a1%e5%9e%8b%e5%8f%af%e5%9b%bekolors-%e6%94%af%e6%8c%81%e5%9c%a8%e7%94%bb%e9%9d%a2%e4%b8%ad%e7%94%9f%e6%88%90%e6%96%87","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/15111.html","title":{"rendered":"Kuaishou open-sources image generation model Kolors to support text generation in the picture"},"content":{"rendered":"<p><a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bf%ab%e6%89%8b\" title=\"[See articles with [fast-hand] labels]\" target=\"_blank\" >quick worker<\/a>A big move was made.<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >Open Source<\/a>I've got my own image generation model<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%8f%af%e5%9b%be\" title=\"[See articles with [graphable] labels]\" target=\"_blank\" >Ketu<\/a> Kolors. This is not an ordinary model; it has trained on billions of text images, carrying the Universal Language Model (GLM) as a text encoder, supporting bilingual Chinese and English language hints, and processing a context of 256 tokens\u3002<\/p>\n<p><strong>Kolors Features at a Glance:<\/strong><\/p>\n<ul>\n<li><strong>Chinese and English bilingual support:<\/strong>The general language model (GLM) is used as the text encoder, so that the model is not only proficient in English, but also can perfectly understand and use Chinese prompts.<\/li>\n<li><strong>Long text processing capabilities:<\/strong>Supporting a context length of up to 256 tokens, it allows creators to describe their ideas in detail, whether it is a complex scene or a rich story.<\/li>\n<li><strong>Massive data training:<\/strong>Trained on billions of text-image pairs, the model has a large knowledge base and is able to generate diverse and accurate images.<\/li>\n<li><strong>Optimization of Chinese cultural elements:<\/strong>Special optimization has been carried out for Chinese cultural elements, making the generated images more in line with Chinese cultural characteristics and meeting localization needs.<\/li>\n<li><strong>Chinese text generation:<\/strong>The Graphical Kolors not only understands Chinese, but also embeds Chinese text in the resulting picture, adding more expression to the image\u3002<\/li>\n<\/ul>\n<p>After testing, I found that the performance of inserting Chinese into pictures is better now, and it can basically be output correctly, but for English, it is easy to miss words or make mistakes.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15113\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/6385603496134512452924687.jpg\" alt=\"\" width=\"1000\" height=\"569\" \/><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-15112\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/6385603498194057149685401.jpg\" alt=\"\" width=\"1000\" height=\"505\" \/><\/p>\n<p>As you can see, the Chinese version of the lying cat generated above is completely fine, but when I change it to &quot;AIbase&quot;, some characters are missing. As far as Chinese output is concerned, Ketu&#039;s performance is remarkable, but please note that the text should not be too long, otherwise it is easy to make mistakes.<\/p>\n<p>This model is not just a simple tool, it has the powerful technical support of Kuaishou. It is trained on massive data and has special optimization for Chinese cultural elements, so the generated images are more Chinese. This is not only a technological breakthrough, but also a cultural inheritance.<\/p>\n<p>The open source plan also includes CN (ControlNet) support, LoRa (low-rank adaptation), IPA (image prompt adaptation) and ComfyUI direct support, all of which are designed to make your creative process more smooth and personalized.<\/p>\n<p><strong>Technical details:<\/strong><\/p>\n<ul>\n<li>\u201cFartable Kolors\u201d is based on the SDXL model structure and incorporates ChatGLM256 technology to enhance bilingual understanding and text generation\u3002<\/li>\n<li>It is worth noting that running this model requires a large amount of video memory, about 19GB, which may have certain requirements on the hardware device.<\/li>\n<\/ul>\n<p>The quick-hand open source, Kolors, is not only a contribution to the technological community, but also a bold boost to creative freedom. This shows the technical resolve and strength of fast hands in AI and shows us the infinite potential of AI in artistic creation\u3002<\/p>\n<p>Ketu official website: https:\/\/top.aibase.com\/tool\/kuaishouketudamoxingkolors<\/p>\n<p>Project address:<a href=\"https:\/\/www.1ai.net\/en\/12103.html\/\">https:\/\/www.1ai.net\/12103.html\u00a0<\/a><\/p>","protected":false},"excerpt":{"rendered":"<p>A big move was made by the quick hand, which opened up a home-grown image generation model \u2014 \u201cFartable Kolors\u201d. This is not an ordinary model; it has trained on billions of text images, carrying the Universal Language Model (GLM) as a text encoder, supporting bilingual Chinese and English language hints, and processing a context of 256 tokens. Fig. Kolors Feature Summary: bilingual support in Chinese and English: Using the Universal Language Model (GLM) as a text encoder, the model is not only fluent in English, but also perfectly understands and uses Chinese language tips. Long text processing: Supports the length of the context of 256 tokens, allowing creators to elaborate on what they want, whether complex scenes or rich stories. Mass<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[2903,2861,219,2859],"collection":[],"class_list":["post-15111","post","type-post","status-publish","format-standard","hentry","category-news","tag-kolors","tag-2861","tag-219","tag-2859"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/15111","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=15111"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/15111\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=15111"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=15111"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=15111"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=15111"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}