{"id":53656,"date":"2026-06-04T11:55:35","date_gmt":"2026-06-04T03:55:35","guid":{"rendered":"https:\/\/www.1ai.net\/?p=53656"},"modified":"2026-06-04T11:55:35","modified_gmt":"2026-06-04T03:55:35","slug":"%e4%b8%80%e9%94%ae%e7%94%9f%e6%88%90ai%e7%9f%ad%e7%89%87%ef%bc%8cgpt-%e5%88%86%e9%95%9cseedance-2-0%e5%85%a8%e6%b5%81%e7%a8%8b%e8%af%a6%e8%a7%a3%e6%95%99%e7%a8%8b","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/53656.html","title":{"rendered":"One key to generate an AI short, GPT lens +Seedance 2.0 full process solution"},"content":{"rendered":"<p>Recently a particularly interesting stream of work has been painted:<strong>GPT TO GENERATE ACTION LABELS FIRST<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%88%86%e9%95%9c%e5%9b%be\" title=\"[Sees articles with [simulation] labels]\" target=\"_blank\" >Mirror<\/a>And feed Feedance 2.0 directly to a consistent short film\u3002<\/strong><\/p>\n<p>I've been testing one of them lately<a href=\"https:\/\/www.1ai.net\/en\/tag\/ai%e8%a7%86%e9%a2%91\" title=\"[View articles tagged with [AI Video]]\" target=\"_blank\" >AI Video<\/a>Workstream\u3002<\/p>\n<p>Used to<a href=\"https:\/\/www.1ai.net\/en\/tag\/ai%e7%9f%ad%e7%89%87\" title=\"_OTHER ORGANISER\" target=\"_blank\" >AI short film<\/a>What's the worst headache<\/p>\n<p>It's not pretty enough. It's: moving around. The shot's out of control. The character is like this for a second. The next second is like an actor. It wasn't easy to produce a video, and the result wasn't exactly what it wanted. But recently I found a very useful combination:<\/p>\n<p>GPT + GPT Image + Seedance 2.0 first let GPT design the action lens\u3002<\/p>\n<p>Let GPT draw the specs with arrows. It's the last time you throw it to Seedance to generate a video\u3002<\/p>\n<p>The key is the direction of the action in the spectroscope, the trajectories of the lens, the movement of people, and the ability of Seedance to understand\u3002<\/p>\n<p>The resulting video was executed almost according to the lens 1:1\u3002<\/p>\n<p>THE VIDEO WAS FINALLY TURNED FROM A \"TICK CARD\" TO A \"DIRECTOR\"\u3002<\/p>\n<p>We'll tear the whole thing down to you today. It'll work\u3002<\/p>\n<p><strong>STEP 1: GET GPT TO WRITE A SCRIPT OF THE TEXT<\/strong><\/p>\n<p>A lot of people went straight to the spectroscopy, and half of them found that the action logic was not right and the lens was broken\u3002<\/p>\n<p>The right thing to do is..<strong>Write a text version of the schedule first<\/strong>And make it visual\u3002<\/p>\n<p>How do you write a hint<\/p>\n<p>YOU NEED TO GET GPT INTO THE ROLE OF A MOVIE ACTION DESIGNER:<\/p>\n<p><strong>Tips:<\/strong><\/p>\n<p><span class=\"wx_text_underline\">You're a film-class martial arts designer, and you're good at designing coherent, robust action lenses\u3002<\/span><\/p>\n<p><span class=\"wx_text_underline\">Please generate an action spectroscopy script of about 15 seconds, including:<\/span><\/p>\n<ol class=\"list-paddingleft-1\">\n<li>\n<section><span class=\"wx_text_underline\">Number and duration of each shot<\/span><\/section>\n<\/li>\n<li>\n<section><span class=\"wx_text_underline\">Speculation of the lens (character\/median view\/vision\/topography, etc.)<\/span><\/section>\n<\/li>\n<li>\n<section><span class=\"wx_text_underline\">Role position and track in the picture<\/span><\/section>\n<\/li>\n<li>\n<section><span class=\"wx_text_underline\">Camera motion (push\/push\/smash\/smash)<\/span><\/section>\n<\/li>\n<li>\n<section><span class=\"wx_text_underline\">Key action rhythm description<\/span><\/section>\n<\/li>\n<\/ol>\n<p><span class=\"wx_text_underline\">Subject:<\/span><\/p>\n<p><span class=\"wx_text_underline\">Role: [insert role description]<\/span><\/p>\n<p><span class=\"wx_text_underline\">Style: [insert style keyword]<\/span><\/p>\n<p>Give me a chestnut<\/p>\n<p>FOR EXAMPLE, I WANT A 15-SECOND SHORT FILM OF THE OLD WIND'S GATE, AND GPT RETURNS TO A TEXT LENS LIKE THIS:<\/p>\n<ul>\n<li><strong>camera 1 (2s)<\/strong>: Vision\/set, Mountain Gate Panorama, morning fog, role above step<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 2 (2s)<\/strong>\u00a0: Midview\/Stalking, role step up, dress up<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 3 (1.5s)<\/strong>: close up, the role pupils are shrunk, the dark fog is rising far away<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 4 (2s)<\/strong>: Vision\/push lens, black fog swallows the gates, oppression is full<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 5 (1.5s)<\/strong>: Midview\/opposite, the part pulls the sword around, the sword slashs through the dark<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 6 (2s)<\/strong>\u00a0: close-up\/slow motion, blades of black fog, fragments scattered<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 7 (2s)<\/strong>\u00a0: Vision\/Link, part of the debris of independence, all around<\/li>\n<\/ul>\n<ul>\n<li><strong>camera 8 (2s)<\/strong>\u00a0: close-up, character eyeballs display red light, picture set<\/li>\n<\/ul>\n<p>With this text, there is evidence for the second step\u3002<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-53657\" title=\"317f841aj00tg39a100avd000ir00byp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/317f841aj00tg39a100avd000ir00byp.jpg\" alt=\"317f841aj00tg39a100avd000ir00byp\" width=\"675\" height=\"430\" \/><\/p>\n<p><strong>Step 2: Generate a spectroscopy with labels using GPT Image 2<\/strong><\/p>\n<p>Pure spectroscopy you can read, but Seedance can't\u3002<\/p>\n<p>So let GPT Image 2 put the word \"draw\" out..<strong>The point is to take the track<\/strong>.<\/p>\n<p>Cue word templates<\/p>\n<p><strong>Tips:<\/strong><\/p>\n<p>Please generate a 3x4 film spectroscopy grid map (out of 12), each of which contains:<\/p>\n<ol class=\"list-paddingleft-1\">\n<li>\n<section>Image content (fireman\/simplistic drawings, focus on action attitude and location)<\/section>\n<\/li>\n<li>\n<section>Red arrow indicates role direction<\/section>\n<\/li>\n<li>\n<section>Blue arrow points the camera track<\/section>\n<\/li>\n<li>\n<section>Number and length of lens per cell<\/section>\n<\/li>\n<\/ol>\n<p>Require:<\/p>\n<ul class=\"list-paddingleft-1\">\n<li>\n<section>All visual characters look the same (physical, clothing, identity)<\/section>\n<\/li>\n<li>\n<section>The movement direction must be clear and the arrows must not overlap<\/section>\n<\/li>\n<li>\n<section>The whole picture is clean, the lines are simple and powerful<\/section>\n<\/li>\n<li>\n<section>style: film pre-visible spectroscope style<\/section>\n<\/li>\n<\/ul>\n<p>Text Script:<\/p>\n<p>[Pasting text spectroscopy generated by the first step]<\/p>\n<p>Why the matchmaker\/simple<\/p>\n<p>because<strong>THE CORE OF THE SPECS IS NOT TO MAKE IT LOOK GOOD<\/strong>.<\/p>\n<p>Instead, a fine-drawing wind interferes with Seedance's identification of action -- it focuses on the details of the picture, not on the track. The matchesman plus arrow points out that Seedance understands better\u3002<\/p>\n<p>Actual effects<\/p>\n<p>GPT Image 2 produces a spectrograph about this:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-53658\" title=\"4238765ej00tg39ay00dad000ip00dep\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/4238765ej00tg39ay00dad000ip00dep.jpg\" alt=\"4238765ej00tg39ay00dad000ip00dep\" width=\"673\" height=\"482\" \/><\/p>\n<ul>\n<li>12 grids, one shot each<\/li>\n<li>The role is presented in simple matches<\/li>\n<li>Red arrows indicate the movement and movement of people<\/li>\n<li>Blue Arrow Marks the camera and pulls<\/li>\n<li>Number of hours per grid lower right corner<\/li>\n<\/ul>\n<p>This is the input certificate for step three\u3002<\/p>\n<p><strong>Step three: Feed Feedance 2.0, mirror directly to video<\/strong><\/p>\n<p>This is the best step\u3002<\/p>\n<p>Throws the spectroscopy generated by the previous step together with the role-setting diagram to Seedance 2.0, with a precise hint, to generate consistent video following the arrow direction and action logic in the spectroscopy\u3002<\/p>\n<p>Cue word templates<\/p>\n<p><strong><span class=\"wx_text_underline\">Tips:<\/span><\/strong><\/p>\n<p><span class=\"wx_text_underline\">This is a role-setting diagram {Portrait}. Please follow the lens-by-scope action in this spectrograph\u3002<\/span><\/p>\n<p><span class=\"wx_text_underline\">Here's the text of the action script script:<\/span><\/p>\n<p><span class=\"wx_text_underline\">[Playing text lens for first step]<\/span><\/p>\n<p><span class=\"wx_text_underline\">Require:<\/span><\/p>\n<ul class=\"list-paddingleft-1\">\n<li>\n<section><span class=\"wx_text_underline\">Follow strictly the direction and trajectory of the lens as indicated in the spectrograph<\/span><\/section>\n<\/li>\n<li>\n<section><span class=\"wx_text_underline\">The role looks and sets are consistent<\/span><\/section>\n<\/li>\n<li>\n<section>It's natural to connect, not jump frame or mutation<\/section>\n<\/li>\n<li>\n<section>The image style corresponds to the spectroscopy<\/section>\n<\/li>\n<\/ul>\n<p>Key Tips<\/p>\n<ol>\n<li><strong>The character set must be given<\/strong>:Seedance needs to know what the role looks like, or change the face of each frame<\/li>\n<li><strong>The text specs are to be posted together<\/strong>: Pure spectroscopy may throw out the details. Text supplements lock the action logic<\/li>\n<li><strong>Run the short clip test first<\/strong>: Don't run for 15 seconds at a time. Three to five seconds to try<\/li>\n<\/ol>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-53660\" title=\"40c5a5d7j00tg3900c9000b7d000hn00csp\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/40c5a5d7j00tg39c900b7d000hn00csp.jpg\" alt=\"40c5a5d7j00tg3900c9000b7d000hn00csp\" width=\"635\" height=\"460\" \/><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-53659\" title=\"54984624j00tg39k00bfd000ip00b9p\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/54984624j00tg39ck00bfd000ip00b9p.jpg\" alt=\"54984624j00tg39k00bfd000ip00b9p\" width=\"673\" height=\"405\" \/><\/p>\n<p>Full Workstream Overview<\/p>\n<p>Three steps, a simple version:<\/p>\n<section>\n<table>\n<thead>\n<tr>\n<th>\n<section>move<\/section>\n<\/th>\n<th>\n<section>tool<\/section>\n<\/th>\n<th>\n<section>enter<\/section>\n<\/th>\n<th>\n<section>Output<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>1 Writing Text Spectrum<\/section>\n<\/td>\n<td>\n<section>GPT<\/section>\n<\/td>\n<td>\n<section>scene + character + style requirements<\/section>\n<\/td>\n<td>\n<section>Text Script with View\/Trace\/Time<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>2 Draw tape speculation<\/section>\n<\/td>\n<td>\n<section>GPT Image 2<\/section>\n<\/td>\n<td>\n<section>Text Script<\/section>\n<\/td>\n<td>\n<section>3x4 grid spectrometry (including motion arrows)<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>3 Mirror Video<\/section>\n<\/td>\n<td>\n<section>Seedance 2.0<\/section>\n<\/td>\n<td>\n<section>Role + Mirror + Text script<\/section>\n<\/td>\n<td>\n<section>Coherent Action Short Film<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/section>\n<p>It's common for rookies to roll over<\/p>\n<ol>\n<li><strong>Skip the first step straight<\/strong>: The logic of the movement is confused and the lens cannot be connected<\/li>\n<li><strong>It's too delicate<\/strong>:Seedance's attention was distracted by the details of the scene, but the movement was lost<\/li>\n<li><strong>No character charting<\/strong>It's not the same for each role. It's like changing an actor<\/li>\n<li><strong>It's too short<\/strong>: Light Write \"Generate Video by this lens\" is not enough, locks the action logic again with text<\/li>\n<\/ol>\n<p><strong>Final Thoughts<\/strong><\/p>\n<p>The most exciting thing about this work stream is..<strong>THE SCHEMATIC IS NO LONGER JUST A REFERENCE FOR PEOPLE, BUT A \u201cCONSTRUCTION DRAWING\u201d THAT CAN BE READ DIRECTLY BY THE AI VIDEO TOOL\u3002<\/strong><\/p>\n<p>The arrows and trajectories drawn by GPT Image 2, which Seedance can really identify and execute, are a key step from the \"random card\" to \"precision control\"\u3002<\/p>\n<p>The threshold of the tool is low and the process is not complex, but the effect is visible\u3002<\/p>\n<p>There are no particularly high thresholds for the entire process\u3002<\/p>\n<p>But the improvement in the quality of video is clear\u3002<\/p>\n<p>IF YOU'RE DOING THE AI SHORTS, THE AI COMICS, THE AI ADS OR THE AI MOVIE PREVIEWS\u3002<\/p>\n<p>This workstream is worth a try\u3002<\/p>","protected":false},"excerpt":{"rendered":"<p>A particularly interesting workstream has recently been drawn: GPT is used first to generate action-labeled spectroscopy, then feeds to Seedance 2.0 and directly produces a coherent short film. I've recently been tested for an amazing AI video stream. What's the worst thing that ever happened to an AI short? It's not pretty enough. It's: moving around. The shot's out of control. The character is like this for a second. The next second is like an actor. It wasn't easy to produce a video, and the result wasn't exactly what it wanted. But recently I found a very good combination: GPT + GPT Image + Seedance 2.0 to get GPT to design the action lens. Let GPT draw the specs with arrows. It's the last time you throw it to Seedance to generate a video. Top<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[149,144],"tags":[3264,956,5321,8574],"collection":[],"class_list":{"0":"post-53656","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"hentry","6":"category-jiaocheng","7":"category-baike","8":"tag-ai","11":"tag-8574"},"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/53656","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=53656"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/53656\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=53656"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=53656"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=53656"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=53656"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}