{"id":53936,"date":"2026-06-18T15:58:43","date_gmt":"2026-06-18T07:58:43","guid":{"rendered":"https:\/\/www.1ai.net\/?p=53936"},"modified":"2026-06-18T15:59:08","modified_gmt":"2026-06-18T07:59:08","slug":"ai%e8%a7%86%e9%a2%91%e4%bf%9d%e6%8c%81%e4%ba%ba%e7%89%a9%e4%b8%80%e8%87%b4%e6%80%a7%e7%9a%84%e6%96%b9%e6%b3%95%ef%bc%8c%e4%b8%bb%e4%bd%93%e5%8f%82%e8%80%83-%e8%a7%92%e8%89%b2%e5%8f%82%e8%80%83","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/53936.html","title":{"rendered":"THE AAI VIDEO'S WAY OF KEEPING PEOPLE TOGETHER, SUBJECT REFERENCE + ROLE REFERENCE + FULL REFERENCE"},"content":{"rendered":"<p>Still saying \"<a href=\"https:\/\/www.1ai.net\/en\/tag\/ai%e8%a7%86%e9%a2%91\" title=\"[View articles tagged with [AI Video]]\" target=\"_blank\" >AI Video<\/a>People change their faces every frame?\"<\/p>\n<p><strong>Then you're probably still using the old 2024 method\u3002<\/strong><\/p>\n<p>IN 2026, THE AI VIDEO WAS GENERATED, AND THE MAN-CONFORMITY TECHNOLOGY HAS GONE THROUGH<strong>Three-generation evolution<\/strong>:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Phase<\/section>\n<\/th>\n<th>\n<section>technology<\/section>\n<\/th>\n<th>\n<section>Features<\/section>\n<\/th>\n<th>\n<section>Summary<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>First generation<\/section>\n<\/td>\n<td>\n<section>Image to Video<\/section>\n<\/td>\n<td>\n<section>We barely keep our characters on the front line<\/section>\n<\/td>\n<td>\n<section>If you move, you fall<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Second generation<\/section>\n<\/td>\n<td>\n<section>Role Reference<\/section>\n<\/td>\n<td>\n<section>Specially target character features<\/section>\n<\/td>\n<td>\n<section>Cross-scenes<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Third generation<\/section>\n<\/td>\n<td>\n<section>Multimedia Reference<\/section>\n<\/td>\n<td>\n<section>Picture + Video + Audio Multimodular 
Lock<\/section>\n<\/td>\n<td>\n<section>professional level<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Today, I put on the latest 2026\u00a0<strong>3 Large reference model<\/strong>\u00a0Everything:<\/p>\n<ol>\n<li><strong>Subject Reference Mode<\/strong>\u00a0- Single-person lock-in, for close-up and single-person scenes<\/li>\n<li><strong>Role Reference Mode<\/strong>\u00a0\u2014 IP-CLASS PEOPLE HOLD, CROSS-SCENES DON'T FAIL<\/li>\n<li><strong>Full Reference Mode<\/strong>\u00a0- Multi-modular integrated control, professional creator matching<\/li>\n<\/ol>\n<p>Each model is clear:<strong>What's the principle, how it works, which tool is the best, where the hole is\u3002<\/strong><\/p>\n<p>AFTER READING THIS, YOUR AI VIDEO CHARACTER'S CONSISTENCY WENT STRAIGHT FROM BEING BARELY VISIBLE TO BEING PROFESSIONAL\u3002<\/p>\n<p>Model I: Subject Reference - The most accurate single-person lockout<\/p>\n<p>The subject reference is currently one of the most accurate character-locking techniques, none\u3002<\/p>\n<p><strong>It's different from the nature of the map<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th><\/th>\n<th>\n<section>Traditional drawings<\/section>\n<\/th>\n<th>\n<section>Subject Reference<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>AI UNDERSTANDING<\/strong><\/td>\n<td>\n<section>The style of the \"Reference\" diagram<\/section>\n<\/td>\n<td>\n<section>It's the main subject of the \"Blank.\"<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Character consistency<\/strong><\/td>\n<td>\n<section>Maybe it's off<\/section>\n<\/td>\n<td>\n<section>Forced to keep five officials, hair, clothing<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>One word<\/strong><\/td>\n<td>\n<section>Take a picture<\/section>\n<\/td>\n<td>\n<section>Put it in<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>What tools support the subject reference<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>tool<\/section>\n<\/th>\n<th>\n<section>Subject 
reference capacity<\/section>\n<\/th>\n<th>\n<section>Maximum Reference Chart<\/section>\n<\/th>\n<th>\n<section>Special function<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Vidu Q3<\/strong><\/td>\n<td>\n<section>It's perfect<\/section>\n<\/td>\n<td>\n<section>Single Subject<\/section>\n<\/td>\n<td>\n<section>@grammatical call, subject lock strength reconciliation<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Seedance 2.0<\/strong><\/td>\n<td>\n<section>It's strong<\/section>\n<\/td>\n<td>\n<section>Single Subject<\/section>\n<\/td>\n<td>\n<section>Multimodular Reference<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>AI 2.0<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50 Stability<\/section>\n<\/td>\n<td>\n<section>Single Subject<\/section>\n<\/td>\n<td>\n<section>Long video segment maintained<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>General, 2.6<\/strong><\/td>\n<td>\n<section>Nice<\/section>\n<\/td>\n<td>\n<section>Support multiple subject<\/section>\n<\/td>\n<td>\n<section>Role play mode<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Vidu Master Reference Practice (most recommended)<\/p>\n<p>Vidu 's main reference is the current industry pole, which was achieved after version 1.5<strong>SINGLE SUBJECT 95%+ ACCURACY<\/strong>.<\/p>\n<p><strong>Step 1: Prepare high-quality reference maps<\/strong><\/p>\n<p>The quality of the reference map directly determines the locking effect, and the criteria are strict:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>It has to be done<\/section>\n<\/th>\n<th>\n<section>Never<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>HEADS OR MICRO-SIDES, HUMAN FACE RATIO \u2265 301 TP3T<\/section>\n<\/td>\n<td>\n<section>DON'T MIX IT WITH MULTIPLE ANGLES\/ EMOTICONS\/ CLOTHING<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Resolution not less than 1024 x 1024<\/section>\n<\/td>\n<td>\n<section>Do not use vague, low, watermarked maps to identify 
failure<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Even lighting, with no harsh shadows or overexposure<\/section>\n<\/td>\n<td>\n<section>\u2014<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Plain or clean background, so nothing interferes with subject recognition<\/section>\n<\/td>\n<td>\n<section>\u2014<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>No other people or objects in the frame<\/section>\n<\/td>\n<td>\n<section>\u2014<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\ud83d\udca1\u00a0<strong>Pro tip<\/strong>: It is better to use an AI-generated \"standard image\" as the reference than a photo of a real person. AI-generated images have clearer character features and lock more reliably.<\/p>\n<p><strong>Step 2: Enable subject reference<\/strong><\/p>\n<ol>\n<li>Open Vidu Studio and choose \"Image to Video\"<\/li>\n<li>Upload the prepared reference image<\/li>\n<li>Wait for the system to parse the subject (until \"Subject Analyzed\" is shown)<\/li>\n<li>Check for the \"Reference Character Locked\" notice in the top right corner; the subject is now 
Locked<\/li>\n<\/ol>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-53938\" title=\"02127e2jgthkn00xud000ugwm\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/021127e2j00tgthkn00xud000u000gwm.jpg\" alt=\"02127e2jgthkn00xud000ugwm\" width=\"1080\" height=\"608\" \/><\/p>\n<p><strong>Step 3: Invoke the subject precisely with @ syntax<\/strong><\/p>\n<p>This is the core technique of subject reference, binding the character inside the prompt:<\/p>\n<p>@ + image number<\/p>\n<p>@Figure 1 wears a blue windbreaker, turns around and smiles at the Shibuya crossing in Tokyo, blurred background, camera moving slowly<\/p>\n<p>Three key points:<\/p>\n<ul>\n<li>Put @Figure 1 <strong>first<\/strong> in the prompt<\/li>\n<li>Describe only actions, scenes, and camera work; <strong>do not describe the character again<\/strong><\/li>\n<li>Do not write comparison words such as \"like\" or \"similar to\"<\/li>\n<\/ul>\n<p>Using @ in a multi-person scene:<\/p>\n<p>@Figure 1 reaches out to @Figure 2<\/p>\n<blockquote>\n<ul>\n<li>\u26a0\ufe0f\u00a0<strong>Note<\/strong>: A single generation supports at most 3 @ subject calls; beyond that, parsing fails or characters blend together.<\/li>\n<\/ul>\n<\/blockquote>\n<p>3 lock strength levels for subject reference<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Strength<\/section>\n<\/th>\n<th>\n<section>Syntax<\/section>\n<\/th>\n<th>\n<section>Effect<\/section>\n<\/th>\n<th>\n<section>Applicable scenarios<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Strong Lock<\/strong><\/td>\n<td><code>@Figure 1<\/code><\/p>\n<section>\u00a0+ no modifying description<\/section>\n<\/td>\n<td>\n<section>About 95% character match; limited room for change<\/section>\n<\/td>\n<td>\n<section>Close-ups, dialogue, slow shots<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Medium Lock<\/strong><\/td>\n<td><code>@Figure 1<\/code><\/p>\n<section>\u00a0+ minor changes in clothing<\/section>\n<\/td>\n<td>\n<section>ABOUT 80% CHARACTER MATCH; OUTFIT CAN 
CHANGE<\/section>\n<\/td>\n<td>\n<section>Different scenes with the same person<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Weak Lock<\/strong><\/td>\n<td>\n<section>Reference image style + text description<\/section>\n<\/td>\n<td>\n<section>Keeps the overall look and feel, with high flexibility<\/section>\n<\/td>\n<td>\n<section>SAME IP AGE<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\ud83d\udca1\u00a0<strong>Experience<\/strong>: Do not chase 100% similarity. Around 90% is the best balance point: the character stays recognizable, and movement does not turn rigid because the lock is too tight.<\/p>\n<p>Mode II: Role Reference - IP<\/p>\n<p>Role reference, as the name suggests, is a reference mode designed specifically for <strong>character IP<\/strong>.<\/p>\n<p><strong>Core differences from subject reference<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th><\/th>\n<th>\n<section>Subject Reference<\/section>\n<\/th>\n<th>\n<section>Role Reference<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>What it locks<\/strong><\/td>\n<td>\n<section>\"The person in this picture\"<\/section>\n<\/td>\n<td>\n<section>\"Who this character is\"<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Level<\/strong><\/td>\n<td>\n<section>Accurate reproduction at the visual level<\/section>\n<\/td>\n<td>\n<section>Identity preserved at the conceptual level<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Best for<\/strong><\/td>\n<td>\n<section>Locking a single shot<\/section>\n<\/td>\n<td>\n<section>Cross-scene \/ cross-shot series<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>For example, suppose you have a reference image of Crayon Shin-chan:<\/p>\n<ul>\n<li><strong>Subject Reference<\/strong>: The generated Shin-chan closely matches the pose, expression, and angle in the picture<\/li>\n<li><strong>Role Reference<\/strong>: Shin-chan can perform any action, show any expression, and wear any outfit<\/li>\n<\/ul>\n<p><strong>ROLE REFERENCES ARE BETTER FOR SERIAL CONTENT, IP 
ACCOUNTS, SERIAL STORIES<\/strong> - because what you need is this character, not this picture.<\/p>\n<p>Which tools offer role reference<\/p>\n<p><strong>Tongyi Wanxiang 2.6 - Best for role play<\/strong><\/p>\n<p>Alibaba's Tongyi Wanxiang is the first domestic video model to support role play, and it is currently the most suitable tool for building a character IP.<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Core capability<\/section>\n<\/th>\n<th>\n<section>Description<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Video reference<\/section>\n<\/td>\n<td>\n<section>Upload a character video and the AI learns the character's appearance, expressions, and movement style<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Sound synchronization<\/section>\n<\/td>\n<td>\n<section>Uses the voice in the reference video to generate video with matching lip sync and voice<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Multiple subjects in one frame<\/section>\n<\/td>\n<td>\n<section>Upload two characters and have them interact in the same frame<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Anything can play a role<\/section>\n<\/td>\n<td>\n<section>Not limited to people: pets, cartoon characters, IP characters, and figurines all work<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Steps:<\/p>\n<ol>\n<li>Select Role Play mode<\/li>\n<li>Upload a reference video (10-30 seconds works best, with multiple angles and expressions)<\/li>\n<li>Enter a performance prompt (storyboard script format is supported)<\/li>\n<li>Generate a complete video with character, voice, and performance in one click<\/li>\n<\/ol>\n<p><strong>PixVerse - Best for multi-shot narratives<\/strong><\/p>\n<p>PixVerse's Character Ref function is designed for multi-shot narratives:<\/p>\n<ul>\n<li>Keeps a character consistent across 50+ clips<\/li>\n<li>Suited to series and serialized content<\/li>\n<li>Works better when combined with multiple frames<\/li>\n<\/ul>\n<p><strong>Pika Labs - First choice for animation \/ anime<\/strong><\/p>\n<p>One of the best tools for anime character consistency, and the first 
choice of animated drama creators.<\/p>\n<p>Advanced role reference techniques<\/p>\n<p><strong>Technique 1: Character profile workflow<\/strong><\/p>\n<p>This is how professional creators work now:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Step<\/section>\n<\/th>\n<th>\n<section>Action<\/section>\n<\/th>\n<th>\n<section>Output<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>\u2460<\/section>\n<\/td>\n<td>\n<section>Start by generating a set of character candidates<\/section>\n<\/td>\n<td>\n<section>5-10 candidates<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2461<\/section>\n<\/td>\n<td>\n<section>Select the best one and use image-to-image to produce multiple angles (front, side, 45\u00b0, back)<\/section>\n<\/td>\n<td>\n<section>4 angle images<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2462<\/section>\n<\/td>\n<td>\n<section>Import this set of character references and create a character profile<\/section>\n<\/td>\n<td>\n<section>Character profile<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2463<\/section>\n<\/td>\n<td>\n<section>Generate all subsequent footage from this character profile<\/section>\n<\/td>\n<td>\n<section>Consistency up 30%+<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>How<\/strong>: Pick the face first, then shoot multiple angles, then build the profile - the same logic as casting an actor.<\/p>\n<p><strong>Technique 2: Expression transfer<\/strong><\/p>\n<p>With role reference, you can control the character's expression:<\/p>\n<ul>\n<li>No more vague descriptions such as \"happy expression\"<\/li>\n<li>Use expression tags such as \"@smear stares\" and \"@smiling eyes\" instead<\/li>\n<li>You can even upload an expression reference video to reproduce the same expression<\/li>\n<\/ul>\n<p><strong>Technique 3: Multi-character performance<\/strong><\/p>\n<p>Tongyi Wanxiang 2.6 supports interaction among 2-3 characters:<\/p>\n<ul>\n<li>Upload references for each character separately<\/li>\n<li>WRITE PROMPTS AS \"ROLE A + ACTION + ROLE B + 
RESPONSE\"<\/li>\n<li>AI AUTOMATICALLY HANDLES SPACE RELATIONS AND VISION COMMUNICATION<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-53937\" title=\"139e19a8j00tgthkn00ucd000ugwm\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/139e19a8j00tgthkn00ucd000u000gwm.jpg\" alt=\"139e19a8j00tgthkn00ucd000ugwm\" width=\"1080\" height=\"608\" \/><\/p>\n<blockquote>\n<ul>\n<li>Zenium\u00a0<strong>Example<\/strong>\"Sitting at the stone table, shaving with your left hand, holding a glass with your right hand, looking at him on the table with a cat on his head, shiking candles, inside the ancient windhouse.\"<\/li>\n<li>(UPLOADING OF ROLE REFERENCES FOR CUSTOMS PLUMS AND CATS, AI AUTO-GENERATED INTERACTIVE SCENES)<\/li>\n<\/ul>\n<\/blockquote>\n<p>Mode III: Multimedia Reference - Professional creator matching<\/p>\n<p>If the main reference is a \"snipers\" and the role reference is a \"rifles\" \u2014 the full reference model is a \"missile system\"\u3002<\/p>\n<p>It's not a reference to an element, it's a simultaneous reference<strong>Pictures, videos, audio<\/strong>FOR A VARIETY OF MATERIALS, AI AUTOMATICALLY LEARNS AND RESETS:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Learning dimensions<\/section>\n<\/th>\n<th>\n<section>Control over what<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Role characteristics<\/section>\n<\/td>\n<td>\n<section>What do you look like<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Motion style<\/section>\n<\/td>\n<td>\n<section>How<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Lens Language<\/section>\n<\/td>\n<td>\n<section>How<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u266a The light and the light \u266a<\/section>\n<\/td>\n<td>\n<section>What atmosphere<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\ud83d\udd0a Sound<\/section>\n<\/td>\n<td>\n<section>What was that<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>One 
word<\/strong>: YOU GIVE AI A BUNCH OF REFERENCE MATERIAL, AI GIVES YOU A SET OF STYLE, A STABLE CHARACTER, A QUALITY PROFESSIONAL\u3002<\/p>\n<p>What tools support universal reference<\/p>\n<p><strong>Seedance 2.0 - the strongest full reference at present<\/strong><\/p>\n<p>The multi-model reference for Seedance 2.0 is the industry ceiling:<\/p>\n<ul>\n<li>Maximum support\u00a0<strong>12<\/strong>Reference file (photogram + video + audio mix)<\/li>\n<li>AI AUTOMATIC IDENTIFICATION OF REFERENCE TYPES, EXTRACTING SEPARATE FEATURES<\/li>\n<li>Supporting reference combination strategies, different combinations responding to different scenarios<\/li>\n<\/ul>\n<p><strong>Wan 2.7 - Command Edit+Multiform<\/strong><\/p>\n<p>A hundred degrees of Wan 2.7 feature: Supports \"directive editing\" - when generated, you can continue to modify the text without regeneration\u3002<\/p>\n<p>3 gold combination formulas with full reference<\/p>\n<p><strong>Combining formula 1: Role + scene + action (short play tag)<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Material Type<\/section>\n<\/th>\n<th>\n<section>quantity<\/section>\n<\/th>\n<th>\n<section>element<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Role Reference Chart<\/section>\n<\/td>\n<td>\n<section>3 sheets<\/section>\n<\/td>\n<td>\n<section>Heads, sides, faces<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Site reference<\/section>\n<\/td>\n<td>\n<section>2<\/section>\n<\/td>\n<td>\n<section>Main scenes, sub-scenes<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Action Reference Video<\/section>\n<\/td>\n<td>\n<section>Paragraph 1<\/section>\n<\/td>\n<td>\n<section>Walking, fighting, etc<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>APPLICATION: AI SHORT PLAY, DRAMA VIDEO, CHARACTER STORY<\/p>\n<p>Zenium\u00a0<strong>How<\/strong>: Three graphs, two scenes, short scripts\u3002<\/p>\n<p><strong>COMBINING FORMULA 2: MIRROR + MUSIC + ORAL (MV \/ 
PROMOTIONAL)<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Material Type<\/section>\n<\/th>\n<th>\n<section>quantity<\/section>\n<\/th>\n<th>\n<section>element<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Mirror Reference Chart<\/section>\n<\/td>\n<td>\n<section>9<\/section>\n<\/td>\n<td>\n<section>One for each shot<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Background Music<\/section>\n<\/td>\n<td>\n<section>Paragraph 1<\/section>\n<\/td>\n<td>\n<section>BGM AUDIO<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Oral reference video<\/section>\n<\/td>\n<td>\n<section>Paragraph 2<\/section>\n<\/td>\n<td>\n<section>Spoken from different emotions<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>APPLICATION: MV, PRODUCT PROMOTIONAL FILM, ORAL VIDEO<\/p>\n<p>Zenium\u00a0<strong>How<\/strong>: NINE LENSES, TWO MOUTHS, MV\u3002<\/p>\n<p><strong>Group formula 3: style + mirror + sound (creative video)<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Material Type<\/section>\n<\/th>\n<th>\n<section>quantity<\/section>\n<\/th>\n<th>\n<section>element<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Style reference diagram<\/section>\n<\/td>\n<td>\n<section>4<\/section>\n<\/td>\n<td>\n<section>Determine overall visual tone<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Mirror Reference Video<\/section>\n<\/td>\n<td>\n<section>Paragraph 2<\/section>\n<\/td>\n<td>\n<section>Lens Mode Reference<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Sound Audio<\/section>\n<\/td>\n<td>\n<section>Paragraph 3<\/section>\n<\/td>\n<td>\n<section>Ambient sound, special effects reference<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Application: creative short films, art videos, advertising films<\/p>\n<p>Zenium\u00a0<strong>How<\/strong>: Three sound effects, creative matching\u3002<\/p>\n<p>Full Reference Operational Steps (as in Seedance 2.0)<\/p>\n<p><strong>Step 1: Collate reference 
material<\/strong><\/p>\n<p>\"Performance-Scene-Action-Voice\" preparation material with clear names:<\/p>\n<p>Reference material\/<br \/>\nideas - role - female heads.png<br \/>\nideas - role - female master side.png<br \/>\ni miss the role of the hostess<br \/>\nideas-caf\u00e9 scenes.jpg<br \/>\nideas - scene - rain night street.jpg<br \/>\n\u2514 - actions _ walking.mp4<\/p>\n<p><strong>Step 2: Batch upload reference file<\/strong><\/p>\n<p>All reference files are uploaded once in the \"Alternative Reference\" model of Seedance 2.0. The system automatically classifies: person, scene, action, style, audio\u3002<\/p>\n<p><strong>Step 3: Write prompts in @grammatics<\/strong><\/p>\n<p>Similar to the subject reference but more flexible:<\/p>\n<p>@Girl_Girl walked into a caf\u00e9 from the rain, took her umbrellas and fell, found a seat by the window<br \/>\nI ordered a cup of coffee, looked out the window, my eyes were kind of blue, warmed yellow, cold blue<br \/>\nRain night, film quality, background music: gentle jazz<\/p>\n<p><strong>Step 4: Reconciliation of reference weights<\/strong><\/p>\n<p>Advanced function: The impact intensity of each category of reference can be reconciled separately -<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Type of reference<\/section>\n<\/th>\n<th>\n<section>Recommended weights<\/section>\n<\/th>\n<th>\n<section>reason<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Role<\/section>\n<\/td>\n<td>\n<section>80%<\/section>\n<\/td>\n<td>\n<section>We need to keep the same<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>take<\/section>\n<\/td>\n<td>\n<section>60%<\/section>\n<\/td>\n<td>\n<section>It's fine<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>movements<\/section>\n<\/td>\n<td>\n<section>40%<\/section>\n<\/td>\n<td>\n<section>It's not exactly the same<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>music<\/section>\n<\/td>\n<td>\n<section>50%<\/section>\n<\/td>\n<td>\n<section>Rhythms and emotions 
match<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\ud83d\udca1\u00a0<strong>Professional tip<\/strong>: With full reference, more is not better. Too many references confuse the AI, and quality drops instead. In general, <strong>5-8 reference files<\/strong> is the best number.<\/p>\n<p>\u26a1 Advanced techniques: first\/last frame control + multi-frame reference<\/p>\n<p>Beyond the three reference modes, two new 2026 features take character consistency a step further.<\/p>\n<p>Technique I: First\/last frame control (Keyframe-to-Video)<\/p>\n<p>This is the killer feature of AI video in 2026, bar none.<\/p>\n<p><strong>How it works<\/strong>: Upload a first frame and a last frame, and the AI automatically generates the transition video in between.<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Previous AI<\/section>\n<\/th>\n<th>\n<section>With first and last frames<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>The character drifts between the start and the end<\/section>\n<\/td>\n<td>\n<section>Both ends are locked down<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>It is like double insurance for the character, at both the start and the end.<\/p>\n<p><strong>Steps (using Vidu as the example)<\/strong>:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Step<\/section>\n<\/th>\n<th>\n<section>Action<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>\u2460<\/section>\n<\/td>\n<td>\n<section>Select Keyframe-to-Video mode<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2461<\/section>\n<\/td>\n<td>\n<section>Upload the first frame (the character's starting pose)<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2462<\/section>\n<\/td>\n<td>\n<section>Upload the last frame<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2463<\/section>\n<\/td>\n<td>\n<section>Enter Transitional Action 
Description<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2464<\/section>\n<\/td>\n<td>\n<section>Generate 4-8 seconds of consistent transition video<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Applicable scenarios<\/strong>:<\/p>\n<ul>\n<li>A character moving from scene A to scene B<\/li>\n<li>An expression shifting from anger to sadness<\/li>\n<li>An object going from intact to shattered<\/li>\n<li>A shot moving from wide view to close-up<\/li>\n<\/ul>\n<blockquote>\n<ul>\n<li>\u26a0\ufe0f\u00a0<strong>Pitfall<\/strong>: The first and last frames must show the same character. If the first frame has long hair and the last frame has short hair, the AI will interpret it as the hair being cut and generate a strange middle section.<\/li>\n<\/ul>\n<\/blockquote>\n<p>Technique II: Multi-frame reference<\/p>\n<p>First\/last frame control uses two keyframes. Multi-frame reference supports\u00a0<strong>2-20<\/strong> keyframes.<\/p>\n<p><strong>How it works<\/strong>: Give the AI a set of keyframes; it connects them in order and produces one continuous long shot.<\/p>\n<p><strong>When to use it<\/strong><\/p>\n<ul>\n<li>Complex action sequences (e.g., martial arts, dance)<\/li>\n<li>Long shots (a single take over 10 seconds)<\/li>\n<li>When precise control of the camera path is needed<\/li>\n<\/ul>\n<p><strong>Golden ratio for multi-frame reference<\/strong>:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Video duration<\/section>\n<\/th>\n<th>\n<section>Recommended keyframes<\/section>\n<\/th>\n<th>\n<section>Note<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>10 sec<\/section>\n<\/td>\n<td>\n<section>3-5<\/section>\n<\/td>\n<td>\n<section>\u2014<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>20 sec<\/section>\n<\/td>\n<td>\n<section>6-8<\/section>\n<\/td>\n<td>\n<section>\u2014<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Longer<\/section>\n<\/td>\n<td>\n<section>More is not better<\/section>\n<\/td>\n<td>\n<section>Too many keyframes cause stuttering motion<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>2026 
Consistency of mainstream tool figures<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>tool<\/section>\n<\/th>\n<th>\n<section>Subject Reference<\/section>\n<\/th>\n<th>\n<section>Role Reference<\/section>\n<\/th>\n<th>\n<section>Full Reference<\/section>\n<\/th>\n<th>\n<section>opening and closing frames<\/section>\n<\/th>\n<th>\n<section>Multi-frame reference<\/section>\n<\/th>\n<th>\n<section>Recommended scene<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Vidu Q3<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>Personalized, single-screened, high-coherent<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Seedance 2.0<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>Short opera, multi-modular, full-power creative<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>General, 2.6<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>IP ACCOUNT, ROLE-PLAYING, SPECTROSCOPY<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>AI 
2.0<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>Long video, dynamic effects, scale<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Wan 2.7<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>Command editing, later modification, professional production<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>PixVerse<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>Multisession narratives, series<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Pika Labs<\/strong><\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50\u2b50<\/section>\n<\/td>\n<td>\n<section>Comic, binary, creative video<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>3 Principles for Selection of 
Tools<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>What you are making<\/section>\n<\/th>\n<th>\n<section>What to choose<\/section>\n<\/th>\n<th>\n<section>Why<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Single-person close-ups \/ attractive faces<\/section>\n<\/td>\n<td><strong>Vidu<\/strong><\/td>\n<td>\n<section>Most precise subject reference<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Series IP \/ serialized content<\/section>\n<\/td>\n<td><strong>Tongyi Wanxiang 2.6<\/strong><\/td>\n<td>\n<section>Role reference<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Professional short dramas \/ integrated production<\/section>\n<\/td>\n<td><strong>Seedance 2.0<\/strong><\/td>\n<td>\n<section>Most complete full reference<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\u26a0\ufe0f Pitfall guide: seven common mistakes in character consistency<\/p>\n<p><strong>Pitfall 1: Poor reference image quality<\/strong><\/p>\n<p>The reference image is the foundation. Blurry images, messy lighting, odd angles, occlusion: give the AI a pile of bad references and nothing can save the result.<\/p>\n<blockquote>\n<ul>\n<li>\u2705\u00a0<strong>Correct approach<\/strong>: Spend 10 minutes making a standard character image; it is worth 100 times more than fixing things later.<\/li>\n<\/ul>\n<\/blockquote>\n<p><strong>Pitfall 2: Too many character references, and the AI gives you a blended oddity<\/strong><\/p>\n<p>Many people think more references are always better. In fact, the more character references you add, the more easily the AI mixes them up, and the result resembles none of them.<\/p>\n<blockquote>\n<ul>\n<li>\u2705\u00a0<strong>Correct approach<\/strong>: For a single person, use 1 primary reference + 2 supporting references; multiple people must be clearly distinguished with @ syntax.<\/li>\n<\/ul>\n<\/blockquote>\n<p><strong>Pitfall 3: Large movements break the face<\/strong><\/p>\n<p>Whatever the reference mode, large movements will break the face - this is a hard limit of current 
technology.<\/p>\n<blockquote>\n<ul>\n<li>\u2705\u00a0<strong>Correct approach<\/strong>: Keep movements small and slow in important shots; for large-action shots, use wide shots or back views to avoid showing the face.<\/li>\n<\/ul>\n<\/blockquote>\n<p><strong>Pitfall 4: Scenes differ too much, so the character seems to \"change\"<\/strong><\/p>\n<p>The same person under warm light and under cold light can look like two different people. AI does not yet understand lighting the way humans do.<\/p>\n<blockquote>\n<ul>\n<li>\u2705\u00a0<strong>Correct approach<\/strong>: Keep the lighting as uniform as possible across a series; if it really has to change, add \u201ckeep the character's coloring consistent\u201d to the prompt.<\/li>\n<\/ul>\n<\/blockquote>\n<p><strong>Pitfall 5: Relying on a single method<\/strong><\/p>\n<p>Subject reference alone? The character turns wooden. Role reference alone? Details tend to drift. Prompts alone? It all comes down to luck.<\/p>\n<blockquote>\n<ul>\n<li>\u2705\u00a0<strong>Correct approach<\/strong>: <strong>Three-layer stack<\/strong>: a reference to fix the overall look + first and last frames to lock both ends + prompts to pin down the details. The combination works best.<\/li>\n<\/ul>\n<\/blockquote>\n<p><strong>Pitfall 6: Chasing 100% consistency<\/strong><\/p>\n<p>Real actors look different across shots, angles and expressions, and viewers never feel that the person has \"changed.\" The same goes for AI video. <strong>80% consistency + 20% natural variation = the best viewing experience.<\/strong>\u00a0Forcing 100% consistency produces stiff movement and frozen expressions, like a wax figure.<\/p>\n<p><strong>Pitfall 7: Using outdated tools without knowing the new features<\/strong><\/p>\n<p>Many people are still using the old 2024-2025 methods and do not realize how capable the 2026 reference modes are. As the saying goes, a craftsman who wants to do good work must first sharpen his tools. 
With the right tools and methods, efficiency improves more than tenfold.<\/p>\n<p>\ud83c\udfac Full walkthrough: a 3-shot AI short drama with a consistent female lead<\/p>\n<p>Enough theory; here is a complete hands-on example.<\/p>\n<p><strong>Goal<\/strong>: Make a 3-shot ancient-style short drama in which the female lead keeps the same face and the same style across all 3 shots.<\/p>\n<p><strong>Tool set<\/strong>: Vidu (subject reference) + Jianying (CapCut) for post-production<\/p>\n<p>Step 1: Create a standard character reference image<\/p>\n<p>First you need a high-quality reference image; it is the basis of all consistency.<\/p>\n<p>A young woman in ancient Chinese style, about 20 years old, oval face, phoenix eyes, high nose bridge, thin lips, long black hair<br \/>\npinned with a silver hairpin, wearing a light blue gauze dress with a white embroidered collar, fair skin, a cool and aloof air<br \/>\na hint of melancholy in her eyes, head-and-shoulders portrait, front-facing shot, soft natural light, cinematic feel, 8K ultra-clear, solid-color background<\/p>\n<p>Generate 4-6 images, pick the most satisfactory one, and save it as:<\/p>\n<p>homemaker_standard reference chart.png<\/p>\n<blockquote>\n<ul>\n<li>\ud83d\udca1\u00a0<strong>Tip<\/strong>: Do not just pick the prettiest image; pick the one that works best as a reference: clear facial features, even lighting, and no odd angles or expressions.<\/li>\n<\/ul>\n<\/blockquote>\n<p><strong>Step 2: Shot 1 - The female lead walks in the garden (subject reference mode)<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Item<\/section>\n<\/th>\n<th>\n<section>Content<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Shot description<\/strong><\/td>\n<td>\n<section>The female lead strolls slowly through an ancient-style garden, medium shot, camera slowly following<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Tool<\/strong><\/td>\n<td>\n<section>Vidu Q3 + Subject Reference 
Mode<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Reference image<\/strong><\/td>\n<td><code>homemaker_standard reference chart.png<\/code><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Steps:<\/p>\n<ol>\n<li>Upload the reference image and wait for the system to parse the subject (it shows \"Subject Analyzed\")<\/li>\n<li>Enter the prompt:<\/li>\n<\/ol>\n<p>@Figure 1 walks slowly through an ancient-style garden with plum trees and rockeries, morning mist, soft morning light<br \/>\nfiltering through the leaves, medium shot, side view, tracking camera, cinematic quality, pale gold tones, a cool and poetic atmosphere<\/p>\n<ol>\n<li>Duration: 8 seconds<\/li>\n<\/ol>\n<blockquote>\n<ul>\n<li>Expected result: the character's appearance matches the reference image by 90% or more, the movement is natural, and the picture is stable.<\/li>\n<\/ul>\n<\/blockquote>\n<p>Step 3: Shot 2 - Looking back with a smile<\/p>\n<p>Turning around is a large movement that easily breaks the face, so use <strong>first-and-last-frame control<\/strong> to lock both ends.<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Item<\/section>\n<\/th>\n<th>\n<section>Content<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Shot description<\/strong><\/td>\n<td>\n<section>The female lead stops, turns her head, looks toward the camera, and smiles<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Tool<\/strong><\/td>\n<td>\n<section>Vidu Keyframe-to-Video mode<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Strategy<\/strong><\/td>\n<td>\n<section>First-and-last-frame control locks both ends<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Steps:<\/p>\n<ol>\n<li><strong>First frame<\/strong>: Capture a clear side-view frame of her walking from the video generated in Step 2<\/li>\n<li><strong>Last frame<\/strong>: Generate an image of the female lead smiling as she looks back (keeping the same character as in the first frame)<\/li>\n<li>Upload the first frame and the last frame<\/li>\n<li>Enter a transition prompt:<\/li>\n<\/ol>\n<p>The character slows 
down, her body slowly turns toward the camera, her head lifts slightly, the corners of her mouth turn up<br \/>\ninto a faint smile, her eyes soften from melancholy to gentleness, her skirt hem sways as she turns<br \/>\nthe fabric of her clothes folds naturally<\/p>\n<ol>\n<li>Duration: 6 seconds<\/li>\n<\/ol>\n<blockquote>\n<ul>\n<li>\ud83d\udca1\u00a0<strong>Tip<\/strong>: For the last frame, it is best to <strong>edit the first frame<\/strong> instead of generating a new image. \"The same image, modified\" is far more consistent than \"two different images\".<\/li>\n<\/ul>\n<\/blockquote>\n<p>Step 4: Shot 3 - Close-up (role reference + full reference mode)<\/p>\n<p>A close-up demands the highest character consistency, so use role reference to keep the facial features accurate.<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Item<\/section>\n<\/th>\n<th>\n<section>Content<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Shot description<\/strong><\/td>\n<td>\n<section>The female lead sits before a guqin, her fingers gently plucking the strings, close-up<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Tool<\/strong><\/td>\n<td>\n<section>2.6 role reference mode<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td><strong>Reference<\/strong><\/td>\n<td>\n<section>A video clip of the female lead (about 10 seconds, covering multiple angles)<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Enter the storyboard prompts:<\/p>\n<p>Storyboard 1 [0-3 seconds] The female lead's hands rest lightly on the guqin strings, her fingers slender, her nails pale pink<br \/>\nThe camera moves slowly upward to reveal her lowered eyes, focused and calm.<\/p>\n<p>Storyboard 2 [3-6 seconds] Side view, the female lead lowers her head slightly and a few strands of hair fall across her cheek<br \/>\nA faint smile plays at the corners of her mouth, warm yellow candlelight casts soft shadows on her face, and moonlight spills in from 
outside the window.<\/p>\n<p>Duration: 6 seconds<\/p>\n<p>Step 5: Assemble and unify<\/p>\n<p>After all three shots are generated, import them into the editor for a final unification pass:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Step<\/section>\n<\/th>\n<th>\n<section>Action<\/section>\n<\/th>\n<th>\n<section>Parameters<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>\u2460<\/section>\n<\/td>\n<td>\n<section>Sequence<\/section>\n<\/td>\n<td>\n<section>Shot 1, Shot 2, Shot 3<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2461<\/section>\n<\/td>\n<td>\n<section>Transitions<\/section>\n<\/td>\n<td>\n<section>Add a\u00a0<strong>0.3-second dissolve transition<\/strong>\u00a0between shots<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2462<\/section>\n<\/td>\n<td>\n<section>Unify color<\/section>\n<\/td>\n<td>\n<section>Warm gold +10 \/ Contrast +5 \/ Saturation -5<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2463<\/section>\n<\/td>\n<td>\n<section>Add BGM<\/section>\n<\/td>\n<td>\n<section>Light, ancient-style music<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>\u2464<\/section>\n<\/td>\n<td>\n<section>Subtitles<\/section>\n<\/td>\n<td>\n<section>Add subtitles that follow the script<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Common problems and fixes in practice<\/strong><\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Problem<\/section>\n<\/th>\n<th>\n<section>Cause<\/section>\n<\/th>\n<th>\n<section>Fix<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Shot 1 looks right, Shot 2 looks different<\/section>\n<\/td>\n<td>\n<section>The first frames differ too much<\/section>\n<\/td>\n<td>\n<section>Reuse a frame from Shot 1 instead of regenerating<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>The face breaks in the close-up<\/section>\n<\/td>\n<td>\n<section>Movement too large<\/section>\n<\/td>\n<td>\n<section>Reduce the movement and slow it down<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>The three shots look 
different<\/section>\n<\/td>\n<td>\n<section>Differences between tools\/models<\/section>\n<\/td>\n<td>\n<section>Unify color grading and filters in post<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Multi-character scenes<\/section>\n<\/td>\n<td>\n<section>Characters not distinguished<\/section>\n<\/td>\n<td>\n<section>Tag each character explicitly with the @ syntax<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Long hair clipping \/ flying wildly<\/section>\n<\/td>\n<td>\n<section>Moving hair is a known trouble spot<\/section>\n<\/td>\n<td>\n<section>Add to the prompt: \"hair falls naturally, moves lightly with the wind, does not float too much\"<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\ud83c\udfaf\u00a0<strong>Core approach<\/strong>:<\/p>\n<p>Character consistency does not come from any single technique; it is the combined result of three stacked layers: \"a good reference image + the right mode + unified color grading in post.\"<\/p>\n<p>Each layer alone scores about 80 points; the three stacked together score 95 or more.<\/p>\n<p>To summarize, the evolution of character consistency up to 2026:<\/p>\n<table>\n<thead>\n<tr>\n<th>\n<section>Phase<\/section>\n<\/th>\n<th>\n<section>Technology<\/section>\n<\/th>\n<th>\n<section>Positioning<\/section>\n<\/th>\n<th>\n<section>Era<\/section>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\n<section>Phase I<\/section>\n<\/td>\n<td>\n<section>Base image + seed value<\/section>\n<\/td>\n<td>\n<section>Basic<\/section>\n<\/td>\n<td>\n<section>2024<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Phase II<\/section>\n<\/td>\n<td>\n<section>Subject Reference + @ syntax<\/section>\n<\/td>\n<td>\n<section>Precision<\/section>\n<\/td>\n<td>\n<section>2025, widespread<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Phase III<\/section>\n<\/td>\n<td>\n<section>Role Reference + Video Reference<\/section>\n<\/td>\n<td>\n<section>IP tier<\/section>\n<\/td>\n<td>\n<section>2026, mainstream<\/section>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<section>Phase IV<\/section>\n<\/td>\n<td>\n<section>Full multimodal 
reference<\/section>\n<\/td>\n<td>\n<section>Professional tier<\/section>\n<\/td>\n<td>\n<section>2026, cutting edge<\/section>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>Still saying \"AI video characters change faces every frame\"? Then you're probably still using the old 2024 method. Image to Video barely keeps the character consistent from the front; once the character moves, the face breaks<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[149,144],"tags":[956,5321],"collection":[],"class_list":["post-53936","post","type-post","status-publish","format-standard","hentry","category-jiaocheng","category-baike","tag-ai"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/53936","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=53936"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/53936\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=53936"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.
net\/en\/wp-json\/wp\/v2\/categories?post=53936"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=53936"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=53936"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}