anal

blowjob

breasts out

downblouse

graphic female nudity

graphic male nudity

graphic violence or gore

one breast out

peeing

physical violence

scat

suggestive

undressed

urine

vore

weapon violence

white supremacy

child on child

disturbing

emaciated bodies

exposed female nipple

female swimwear

female swimwear or underwear

female underwear

futanari

hate speech

hate symbols

hentai

illustrated explicit nudity

lingerie

male swimwear or underwear

male underwear

no panties

nude

nudity

oral

oral invitation

partial nudity

porn

revealing clothes

sexual intent

sexual situations

sexy

sitting on face

thick thighs

wide hips

adult toys

breast out

convenient censoring

corpses

covered nipples

diapers

dildo riding

extremist

female nudity

genitals

gigantic breasts

hair over breasts

hanging

huge breasts

huge butt

incest

male nudity

nazi party

nsfw

pg-13

self injury

sexual activity

strapless leotard

bukkake

fellatio

bikini

cumshot

implied fellatio

eat_cum

cumdrip

cum in pussy

cum on face

after fellatio

cum on hair

cum on body

cum on tongue

cum on hands

cum in mouth

presenting ass

Designed to be visually compact and simplified for ease of use. Personally, I think this is the best streamlined workflow there is. The overall layout is designed to be user friendly, intuitive, and wastes as little space as possible while fitting very well into the ComfyUI workflow window. All in all, it's a one stop shop for all your WAN video generation needs._________This workflow generates a 5 second 480x480 video in 60 seconds on a 4070ti with the Q8 GGUF or FP8 models without Sage Attention enabled and uses LCM sampling with Light X2V to speedup generation time.This workflow uses mostly basic and common nodes and extensions, so it should be very easy to get working with minimal effort. Click "show more" for details like requirements and model download links.Notable features include: an Infinite LoRA Loader, Sage Attention, the ability to grab the Last Frame of a generation for usage with extending videos (last frame must be manually saved and loaded), a standalone Video Combiner utility workflow, and a standalone Upscaling / Interpolation utility workflow, allowing for selective and easy post processing of your generated videos, and designed with consideration for a variety of use cases, from big to small, from power PC to potato PC._________For WAN 2.2:Same design as previously, but geared toward running the WAN 2.2 Low Noise model only. See "required models" section below for new workflow requirements.The Light X2V LoRA works with WAN 2.2 at a strength of 1.1 to 2.0, and can dramatically alter the behavior of the model, in either beneficial or detrimental ways, so after testing I chose 1.5 as the default strength since that seemed to be the most reliable, but experiment to find what works for you.WAN 2.2 is much more dynamic, which means it requires a slightly different prompting style than you might have used in WAN 2.1. The same goes for its affect on LoRA's, where they tend to be amplified in strength, which can be a good and bad thing, but overall I'm seeing some pretty good results with lots of keepers. So the main differences to get good results is to focus on learning how to prompt it and you may also need to tinker with LoRA strengths, depending on the LoRA and how it's behaving with your prompt and image input. Even changing it to 6 or 8 steps can also potentially improve results.The workflow sampler/scheduler settings seem to work pretty well, but more experimentation is required, there could be other combinations that work better, especially in the RES4LYF custom samplers and schedulers extension (part of the requirements below).There can be some bad generations that go off the rails, but all in all once you dial things in, WAN 2.2 can generate a lot of keepers that you could never get with WAN 2.1._________For WAN 2.1:The main settings that you may want to change would be primarily just output resolution or sampler steps. Other samplers or schedulers may work, but I find LCM/SGM Uniform or LCM/Beta57 provides the most coherent output. The only other setting you might want to fiddle with is the LoRA strengths. There are however other settings you can fiddle with, such as "SHIFT", which can somewhat work like a CFG setting. In my experience, it can be used to drastically change how a prompt/LoRA is expressed, while also creating more dramatic changes in movements, but generally this should be left at its default setting. A resolution of 512x512 or 640x640 can also be used, but can potentially lose some prompt adherence._________Note: Sage Attention is disabled by default. To enable Sage Attention (if you have the pre-requisites installed) simply select the "Enable for Sage Attention" node and press Ctrl+B to enable it, then below it change the "sage_attention" option from disabled to enabled. Even if you don't plan on using Sage Attention, you will still need to install the extension for the workflow to operate._________Required and alternative models:GGUF WAN 2.2 i2v models (use only "low noise" version):<a target="_blank" rel="ugc" href="https://huggingface.co/bullerwins/Wan2.2-I2V-A14B-GGUF/tree/main">https://huggingface.co/bullerwins/Wan2.2-I2V-A14B-GGUF/tree/main</a>GGUF WAN 2.1 i2v models:<a target="_blank" rel="ugc" href="https://huggingface.co/city96/Wan2.1-I2V-14B-480P-gguf/tree/main">https://huggingface.co/city96/Wan2.1-I2V-14B-480P-gguf/tree/main</a>FP8 WAN 2.1 Light X2V (for FP8 workflow, no independent acceleration LoRA needed, recommended as new default way to run WAN, very good prompt adherence and many quality issues solved):<a target="_blank" rel="ugc" href="https://huggingface.co/lightx2v/Wan2.1-Distill-Models/blob/main/wan2.1_i2v_480p_scaled_fp8_e4m3_lightx2v_4step_comfyui.safetensors">https://huggingface.co/lightx2v/Wan2.1-Distill-Models/blob/main/wan2.1_i2v_480p_scaled_fp8_e4m3_lightx2v_4step_comfyui.safetensors</a>CLIP model:<a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/text_encoders/umt5_xxl_fp8_e4m3fn_scaled.safetensors">https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/text_encoders/umt5_xxl_fp8_e4m3fn_scaled.safetensors</a>Or the higher precision BF16 CLIP model:<a target="_blank" rel="ugc" href="https://huggingface.co/minaiosu/Felldude/blob/main/wan21UMT5XxlFP32_bf16.safetensors">https://huggingface.co/minaiosu/Felldude/blob/main/wan21UMT5XxlFP32_bf16.safetensors</a>CLIP Vision model:<a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/clip_vision/clip_vision_h.safetensors">https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/clip_vision/clip_vision_h.safetensors</a>Or the custom NSFW geared CLIP Vison model (recommended):<a target="_blank" rel="ugc" href="https://huggingface.co/ricecake/wan21NSFWClipVisionH_v10/tree/main">https://huggingface.co/ricecake/wan21NSFWClipVisionH_v10/tree/main</a>VAE model:<a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors">https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors</a>Light X2V T2V LoRA: <a target="_blank" rel="ugc" href="https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors">https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors</a>Or the proper Light X2V I2V LoRA:<a target="_blank" rel="ugc" href="https://huggingface.co/lightx2v/Wan2.1-Distill-Loras/tree/main">https://huggingface.co/lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/blob/main/loras/Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64.safetensors</a>Or the other Light X2V experimentations by Kijai:<a target="_blank" rel="ugc" href="https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Lightx2v">https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Lightx2v</a>Or the even newer v2.0 of the Light X2V LoRA released in October 2025 (recommended if not using a model with the LoRA baked in):<a target="_blank" rel="ugc" href="https://huggingface.co/lightx2v/Wan2.1-Distill-Loras/tree/main">https://huggingface.co/lightx2v/Wan2.1-Distill-Loras/tree/main</a><a target="_blank" rel="ugc" href="https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main">https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main</a>RES4LYF custom samplers and schedulers:<a target="_blank" rel="ugc" href="https://github.com/ClownsharkBatwing/RES4LYF">https://github.com/ClownsharkBatwing/RES4LYF</a>_________Secret Pro Tip: using a transparent or solid colored image, such as black, can turn the i2v model into essentially a t2v model. It will rapidly transition from the blank input image and generate something from scratch to try to follow your prompt. It's an easy way to get t2v capabilities without changing workflows/models._________Other useful information:WAN can change behavior dramatically with output resolution changes, it tends to respond best if the resolution is 480 in either width or height. WAN 2.2 is supposed to be a 480p and 720p model, but it may still behave differently at different resolutions, and require either settings tweaks or just not work well at certain resolutions. Some things can work well at 480x480, some things can work better or worse at 512x512 or higher resolutions, but typically you get the most stable outputs with 480 or 720 in the width or height.

• Upgraded to use the newer FP8 version of WAN 2.1 released by Light X2V, with built in acceleration without the need for a LoRA. This new version of Light X2V, even at FP8, generally solves some quality and prompt adherence issues over previous versions.• There are also GGUF versions of this newer baked-in Light X2V model for WAN 2.1 and 2.2, along with independent LoRAs for both too, and for that, v5.2 of this workflow is geared toward running GGUF versions (just make sure to not also run the Light X2V LoRA alongside anything that already has the LoRA built in).• Link library at the bottom of the main page has been updated to reflect the latest models that are available as of November 2025, along with fixing any broken links to models that were no longer available at a previous link.• With the newest Light X2V implementation, a SHIFT value of 3 or 4 is recommended for maximum coherency to the starting image. Best samplers are LCM/SGM Uniform or LCM/Beta57.

v6.0 for WAN 2.1 (FP8)

• Added "EZ Video Utilities" standalone workflows, which includes a Video Combiner focused workflow for combining videos extended using a last frame, and an Upscaling / Interpolation focused workflow. With these two extra utility workflows, you can basically do everything you could want when it comes to post processing your video generations• With this milestone reached, Fast WAN I2V Compact is now a fully featured flagship when it comes to WAN 2.1 video generation. Light X2V accelerated, use as many LoRA as you like, grab the last frame to extend videos, combine extended videos, and upscale or interpolate videos. All in a compact, well thought out, and user friendly design.

v5.2 for WAN 2.1 (GGUF)

• Added a small standalone utility workflow that combines two generated videos into a single video or GIF. Primarily intended for use with videos extended using last frame to generate a second video• Output can be loaded as an input and combined further• Not a completely lossless process, so too many combines could result in visible quality degradation eventually, depending largely on the initial quality of the inputs and outputs• For a lossless process that combines many videos all at once, a third party program would be better suited• Pro tip: Adjusting "frame load cap" to 1 frame less than the total number of frames on the first video can keep from duplicating the last frame used between videos, ensuring a smooth transition in the output video. Alternatively, and maybe even more simple, you can also use "skip first frames" with a setting of "1" in the second video input to remove the duplicate frame there instead.• Re-download this workflow if your video combiner utility uses paths instead of thumbnails for selecting videos

v5.1 for WAN 2.1

• Added a simple feature to automatically pull the last frame for use with extending a video. Image must be manually saved and loaded, videos must be manually combined• Redownload for fix if you're getting red lines

v5.0 for WAN 2.1

• Default video file format changed to AV1 because of other formats having issues displaying properly with the latest version of ComfyUI• AV1 is a higher quality format with a slightly larger file size (equivalent to using H.265 at a CRF of 12) and still outputs MP4 files• Not sure if this affects AMD users, but if it does, just select another format like H.264. Note that H.264 does not play well with Windows thumbnail previews, which is why AV1 was chosen instead

v4.6 for WAN 2.1

• Same as v4.0 but with a playfully decorative UI

v4.5 for WAN 2.1

• Infinite LoRA loader with toggling functionality• Slight UI adjustments• Bug Fixes• Default models changed to highest quality versions, but use whatever you wish• Use "shift" 4 to 8 or higher to adjust prompt adherence / model behavior• Can still be used with WAN 2.2 Low Noise model

v4.0 for WAN 2.1

• Same design as previously• Geared toward running the WAN 2.2 Low Noise model only• See main page or below for more information and see "required models" section for new requirements• Light X2V LoRA works at a strength of 1.1 to 2.0, and can dramatically alter the behavior of the model, in both beneficial or detrimental ways, so after testing I chose 1.5 as the default strength since that seemed to be the most reliable, but experiment. Note: I have since found a Light X2V strength of 1.688 comes closest to WAN 2.1 behavior, but still not perfect.• WAN 2.2 is much more dynamic, which means it requires a slightly different prompting style than you might have used in WAN 2.1. The same goes for its affect on LoRA's, where they tend to be amplified in strength, which can be a good and bad thing, but overall I'm seeing some pretty good results with lots of keepers. So the main differences to get good results is to focus on learning how to prompt it and you may also need to tinker with LoRA strengths, depending on the LoRA and how it's behaving with your prompt and image input. Even changing it to 6 or 8 steps can also potentially improve results.• There can be some bad generations that go off the rails, but all in all once you dial things in, WAN 2.2 can generate a lot of keepers that you could never get with WAN 2.1.