WAN2.1-VACE-14B 1.3B GGUF 6 steps All in One (t2v-i2v-v2v-FLF-controlnet-masking) simple ComfyUI workflow
53
717
21
Comfyui workflow for text to video, image to video, video to video, video stylize, video character replacement, clothes swapper, all in one simple workflow.
MODEL GUIDE
Use VACE model + CauseVid and/or Self-Forcing lora
14B for quality
1.3B for faster inference
Change GGUF Loader node to Load Diffusion Model node for .safetensor files
==========================================================================
14B VACE model GGUF + CauseVid lora (6 steps only)
https://huggingface.co/QuantStack/Wan2.1_14B_VACE-GGUF/tree/main
or
14B FusionX VACE GGUF (CauseVid merged)
https://huggingface.co/QuantStack/Wan2.1_T2V_14B_FusionX_VACE-GGUF
==========================================================================
1.3B VACE Self-Forcing model used (6 steps only, no CauseVid needed)
*1.3B VACE GGUF fails to give good result
==========================================================================
SWITCH GUIDE
Text to video = all OFF
Image reference to video = Image1 ON
Image to video = Image1 + FLF ON
First & Last Frame to video = Image1+2+FLF ON
FLF video control = Image1+2+VidRef+FLF+control ON
V2V style change = Image1+VidRef+controlnet ON
V2V subject change = Image1+VidRef+control+SAM ON
V2V background change = same as above+invert mask