Sign In

Wan 2.2 workflow optimized for RTX 3060 12 GB VRAM GPU

27

Wan 2.2 workflow optimized for RTX 3060 12 GB VRAM GPU

[Edit:

This article is out of date. Please have a look at: https://civitai.com/models/1852904/wan-22-workflow-optimized-for-rtx-3060-12-gb-vram-gpu

to get the latest version and information]

Quick Overview:

  • Optimized Wan 2.2 workflow, runs perfect on RTX 3060 12 GB VRAM GPU and 32 GB RAM

  • Installation of Triton und Sage Attention not required

  • Fast and high quality video generation with LightX2V Lora

  • Reduced VRAM usage with BlockSwap

  • Text/Image 2 Video generation in one workflow

  • "Easy" installation/model downloading, all necessary sources are specified

  • "Easy" to use workflow, clearly structured, all necessary steps are explained

  • A 5 Second long high quality video generation takes about 10 - 15 minutes (see below).

Short introduction:

I struggled hard to find a suitable Wan workflow that runs with 12 GB VRAM delivering good and fast results. With starting up Wan 2.2 a couple of days ago I started a new try. Since it's running satisfactorily now I would like to publish it here and I hope it well help some others too.

This workflow is based on elements of a variety of allready published workflows. My "job" was only to put things together, optimize it for a small machine and create a most simple and hopfully user or even "beginner" friendly workflow.

The facts in short:

  • LightX2V Lora: fast (4 to 8 step) generation with high quality outputs.

  • BlockSwap: To avoid "out of memory" errors. Significantly reduces VRAM usage without noticeably affecting speed. It is possible to run some other tasks on the same machine while genarating.

  • Upscaling and Framerate multiply: 2x fast upscale, 4x fast framerate multiply, generates very smooth and high quality video ouputs up to 1440x960 at 60fps.

Tested generation times:

As a rough guide value for RTX 3060 GPU: generating a 5 second long high quality 1440 x 960 60 fps video with 6 steps it will take:

  • t2v: around 10 - 12 minutes,

  • i2v: around 15 minutes.

Short Conclusion:

I`m not an "expert" - just a user who wants to get it running on "available" hardware.

There are many things I don't really understand. If you find mistakes or better solutions please give me a hint.

And I really hope that even "beginners" have a chance to go the first steps...

For testing/understanding/experimenting/changing the workflow:

  • Klick "Toggle Link Visibility" to see the links.

  • Move nodes to see all "covered" nodes.

  • For quick testing you may lower the settings for: steps, clip lenght and video resolution.

And as usual: Have Fun 🙂🙂

27