📷 HD IMAGE OUTPUT: Now with ControlNet & Img2Img! (Up to 2048p)
🌟 SkyReels and WAN: Text-to-Video Models, Stunning Still Results
Sharing my current wrapper workflows for WAN and SkyReels. Both are text-to-video models, but they generate incredible still images too.
⚙️ Defaults work great, but everything's tweakable.
📝 Don't skip the in-workflow notes; they cover setup and tips.
💡 Running with under 16GB VRAM? Enable block swapping for smoother runs.
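If you're curious what block swapping actually does: it parks most of the model's transformer blocks in system RAM and moves each one onto the GPU only for its forward pass, trading PCIe transfer time for VRAM. Here's a minimal PyTorch sketch of the idea (illustrative only, not the wrapper's actual code; in the workflow it's just a node setting):

```python
import torch
import torch.nn as nn

class BlockSwapRunner:
    """Toy sketch of block swapping: blocks live in CPU RAM and visit
    the GPU one at a time. Real implementations overlap transfers with
    compute; this is the simplest possible version."""

    def __init__(self, blocks: nn.ModuleList, device: str = "cuda"):
        self.blocks = blocks.to("cpu")  # parked in system RAM
        self.device = device

    @torch.no_grad()
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for block in self.blocks:
            block.to(self.device)   # swap this block into VRAM
            x = block(x)
            block.to("cpu")         # evict it to make room for the next
        return x
```

More swapped blocks means lower peak VRAM and somewhat slower steps, which is why it's only worth enabling below ~16GB.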
🚀 Want faster speed? Drop steps to 4. The examples here used 10 steps and ran in ~25 seconds on an RTX 5090.
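Step count is an ordinary sampler setting in the workflow. For a script-level equivalent, recent diffusers releases ship a WanPipeline; the snippet below is a hedged sketch (the repo id and prompt are assumptions, and a single frame from the video model stands in for a still image):

```python
import torch
from diffusers import WanPipeline

# Assumed model repo id; check the workflow notes for the exact files.
pipe = WanPipeline.from_pretrained(
    "Wan-AI/Wan2.1-T2V-14B-Diffusers", torch_dtype=torch.bfloat16
).to("cuda")

result = pipe(
    prompt="cinematic portrait at golden hour, 35mm film grain",
    num_frames=1,            # one frame from a video model = a still image
    num_inference_steps=4,   # 4 for speed; the examples above used 10
)
still = result.frames[0][0]  # first frame of the first generated clip
```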
🔥 After lots of testing, I'm getting better results than FLUX: faces are more realistic, with none of the usual FLUX face issues.
To simplify things, you can swap models within a single workflow instead of keeping a separate workflow for each model. Links to the models are in the workflows.
❓ What's the difference between these two workflows?
Glad you asked! After plenty of testing:
🎬 SkyReels gives a slightly more cinematic and realistic look.
🔍 WAN is a bit sharper and punchier, but not as cinematic.
That said, LoRA choice heavily affects output, so I recommend trying both workflows with your prompts to see which one fits your style best.
🔧 These features currently work with the WAN wrapper:
🖋️ Text-to-Image
🧠 ControlNet (DepthAnything, DW Pose, Canny, Lineart, etc.; a Canny example follows this list)
🖼️ Image-to-Image (enhance/upscale with denoise control; the denoise math is sketched below)
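Each ControlNet preprocessor in the list above turns a reference image into a guidance map. ComfyUI has dedicated nodes for these; as a concrete example, here is roughly what the Canny one boils down to in plain OpenCV (thresholds are typical defaults, tune to taste):

```python
import cv2
import numpy as np

img = cv2.imread("reference.jpg")             # any reference photo
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)  # Canny wants one channel
edges = cv2.Canny(gray, 100, 200)             # hysteresis thresholds
control = np.stack([edges] * 3, axis=-1)      # back to a 3-channel image
cv2.imwrite("canny_control.png", control)     # feed this to ControlNet
```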
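And the denoise control on Image-to-Image works the way it does in any latent img2img: the input image is encoded, noised partway into the schedule, and only the remaining steps are denoised. A small sketch of the arithmetic (the function name is made up for illustration):

```python
def img2img_steps(total_steps: int, denoise: float) -> list[int]:
    """Denoise 0.0 returns the input untouched; 1.0 regenerates it
    from scratch. Only the last `denoise` fraction of steps run."""
    start = int(total_steps * (1.0 - denoise))
    return list(range(start, total_steps))

print(img2img_steps(10, 0.4))  # [6, 7, 8, 9] -> gentle enhancement
print(img2img_steps(10, 1.0))  # all 10 steps -> full regeneration
```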
✅ Native and native GGUF support are available now; ControlNet and Image-to-Image for those are coming soon!
👀 Why do these look so good? Simple: they're trained on video, meaning way more frames per concept, and WAN is a 14B model, much larger than FLUX. That scale and frame data really show in the results.
Give them a spin and let me know what you think!
🔗 Need help crafting text-to-image prompts? I made a GPT just for that: Perfect Text-to-Image Prompt Creator