Sign In

welcome to test this Elsa (HiDream,Wan2.1,Hunyuan,Flux)

63
687
296
15
Updated: May 7, 2025
characterelsa
Verified:
SafeTensor
Type
LoRA
Stats
53
0
Reviews
Published
May 6, 2025
Base Model
HiDream
Hash
AutoV2
6168718E34
St Patrick's Day Badge
SamLiu

HiDream:

Amazing! HiDream feels like the next version of Flux—it's easy to train and captures details brilliantly! Although some instability in appearance still exists, that doesn't overshadow its performance.

Unfortunately, running HiDream is extremely demanding on hardware. It has three versions, and even the Fast version is still quite slow for me.

Plus, the pre-training preparations were a real pain. This LoRA is just for testing, so it's not optimized for the best performance, and the training dataset was incomplete (for comparative experiments)."

I think this could be one of the next generation models we can expect!

Detailed introduction here: https://comfyui-wiki.com/en/tutorial/advanced/image/hidream/i1-t2i

Wan2.1-14B (T2V)

I stopped the training too early without saving checkpoints - it would've performed better if continued. But this version should still be good enough for us to evaluate Wan2.1-14B's quality. Hope I'm not too late sharing this. The reason I avoided training 14B before was its massive weight files and painfully slow testing - that's why I only uploaded images initially. Did you know technically they treat images as 1-frame videos? Even with dual 4090X2 on cloud, it runs at 3 seconds per step (vs HunyuanVideo's 1 sec/step).

During testing, I noticed two key traits about 14B:

  1. It's much more resistant to overtraining than other models.

  2. Its output is cleaner/less noisy than HunyuanVideo's."

Wan2.1-1.3B

All these examples were generated using wan2.1-1.3B, and the training was done with the official 1.3B weighted model. I know, you're probably wondering why there are so many Elsa Lora. She's kind of my go-to character for testing new models – there are some other reasons too, both personal and technical, but I doubt you'd be interested in those.

Anyway, the point is Hunyuan is generally better than wan at picking up on a character's face and clothes from the training images. It usually does a pretty good job with T2V (text-to-video).

Wan is used more for I2V (image-to-video).

Flux-Elsa in winter dress

I realized that Flux's Lora doesn't work well with multiple sets of Elsa's outfits, so I tried training a set separately. However, the result wasn't as good as I expected. Flux is confusing me—there's something that's holding back the character's resemblance.

Flux-test

This might be a Civitai platform issue - the updated version I uploaded returned a 404 error (likely lost during update).

Welcome to test this Flux dev model, I may delete it after a certain time.

It was such a crude attempt that I released the final model without having time to test it, in order to use civitai's online generation capabilities