
SDXL VAE finetune + VAE training script

Rating: 5 (130 reviews)
Author: Wasabiya


Updated: Aug 31, 2024

tool

vae training

Verified: a year ago

SafeTensor

Details

Type	VAE
Stats	3,510 428k 92.9k
Reviews	Very Positive (130)
Published	Aug 31, 2024
Base Model	SDXL 1.0
Hash	AutoV2 BCFE8BDC27

1 File


This is v1 of my attempt at finetuning SDXL's VAE, and I also wanted to share the training script. The script itself is on GitHub: https://github.com/kukaiN/vae_finetune

I'm doing all this while on vacation, so apologies for the short description. The finetuning script's README explains where the original script came from and the modifications I added for mixed precision training and for converting the model keys from the diffusers format to SD's format.

I'm posting v1 of my first attempted VAE training, but this one is a failure. I finetuned for 5 epochs on anime images (around 60k images from my checkpoint data). When I compare the weight differences and the cosine similarity between the original and finetuned weights, I can see that the model was trained, but it seems that training in bf16 with a low lr left the finetuned VAE barely changed. Although the model hash is different, the weights aren't different enough to produce any noticeable difference.
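The kind of weight comparison described above can be sketched as follows. This is a minimal, standard-library illustration and not the code used for this model: `compare_weights` is a hypothetical helper, each tensor is assumed to be already flattened to a list of floats, and the values in the example are made up. With real checkpoints you would first load both state dicts and flatten each tensor.

```python
import math
from typing import Dict, Sequence, Tuple


def compare_weights(
    base: Dict[str, Sequence[float]],
    tuned: Dict[str, Sequence[float]],
) -> Dict[str, Tuple[float, float]]:
    """Return {key: (mean absolute difference, cosine similarity)}.

    Only keys present in both dicts, with matching non-zero length, are compared.
    """
    report = {}
    for key in sorted(base.keys() & tuned.keys()):
        x, y = base[key], tuned[key]
        if len(x) != len(y) or len(x) == 0:
            continue  # skip tensors whose sizes do not line up
        dot = sum(p * q for p, q in zip(x, y))
        norm_x = math.sqrt(sum(p * p for p in x))
        norm_y = math.sqrt(sum(q * q for q in y))
        # Cosine similarity is undefined if either vector is all zeros.
        cos = dot / (norm_x * norm_y) if norm_x and norm_y else float("nan")
        mean_abs_diff = sum(abs(p - q) for p, q in zip(x, y)) / len(x)
        report[key] = (mean_abs_diff, cos)
    return report


# Toy values (not real VAE weights): one element nudged slightly.
base_weights = {"decoder.conv_in.weight": [0.10, -0.20, 0.30, 0.40]}
tuned_weights = {"decoder.conv_in.weight": [0.10, -0.20, 0.30, 0.41]}

for name, (diff, cos) in compare_weights(base_weights, tuned_weights).items():
    print(f"{name}: mean|diff|={diff:.6f} cosine={cos:.6f}")
```

A cosine similarity very close to 1 together with a tiny mean absolute difference is the pattern you would expect when a finetune has barely moved the weights.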

I plan to train a 2-epoch version with fp16 and a higher lr to see what happens.
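For context on the fp16 plan: bf16 keeps 8 significant bits while fp16 keeps 11, so fp16 can represent smaller relative changes to a weight. The sketch below rounds a value to each format using only the standard library. It only illustrates number-format spacing. Whether this is what limited the v1 run is an assumption, since mixed precision setups often keep fp32 master weights, and the helper names are illustrative rather than taken from the training script.

```python
import struct


def round_to_bf16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value (ties to even)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    lsb = (bits >> 16) & 1                    # lowest bit that bf16 keeps
    bits = (bits + 0x7FFF + lsb) & 0xFFFFFFFF  # round to nearest, ties to even
    bits &= 0xFFFF0000                        # drop the 16 low bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def round_to_fp16(x: float) -> float:
    """Round a float to the nearest IEEE half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


# Hypothetical weight and update size, chosen only for illustration.
weight = 0.5
update = 3e-4

print("bf16:", round_to_bf16(weight + update))
print("fp16:", round_to_fp16(weight + update))
```

With these particular numbers the update is rounded away in bf16 but survives in fp16. A higher lr produces larger updates, which are less likely to be lost in either format.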