Type | |
Stats | 198 0 |
Reviews | (22) |
Published | Aug 25, 2024 |
Base Model | |
Hash | AutoV2 13EFBF38A7 |
An experimental attempt to convert Pony v6 to a V-prediction model with fixed Zero Terminal Timestep noise schedule, as well as an attempt to bring some anime artists and characters while not breaking Pony's knowledge using an extremely small dataset.
v0.1
You must set sampling type to V-prediction and apply Zero Terminal SNR patch, otherwise you will get noise.
Artists: https://files.catbox.moe/ajszqh.txt
Characters: https://files.catbox.moe/hg99to.txt
They don't work mostly due to how underfit this model is, while the main goal was to convert it to vpred. But you still can try raising weights on them. You can use this model in either ComfyUI or reForge.
Lemme go on a tangent for a second. While baking this, I've come to conclusion that Pony v6 is actually a decently-trained model if you don't count the fact that it nuked most of SDXL's knowledge. While it's a lot harder to train for it, the end result is very stable and well-generalized. I've also started thinking that most people who tried to train Pony were doing it wrong mainly due to assumption that LoRAs are a fine and problem-free solution for SDXL. As to why this is not the case, I think one of the main reasons is well-explained here: https://github.com/kohya-ss/sd-scripts/discussions/294#discussioncomment-10081465 and below.