CatWai (merge)

Updated: Jan 19, 2025
Verified: SafeTensor
Type: Checkpoint Merge
Published: Jan 19, 2025
Base Model: Illustrious
Hash (AutoV2): E49BA8E9A2
Akiseki

To clarify: this is an epsilon-prediction model, although some of the models used in the merge are v-pred.

This checkpoint merge is meant as an epsilon-pred adaptation of the v-pred model catTowerNoobaiXL that keeps its high quality. If you can run a v-pred model, please also check that one out : )

v1.5 is built on catTowerNoobaiXL_v15Vpred and waiNSFWIllustrious_v90

v1.4 is built on catTowerNoobaiXL_v14Vpred and waiNSFWIllustrious_v70

v1.0_fix is built on catTowerNoobaiXL_v11Vpred and waiNSFWIllustrious_v70


This is my first checkpoint merge, an experimental attempt to turn a v-prediction model into an eps-prediction model.

It's mainly built on catTowerNoobaiXL (v-prediction) and the final merge is in eps-prediction format so that you can use it in vanilla webui.

This was achieved by gradually shifting its output layers toward an eps model. The model chosen to replace those parts was (90% waiNSFWIllustrious + 10% ebara_pony_2.1). The converted model was then merged with waiNSFWIllustrious at a 9:1 ratio, so roughly speaking it's about 70% CatTower + 28% wai + 2% ebara. That's why I call it CatWai.
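To make the ratios above concrete, here is a minimal sketch of the per-block arithmetic. It assumes `x` is the fraction of catTower kept in a given block before the final 9:1 merge. The function name `composition` is made up for illustration and is not part of the actual workflow.

```python
def composition(x):
    """Effective (catTower, wai, ebara) shares for one block.

    x is the fraction of catTower kept in that block before the
    final 9:1 merge with waiNSFWIllustrious (an assumed reading).
    """
    cat = 0.9 * x                    # catTower survives the final 9:1 merge
    wai = 0.9 * (1 - x) * 0.9 + 0.1  # wai inside the replacement + final 10%
    ebara = 0.9 * (1 - x) * 0.1      # ebara only enters via the replacement
    return cat, wai, ebara


for x in (1.0, 0.5, 0.0):
    cat, wai, ebara = composition(x)
    print(f"x={x}: catTower {cat:.1%}, wai {wai:.1%}, ebara {ebara:.1%}")
```

Blocks kept fully from catTower end up 90% catTower and 10% wai, while fully replaced blocks end up 91% wai and 9% ebara. The 70 / 28 / 2 figure is a rough average over the whole model.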

The merge was done in ComfyUI. You can drag the model file or the json workflow into ComfyUI to see the detailed merge workflow. The json can be found in the zip uploaded as "training data". With the ComfyUI workflow you can skip the 6GB download if you already have the 3 models mentioned above on disk : ) A text version of the recipe is also included below.

Many thanks to the source models' authors, especially for the catTower model, which is so nicely crafted that it got me excited about AI painting again! As required, the license for this merge is the same as catTower's: commercial use is prohibited.

Originally I wanted to post this as a patch LoRA on top of catTower, but that failed. I did get something close, but the quality drop was significant even at dim=conv_dim=512.

Settings

I'm new to IL / Noob so this setup may not be optimal.

Positive Prompt:

masterpiece, best quality, absurdres, safe

Negative Prompt:

lowres, worst quality, low quality, bad anatomy, bad hands, 4koma, comic, greyscale, censored, jpeg artifacts, overly saturated, overly vivid

Sampling method: Euler

Sampling steps: 30

Resolution: personally I recommend 1024 x 1536 but people use 832 x 1216 more often

CFG Scale: 5

Clip skip: 2
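If you drive the webui through its API rather than the UI, the settings above could be expressed roughly as follows. This is a sketch, assuming an AUTOMATIC1111-style webui started with `--api` and its `/sdapi/v1/txt2img` endpoint. The helper name `build_txt2img_payload`, the example subject tags, and putting the quality tags first are illustrative choices, not part of the original setup.

```python
import json

# Prompts copied from the settings above.
POSITIVE_PREFIX = "masterpiece, best quality, absurdres, safe"
NEGATIVE_PROMPT = (
    "lowres, worst quality, low quality, bad anatomy, bad hands, 4koma, "
    "comic, greyscale, censored, jpeg artifacts, overly saturated, overly vivid"
)


def build_txt2img_payload(subject_tags, width=1024, height=1536):
    """Build a txt2img request body (hypothetical helper)."""
    return {
        "prompt": f"{POSITIVE_PREFIX}, {subject_tags}",
        "negative_prompt": NEGATIVE_PROMPT,
        "sampler_name": "Euler",
        "steps": 30,
        "cfg_scale": 5,
        "width": width,    # 1024 x 1536 read as width x height (assumption)
        "height": height,
        # Clip skip is a webui option, so it is passed as an override.
        "override_settings": {"CLIP_stop_at_last_layers": 2},
    }


# Example subject tags are placeholders.
payload = build_txt2img_payload("1girl, solo, looking at viewer")
print(json.dumps(payload, indent=2))
```

The resulting JSON would be sent as the body of a POST request to the endpoint. Pass `width=832, height=1216` for the more commonly used resolution.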

Recipe

v1.4 uses the same merge workflow, with catTowerNoobaiXL_v11Vpred replaced by catTowerNoobaiXL_v14Vpred.

v1.5 uses the same merge workflow, with catTowerNoobaiXL_v11Vpred replaced by catTowerNoobaiXL_v15Vpred and waiNSFWIllustrious_v70 replaced by waiNSFWIllustrious_v90.

You can also drag the model file or the json workflow into ComfyUI to see the detailed merge workflow. The json can be found in the zip uploaded as "training data".

A = 0.9 * waiNSFWIllustrious_v70 + 0.1 * ebara_pony_2.1
B = x * catTowerNoobaiXL_v11Vpred + (1 - x) * A
where x is [1.0, 1.0, 1.0, 0.7, 0.5, 0.25, 0.1, 0.0, 0.0, 0.0]: the first nine values are for output_blocks 0-8 and the last one is for the remaining out blocks. x is 1.0 for all other blocks (i.e. input_blocks + mid_blocks + others)
result = 0.9 * B + 0.1 * waiNSFWIllustrious_v70
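
For readers who prefer code to the ComfyUI graph, the text recipe can be sketched in Python. This is not the workflow itself, only an illustration under several assumptions: the three checkpoints are loaded as dicts with identical keys, UNet keys follow the usual `model.diffusion_model.output_blocks.N.` naming, and "remaining out blocks" means the final `model.diffusion_model.out.` layers. The names `cat_share` and `merge` are made up for this example.

```python
import re

# Share of catTower (x) for output_blocks 0-8, taken from the recipe.
OUTPUT_BLOCK_X = [1.0, 1.0, 1.0, 0.7, 0.5, 0.25, 0.1, 0.0, 0.0]
# Share for the remaining out blocks (assumed to be the final "out" layers).
REMAINING_OUT_X = 0.0

# Assumed key layout of the UNet inside the checkpoint.
_OUTPUT_BLOCK = re.compile(r"model\.diffusion_model\.output_blocks\.(\d+)\.")
_FINAL_OUT = "model.diffusion_model.out."


def cat_share(key):
    """Return x for a weight key; every non-output key keeps x = 1.0."""
    match = _OUTPUT_BLOCK.match(key)
    if match:
        return OUTPUT_BLOCK_X[int(match.group(1))]
    if key.startswith(_FINAL_OUT):
        return REMAINING_OUT_X
    return 1.0


def merge(cat, wai, ebara):
    """Apply A, B and result from the recipe to every weight.

    Values only need to support * and +, so plain floats work for
    testing and tensors would work for real checkpoints.
    """
    result = {}
    for key, cat_w in cat.items():
        x = cat_share(key)
        a = 0.9 * wai[key] + 0.1 * ebara[key]   # A
        b = x * cat_w + (1 - x) * a             # B
        result[key] = 0.9 * b + 0.1 * wai[key]  # result
    return result
```

With real checkpoints, `cat`, `wai` and `ebara` would be the loaded state dicts of catTowerNoobaiXL_v11Vpred, waiNSFWIllustrious_v70 and ebara_pony_2.1. Loading and saving are left out here.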

License

License is the same as catTower: commercial use is prohibited.