Sign In

Waifu Diffusion - Beta 03

94
1.2k
7
Updated: Oct 5, 2024
base modelanimewoman
Verified:
SafeTensor
Type
Checkpoint Trained
Stats
1,247
Reviews
Published
May 15, 2023
Base Model
SD 2.1 768
Training
Epochs: 3
Hash
AutoV2
D38E779546

Waifu Diffusion - Beta 03

Reuploaded from Huggingface to civitai for enjoyment.

WD 1.5 Beta 3 is fine-tuned directly from stable-diffusion-2-1 (768), using v-prediction and variable aspect bucketing (maximum pixel area of 896x896) with real life and anime images. Given the broad range of concepts encompassed in WD 1.5, we expect it to serve as an ideal candidate for further fine-tuning, LoRA's, and other embedding applications. - [Notion.site]

Author's Notes

Model is good. Think of it like NAI when it first came out. It's a good way to kickstart a lot of finetuning right? Well you can just do that with WD 1.5 B3. - KaraKaraWitch

Aesthetic Models?

To be uploaded.

Installation

  1. Download the 3 files.

  2. Same deal how you install SD 2.1.

  3. Use the magic sauce VAE

If you can't do that well uhhh... I guess try and google and figure it out? I think this could help.

Usage

Use the following "mastering" prompts for improved looks:

Positive Prompt:

(exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed, anime, waifu:1.2)

Negative Prompt:

lowres, ((bad anatomy)), ((bad hands)), missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)

What can it do?

Model can do the following:

- Realistic (realistic, real life:1.2) In positive.

- Horny (The typical stuff, probably better with finetunes lol)

- Whatever you want to tune it with lol.

- Tuning is rather easy too. LoRA works (Kohya ones) and LyCORS (Tested LoCon and it works sooo yeah.)

What's new?

  • Fixed Text Encoder Training, so TE is actually trained now, give it a try if you're from Beta 2.

Loicense (License)

It's... Complicated.

TLDR: Just follow the Fair AI Public License 1.0-SD (https://freedevproject.org/faipl-1.0-sd/). If any derivative of this model is made, please share your changes accordingly. Special thanks to ronsor/undeleted (https://undeleted.ronsor.com/) for help with the license.

It does kinda go against the spirit of civitai but uhhh whatever lol.

How 2 Train a Drag- Waifu Diffusion

1. BLIP/BLIP2 and WD Tagger to provide booru tags and natural language captions to every image.

2. Apply Date gradient

3. Bucket π’œπ‘’π“ˆπ“‰π’½π‘’π“‰π’Ύπ’Έ Aesthetic into Exceptional, Best, Normal & bad.

4. Add stars to Booru images & bucket em. (Masterpiece, Best, High, Medium, Normal, Low & Worst)

5. Train

6. ???

7. Profit.

How to Lycoris/Locon/LoRA train

KaraKaraWitch here, okay so here are some of my comments from inital trial runs with WD 1.5 B3 and some common pitfalls.

1. Use the VAE provided. Do not use the builtin model VAE.

2. Enable --v2 and --v_parameterization

3. Train as per usual

"Wait what that's it?!"

Yes. The Do not however that your final loss should hover around 0.3. Any lower (like 0.29) might indicate overfitting issues.

"Amongus sus"

I meannn I only did a couple of styles and it did work out like that soo...

Where the fp32 version?!

According to devs, there is no perceivable difference in terms of quality when using either fp16 or fp32. (Unless you use memory opts like xformers. Those will cause more bigger issues than saving at fp32.)

I want it to be Diffused! (Diffusers format)

See when salt uploads the thingy to HF lol

Sooo what's the point is this model?!

Like I said in the beginning:

> Think of it like NAI when it first came out. It's a good way to kickstart a lot of finetuning right? Well you can just do that with WD 1.5 B3.

It is recommended and encouraged to finetune and/or locon/lora it!