home models images videos 3D Models articles comics challenges updates shop

Waifu Diffusion - Beta 03

Name: Waifu Diffusion - Beta 03
Rating: 5 (95 reviews)
Author: KaraKaraWitch

1.6k

Updated: Oct 5, 2024

base model

anime woman

Download

1 variant available

fp16 SafeTensor

wd-beta3-base-fp16.safetensors

Half precision, best balance • 2.4 GB

Verified: 3 years ago

Download (2.4 GB)

This checkpoint includes a config file, download and place it along side the checkpoint.

Required Components

You need these files to run this model. We'll show the best match for your preferences.

Config

wd-beta3-base-fp16.yaml • 1.77 KB

Verified: 3 years ago

Downloads your preferred variants

Details

Type

Checkpoint Trained

Stats

1,634

Reviews

Very Positive

(95)

Published

May 15, 2023

Base Model

SD 2.1 768

Training

Epochs: 3

Hash

AutoV2

D38E779546

Tensors

About this version

default creator card background decoration

127

KaraKaraWitch

Joined Dec 7, 2022

License:

CreativeML Open RAIL-M

01160-3190420558-portrait, (solo, 1girl), day time, flat chest, (Black eyes), medium hair,(Green hair) curly hair, (Light Red military uniform),.png

Waifu Diffusion - Beta 03

Reuploaded from Huggingface to civitai for enjoyment.

WD 1.5 Beta 3 is fine-tuned directly from stable-diffusion-2-1 (768), using v-prediction and variable aspect bucketing (maximum pixel area of 896x896) with real life and anime images. Given the broad range of concepts encompassed in WD 1.5, we expect it to serve as an ideal candidate for further fine-tuning, LoRA's, and other embedding applications. - [Notion.site]

Author's Notes

Model is good. Think of it like NAI when it first came out. It's a good way to kickstart a lot of finetuning right? Well you can just do that with WD 1.5 B3. - KaraKaraWitch

Aesthetic Models?

To be uploaded.

Installation

Download the 3 files.
Same deal how you install SD 2.1.
Use the magic sauce VAE

If you can't do that well uhhh... I guess try and google and figure it out? I think this could help.

Usage

Use the following "mastering" prompts for improved looks:

Positive Prompt:

(exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed, anime, waifu:1.2)

Negative Prompt:

lowres, ((bad anatomy)), ((bad hands)), missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)

What can it do?

Model can do the following:

- Realistic (realistic, real life:1.2) In positive.

- Horny (The typical stuff, probably better with finetunes lol)

- Whatever you want to tune it with lol.

- Tuning is rather easy too. LoRA works (Kohya ones) and LyCORS (Tested LoCon and it works sooo yeah.)

What's new?

Fixed Text Encoder Training, so TE is actually trained now, give it a try if you're from Beta 2.

Loicense (License)

It's... Complicated.

TLDR: Just follow the Fair AI Public License 1.0-SD (https://freedevproject.org/faipl-1.0-sd/). If any derivative of this model is made, please share your changes accordingly. Special thanks to ronsor/undeleted (https://undeleted.ronsor.com/) for help with the license.

It does kinda go against the spirit of civitai but uhhh whatever lol.

How 2 Train a Drag- Waifu Diffusion

1. BLIP/BLIP2 and WD Tagger to provide booru tags and natural language captions to every image.

2. Apply Date gradient

3. Bucket ~~𝒜𝑒𝓈𝓉𝒽𝑒𝓉𝒾𝒸~~ Aesthetic into Exceptional, Best, Normal & bad.

4. Add stars to Booru images & bucket em. (Masterpiece, Best, High, Medium, Normal, Low & Worst)

5. Train

6. ???

7. Profit.

How to Lycoris/Locon/LoRA train

KaraKaraWitch here, okay so here are some of my comments from inital trial runs with WD 1.5 B3 and some common pitfalls.

1. Use the VAE provided. Do not use the builtin model VAE.

2. Enable --v2 and --v_parameterization

3. Train as per usual

"Wait what that's it?!"

Yes. The Do not however that your final loss should hover around 0.3. Any lower (like 0.29) might indicate overfitting issues.

"Amongus sus"

I meannn I only did a couple of styles and it did work out like that soo...

Where the fp32 version?!

According to devs, there is no perceivable difference in terms of quality when using either fp16 or fp32. (Unless you use memory opts like xformers. Those will cause more bigger issues than saving at fp32.)

I want it to be Diffused! (Diffusers format)

See when salt uploads the thingy to HF lol

Sooo what's the point is this model?!

Like I said in the beginning:

> Think of it like NAI when it first came out. It's a good way to kickstart a lot of finetuning right? Well you can just do that with WD 1.5 B3.

It is recommended and encouraged to finetune and/or locon/lora it!