Type | |
Stats | 204 |
Reviews | (27) |
Published | Dec 4, 2024 |
Base Model | |
Training | Epochs: 10 |
Hash | AutoV2 91F0729DB0 |
UrangDiffusion v2.5 (oo-raw-ng Diffusion) brings a whole-new training method compared to the v2.0. The model provide more flexibility and brings some updated dataset.
The name “Urang” comes from Sundanese, meaning “We/Our/I.” The history behind the name is to make the model not only suitable for me but also for many people. Another reason is that I use many resources (training scripts, dataset collecting scripts, etc.) from other people. It’s unfair to claim this model as “my sole work.”
Standard Prompting Guidelines
The model is finetuned from Animagine XL 3.1, which is trained with danbooru tags. However, there is a little bit changes on dataset captioning, therefore there is some different default prompt used:
Default prompt:
1girl/1boy, character name, from what series, everything else in any order, masterpiece, best quality, high score, great score, absurdres.
Default negative prompt:
lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, low score, bad score, average score, signature, watermark, username, blurry,
Default configuration: Euler a with around 25-30 steps, CFG 5-7, and ENSD set to 31337. Sweet spot is around 25 steps and CFG 7.
Training Configurations
Finetuned from: UrangDiffusion v2.0
Pretraining I (Crashed mid training):
Dataset size: ~50,300 images
GPU: 1xH100 80GB
Optimizer: AdaFactor
Unet Learning Rate: 3.75e-6
Text Encoder Learning Rate: 1.875e-6
Batch Size: 16
Gradient Accumulation: 3
Warmup steps: 100 steps
Min SNR: 5
Epoch: 4
Random Cropping: True
Loss: Huber
Huber Schedule: SNR
Huber C: 0.1
Pretraining II (Continue from epoch 4):
Dataset size: ~50,300 images
GPU: 1xA100 80GB
Optimizer: AdaFactor
Unet Learning Rate: 3.75e-6
Text Encoder Learning Rate: 1.875e-6
Batch Size: 16
Gradient Accumulation: 3
Warmup steps: 100 steps
Min SNR: 5
Epoch: 6
Random Cropping: True
Loss: Huber
Huber Schedule: SNR
Huber C: 0.1
CyberFix:
Epoch 10 + (Cyberrealistic XL v3.1 - SDXL 1.0 Base) = Temp
Temp model's text encoder changed with Epoch 10's text encoder.
Added/Updated Series, Characters, and Styles
v2.5
Artists:
ningen mame
ciloranko
rhasta
atdan
sho (sho lwlw)
tianliang
duohe
fangdongye
chen bin
ishikei
ask (askzy)
wlop
tsurusaki takahiro
kokosando
wagashi (dagashiya)
nawakena
kedama milk
hiten (hitenkei)
matanonki
sy4
houraku
fuzichoco
sencha (senchat)
rei (sanbonzakura)
houkisei
alp
dino (dinoartforame)
kuroduki (pieat)
maeda hiroyuki
tabi (tabisumika)
yuumei
rella
konya
karasue
hoshi (snacherubi)
modare
creayus
reoen
kawacy
wanke
kousaki rui
chyoel
lpip
kaninn
azuuru
mignon
amazuyu
tatsuki
shiro9jira
novelance
lack
airseal
huanxiang
heitu
rsef
machi (machi0910)
meion
z3zz4
ame (uten cancel)
healthyman
wagashi (dagashiya)
yamamoto souichirou
freng
kaede (sayappa)
masaki (ekakiningen)
asakuraf
misaka 12003-gou
hood (james x)
as109
yd (orange maru)
void 0
fajyobore
alphonse (white datura)
akita hika
nanaken
nana
muchi
maro
shisoneri
tottotonero
mochirong
nixeu
fujiyama
qizhu
kase daiki
ke-ta
tidsean
aki99
hitsukuya
shimmer
morikura en
ringeko-chan
pottsness
torino aqua
zelitto
personal ami
lm7 (op-center)
quan (kurisu tina)
migolu
shiki (psychedelic g2)
mizumizuni
kita (kitairoha)
kousaki rui
mofu
namako
omone
hokoma
agm
tab head
neoartcore
sciamano240
kuroboshi kouhaku
huke
lam (ramdayo)
nyori
yano mitsuki (nanairo)
yatomi
amashiro natsuki
yukisame
mishima kurone
teshima nari
shigure ui
orobou
v2.0
Series:
zenless zone zero
wuthering waves
sewayaki kitsune no senko-san
Honkai: Star Rail:
firefly
acheron
sparkle
robin
aventurine
black swan
feixiao
yunli
lingsha
march 7th (hunt)
jade
jiaoqiu
gallagher
rappa
misha
Hololive Talents:
hololive indonesia
raora panthera
elizabeth rose bloodflame
gigi murin
cecilia immergreen
Genshin Impact:
arlecchino
clorinde
chiori
mualani
xianyun
sigewinne
kinich
xilonen
emilie
gaming
kachina
sethos
Others:
landscape
several concepts to fix anatomy issue
Special Thanks
My co-workers(?) at CagliostroLab for the insights and feedback.
Nur Hikari and Vanilla Latte for quality control.
Linaqruf, my tutor and role model in AI-generated images, and also the person behind tag ordering.
License
UrangDiffusion falls under the Fair AI Public License 1.0-SD license.