
UrangDiffusion v3.1

Name: UrangDiffusion v3.1
Rating: 5 (388 reviews)
Author: kayfahaarukku

Updated: May 4, 2025

base model

anime woman game character girls man

Download

1 variant available

fp16 SafeTensor

UrangDiffusion-3.1-8325-000015.safetensors

Half precision, best balance • 6.46 GB

Verified: a year ago


Details

Type: Checkpoint Trained
Stats: 927, 2.6K
Reviews: Very Positive (118)
Published: Apr 18, 2025
Base Model: SDXL 1.0
Training: 15 epochs
Hash: AutoV2 4243EFE3EA

About this version

UrangDiffusion XL v3.1 is fine-tuned from Animagine XL 4.0 Base (not Zero). Animagine XL 4.0 Base is the pre-trained checkpoint that the final Animagine XL 4.0 release (not the Opt version) was built on.

I have received permission from the team to fine-tune the base model using my own method and release it under the UrangDiffusion series.

Base model: Animagine XL 4.0 Base
Fine-tuning details:

Dataset size: ~1,600 images
GPU: 1× A100 80GB
Optimizer: AdaFactor
UNet learning rate: 1.25e-6
Text encoder learning rate: N/A (disabled)
Batch size: 48
Gradient accumulation: 1
Warmup steps: 5%
Minimum SNR: 5
Epochs: 15
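
As a rough sanity check on the settings above, the number of optimizer steps can be estimated from the dataset size, batch size, and epoch count. This is a minimal sketch, not the author's training script: it assumes exactly 1,600 images (the source says "~1,600"), no dropped partial batch, and no aspect-ratio bucketing, and the function name estimate_schedule is made up for illustration.

```python
import math


def estimate_schedule(dataset_size, batch_size, grad_accum, epochs, warmup_fraction):
    """Estimate optimizer steps for a fine-tuning run.

    Assumes the last partial batch of each epoch is kept (hence ceil) and
    that warmup is a fraction of the total step count, rounded down.
    """
    effective_batch = batch_size * grad_accum
    steps_per_epoch = math.ceil(dataset_size / effective_batch)
    total_steps = steps_per_epoch * epochs
    warmup_steps = math.floor(total_steps * warmup_fraction)
    return steps_per_epoch, total_steps, warmup_steps


# Values from the fine-tuning details; 1600 is an approximation of "~1,600".
steps_per_epoch, total_steps, warmup_steps = estimate_schedule(
    dataset_size=1600, batch_size=48, grad_accum=1, epochs=15, warmup_fraction=0.05
)
print(steps_per_epoch, total_steps, warmup_steps)
```

Under these assumptions the run comes to about 34 steps per epoch and 510 steps in total, with roughly 25 warmup steps. The real numbers depend on the trainer and the exact dataset size.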

Due to some quirks of the model, please keep the following in mind:

v3.0 may perform better with anatomy
v3.1 may perform better with more fluid poses

If you encounter anatomical issues at 28 steps, try lowering to 27 or increasing to 29. If it improves but isn’t perfect, continue adjusting slightly up or down. If the result worsens, the previous step count was likely the optimal one.
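
The adjustment procedure above amounts to a small hill climb over the step count. The sketch below is one way to write it down, assuming you can rate each result with a number (for example, your own 1-10 rating of images generated at a fixed seed). The score callable, the tune_steps name, and the 20-40 bounds are hypothetical and not part of the model or any tool.

```python
from typing import Callable


def tune_steps(score: Callable[[int], float], start: int = 28,
               lo: int = 20, hi: int = 40) -> int:
    """Move one step at a time from `start` while the score keeps improving.

    Tries going down first, then up. As soon as a result is no better than
    the best so far, the previous step count is kept.
    """
    best = start
    best_score = score(start)
    for direction in (-1, 1):
        candidate = start + direction
        moved = False
        while lo <= candidate <= hi:
            candidate_score = score(candidate)
            if candidate_score <= best_score:
                break  # got worse (or no better): keep the previous count
            best, best_score = candidate, candidate_score
            candidate += direction
            moved = True
        if moved:
            break  # this direction helped, so do not try the other one
    return best


# Toy rating function that peaks at 30 steps, standing in for human judgment.
print(tune_steps(lambda n: -abs(n - 30)))
```

In practice the rating comes from looking at the images, so keep the seed and prompt fixed while changing only the step count.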

UPDATE [19/04/2025]

Some generations are stable at 30 or more steps with Euler a. You might want to crank up the steps a bit.
Some generations also look better with "realistic, 3d" added to the negative prompt. Try that too.

kayfahaarukku

Joined Feb 17, 2023

License: CreativeML Open RAIL++-M Addendum

[v3.1 is still undergoing further testing. New findings will be added to the "About this version" section.]

UrangDiffusion v3.1 (oo-raw-ng Diffusion) is the first UrangDiffusion version that uses Animagine XL 4.0 as its base.

The name “Urang” comes from Sundanese, meaning “We/Our/I.” The idea behind the name is that the model should suit not only me but also many other people. Another reason is that I use many resources (training scripts, dataset collection scripts, etc.) from other people, so it would be unfair to claim this model as “my sole work.”

Standard Prompting Guidelines

Prompting guide:

Default negative prompt: lowres, bad anatomy, bad hands, text, error, missing finger, extra digits, fewer digits, cropped, worst quality, low quality, low score, bad score, average score, signature, watermark, username, blurry
Default configuration: Euler a with around 25-30 steps, CFG 5-7, and ENSD set to 31337. Sweet spot is around 28 steps and CFG 6.
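
For anyone scripting generations, the defaults above can be collected into a single request body. This is a sketch under assumptions: the key names (negative_prompt, sampler_name, cfg_scale, override_settings, eta_noise_seed_delta) follow the AUTOMATIC1111 web UI txt2img API as commonly documented and should be checked against your own instance. The build_payload helper and the example prompt are placeholders. Nothing is sent over the network here.

```python
# Default negative prompt, copied from the prompting guide.
DEFAULT_NEGATIVE = (
    "lowres, bad anatomy, bad hands, text, error, missing finger, "
    "extra digits, fewer digits, cropped, worst quality, low quality, "
    "low score, bad score, average score, signature, watermark, "
    "username, blurry"
)


def build_payload(prompt, steps=28, cfg_scale=6.0, extra_negative=""):
    """Build a txt2img-style request body using the recommended defaults.

    `extra_negative` appends optional tags (e.g. "realistic, 3d") to the
    default negative prompt.
    """
    negative = DEFAULT_NEGATIVE
    if extra_negative:
        negative = f"{negative}, {extra_negative}"
    return {
        "prompt": prompt,
        "negative_prompt": negative,
        "sampler_name": "Euler a",
        "steps": steps,          # guide suggests around 25-30, sweet spot 28
        "cfg_scale": cfg_scale,  # guide suggests 5-7, sweet spot 6
        # Assumed option key for the ENSD setting in the web UI.
        "override_settings": {"eta_noise_seed_delta": 31337},
    }


payload = build_payload("1girl, solo", extra_negative="realistic, 3d")
print(payload["steps"], payload["cfg_scale"], payload["sampler_name"])
```

Keeping the defaults in one function makes it easy to vary only the step count or CFG while leaving the negative prompt and ENSD untouched.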

Training Configurations

Finetuned from: Animagine XL 4.0 Base (NOT 4.0-Zero)

Finetuning:

Dataset size: ~1,600 images
GPU: 1× A100 80GB
Optimizer: AdaFactor
UNet Learning Rate: 1.25e-6
Text Encoder Learning Rate: N/A (Turned off)
Batch Size: 48
Gradient Accumulation: 1
Warmup steps: 5%
Min SNR: 5
Epochs: 15
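
The "Min SNR: 5" entry is most likely the gamma value of Min-SNR loss weighting, although the source does not name the trainer or confirm this. Assuming that reading and an epsilon-prediction objective, each timestep's loss is scaled by min(SNR, gamma) / SNR. The small function below illustrates that formula only. Its name is made up, and it is not taken from the author's training code.

```python
def min_snr_weight(snr: float, gamma: float = 5.0) -> float:
    """Min-SNR loss weight for an epsilon-prediction model.

    Timesteps with SNR above `gamma` (low-noise steps) are down-weighted;
    timesteps with SNR at or below `gamma` keep a weight of 1.
    """
    if snr <= 0:
        raise ValueError("snr must be positive")
    return min(snr, gamma) / snr


# A low-noise timestep (SNR 10) gets half weight; a noisy one (SNR 2) is unchanged.
print(min_snr_weight(10.0), min_snr_weight(2.0))
```

The effect is to stop nearly clean timesteps from dominating the loss, which is the usual motivation for this setting.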

FAQ

Q: Images are sometimes noisy.
A: This is a common issue with Animagine XL 4.0 models in general. The base model was trained for only 10 epochs, which leaves it undertrained, unlike the Initial N or Initial I models, which were trained with more resources.

Q: Hires fix model?
A: Check the cover image metadata; you'll find it there.

Q: Initial N/Initial I is better.
A: Just leave and do not use the model. Simple. No need to announce your departure, unless you're willing to leave constructive feedback or fund future projects.

Special Thanks

My co-workers(?) at CagliostroLab for the insights and feedback.
Nur Hikari and Vanilla Latte for quality control.
Linaqruf, my tutor and role model in AI-generated images, and also the person behind tag ordering.

License

UrangDiffusion v1.0-v2.5 fall under the Fair AI Public License 1.0-SD, while v3.x falls under the CreativeML Open RAIL++-M license.