Updated: Dec 25, 2025
base modelRDBT [NetaYume]
Recalibrated distribution
Do NOT download TCFP8 version. I messed up with the metadata.
Huge quality loss. And yes, all cover images also made from tcfp8 version. I was thinking something is off.
I will upload full bf16 base version and cfg distilled version later today.
This model is part of the test theories to improve diffusion models.
Trained from NTYM4 with ~70k images.
Aiming for
Better textures and art details.
Better and stable prompt coherence.
Balanced contrast and lighting.
Base version:
Pretrained with 70k dataset, no distillation.
Slightly balanced contrast and lighting.
CFG distilled:
2x faster.
Balanced contrast and lighting. Never overflow/oversaturated.
Slightly style loss.
Guide
Prompt: Basically the same as NetaYume. Except:
Style prompt is required. This model does not have default style. The default tv anime style in NetaYume has been nuked.
Use "Digital anime art style by @xxxx." at the end of the prompt to prevent Gemma 2 paying too much and incorrect attention to the artist name.
Quality tags are not needed. Dataset has higher quality than avg "masterpiece".
You don't need tons of tags to describing a character. Just use the most unique ones. e.g. "elf girl frieren, fox girl tamamo \(fate\)". See: img.
Prefer simple natural language at the start, and tags at the end.
Settings:
All:
Timesteps shift 3~4.5 for better details. (from node ModelSamplingAuraFlow).
Base model:
CFG scale: 4. euler_a + normal.
Or CFG scale: 1.5. euler_cfg_pp + normal.
CFG Distilled model:
CFG scale: 1. Although CFG 1~1.5 is doable, if you want.
Sampler: Prefer euler_a + normal.
About CFG distilled model:
You can't control CFG scale and negative prompt. Those are trained inside the model.
CFG scale = 1 is a special value. It means disabling CFG and neg prompt.
Because you don't need to run a forward pass for the negative prompt, you can generate 2x faster.
Some training details
Total dataset contains ~70k images. Not equally weighted.
Only layers.[2:25] were trained.
Captions are mainly from Gemini. Natural language only, no tags.
Not a LoRA this time?
Multi stage training. No LoRA.
Versions
v0.1 base: no distillation.
v0.1 cd tcfp8: cfg distilled, also a tensorcorefp8 version for ComfyUI.





