RDBT - NTYM

Updated: Dec 25, 2025

base model

Verified: SafeTensor

Type: Checkpoint Trained

Published: Dec 23, 2025

Base Model: Lumina

Hash (AutoV2): 5AD36DB896


RDBT [NetaYume]

Recalibrated distribution


Do NOT download the TCFP8 version. I messed up the metadata.

The result is a huge quality loss. And yes, all the cover images were also made with the TCFP8 version. I had a feeling something was off.

I will upload the full bf16 base version and the CFG distilled version later today.


This model is part of a series of experiments testing theories for improving diffusion models.

Trained from NTYM4 with ~70k images.

Aiming for

  • Better textures and art details.

  • Better, more stable prompt coherence.

  • Balanced contrast and lighting.


Base version:

  • Pretrained on the ~70k-image dataset, no distillation.

  • Slightly balanced contrast and lighting.

CFG distilled:

  • 2x faster.

  • Balanced contrast and lighting. Never overflows or oversaturates.

  • Slight style loss.


Guide

Prompt: Basically the same as NetaYume, except:

  • A style prompt is required. This model has no default style. The default TV anime style in NetaYume has been nuked.

  • Put "Digital anime art style by @xxxx." at the end of the prompt to keep Gemma 2 from paying too much, or incorrect, attention to the artist name.

  • Quality tags are not needed. The dataset is of higher quality than the average "masterpiece".

  • You don't need tons of tags to describe a character. Just use the most distinctive ones, e.g. "elf girl frieren, fox girl tamamo \(fate\)". See: img.

  • Prefer simple natural language at the start and tags at the end.
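
The prompt layout above can be sketched as a small helper. This is only an illustration: `build_prompt` and its parameters are hypothetical names, not part of any tool, and the artist handle stays as the `xxxx` placeholder from the guide.

```python
def build_prompt(description, tags=(), artist="xxxx"):
    """Assemble a prompt: natural language first, tags next, style sentence last.

    Hypothetical helper for illustration; `artist` is a placeholder handle.
    """
    # Natural-language description goes at the start.
    parts = [description.strip().rstrip(".") + "."]
    if tags:
        # Only the most distinctive tags; no quality tags.
        parts.append(", ".join(t.strip() for t in tags) + ".")
    # The style sentence goes at the very end of the prompt.
    parts.append(f"Digital anime art style by @{artist}.")
    return " ".join(parts)


prompt = build_prompt(
    "An elf girl reading a book under a tree",
    tags=["elf girl frieren"],
)
print(prompt)
```

The description and tag used here are made-up inputs. The only fixed part is the ordering: description, then tags, then the style sentence.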

Settings:

All:

  • Timestep shift: 3~4.5 for better details (set in the ModelSamplingAuraFlow node).
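
To get a feel for what the shift value does, here is a minimal sketch. It assumes the node applies the usual flow-matching shift, shift * sigma / (1 + (shift - 1) * sigma). That formula is an assumption here, not something stated on this page, and `shift_sigma` is a hypothetical name.

```python
def shift_sigma(sigma, shift):
    """Remap a noise level in [0, 1] with the (assumed) flow-matching shift."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


# Compare an unshifted schedule (shift = 1) with both ends of the suggested range.
for shift in (1.0, 3.0, 4.5):
    remapped = [round(shift_sigma(s, shift), 3) for s in (0.25, 0.5, 0.75)]
    print(f"shift={shift}: {remapped}")
```

Under this assumption, the endpoints 0 and 1 stay fixed while intermediate noise levels are pushed toward 1 as the shift grows.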

Base model:

  • CFG scale: 4. euler_a + normal.

  • Or CFG scale: 1.5. euler_cfg_pp + normal.

CFG Distilled model:

  • CFG scale: 1. CFG 1~1.5 also works, if you want.

  • Sampler: Prefer euler_a + normal.
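
For scripted workflows, the settings above could be kept in a small lookup table. This is a sketch only: `RECOMMENDED`, `SHIFT_RANGE` and `pick_settings` are hypothetical names, and the sampler strings are copied as written on this page, so the exact identifier in your UI may differ.

```python
# Values copied from the settings list; the structure itself is illustrative.
RECOMMENDED = {
    "base": [
        {"cfg": 4.0, "sampler": "euler_a", "scheduler": "normal"},
        {"cfg": 1.5, "sampler": "euler_cfg_pp", "scheduler": "normal"},
    ],
    "cfg_distilled": [
        {"cfg": 1.0, "sampler": "euler_a", "scheduler": "normal"},
    ],
}
SHIFT_RANGE = (3.0, 4.5)  # suggested timestep shift for all variants


def pick_settings(variant, option=0, shift=3.0):
    """Return one recommended preset for a variant, with the shift attached."""
    low, high = SHIFT_RANGE
    if not low <= shift <= high:
        raise ValueError(f"shift {shift} is outside the suggested {low}~{high}")
    settings = dict(RECOMMENDED[variant][option])  # copy, so the table stays intact
    settings["shift"] = shift
    return settings


print(pick_settings("base"))
print(pick_settings("cfg_distilled", shift=4.5))
```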


About CFG distilled model:

  • You can't control the CFG scale or the negative prompt. Both are trained into the model.

  • CFG scale = 1 is a special value: it disables CFG and the negative prompt.

  • Because no forward pass is needed for the negative prompt, generation is 2x faster.
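
Why CFG scale 1 is special follows from the standard classifier-free guidance formula, uncond + scale * (cond - uncond): at scale 1 it reduces to cond, so the negative pass contributes nothing and can be skipped. A minimal sketch with a toy stand-in for the model (`toy_model`, `predict` and `stats` are hypothetical names, not real sampler code):

```python
stats = {"forward_passes": 0}


def toy_model(prompt_value):
    """Stand-in for one forward pass of the diffusion model (hypothetical)."""
    stats["forward_passes"] += 1
    return [prompt_value * 0.5, prompt_value + 1.0]


def guided(cond, uncond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def predict(positive, negative, scale):
    """One sampling step; skips the negative pass when scale is exactly 1."""
    cond = toy_model(positive)
    if scale == 1.0:
        # uncond + 1 * (cond - uncond) == cond, so the second pass is redundant.
        return cond
    return guided(cond, toy_model(negative), scale)


predict(2.0, -1.0, scale=1.0)
print("passes at CFG 1:", stats["forward_passes"])
stats["forward_passes"] = 0
predict(2.0, -1.0, scale=4.0)
print("passes at CFG 4:", stats["forward_passes"])
```

One pass per step instead of two is where the 2x speedup comes from.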

Some training details

The full dataset contains ~70k images, not equally weighted.

Only layers.[2:25] were trained.
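
As an illustration of partial training like this, the sketch below picks parameters by name. It assumes `[2:25]` is read as a Python slice (blocks 2 through 24) and that parameter names look like `layers.<index>.<...>`. Both are assumptions, and `is_trainable` is a hypothetical helper, not the actual training code.

```python
import re

# Matches "layers.<index>." at the start of a name or after a dot.
LAYER_RE = re.compile(r"(?:^|\.)layers\.(\d+)\.")


def is_trainable(param_name, start=2, stop=25):
    """True if the parameter belongs to a block in layers[start:stop]."""
    match = LAYER_RE.search(param_name)
    return bool(match) and start <= int(match.group(1)) < stop


# Made-up parameter names, for illustration only.
names = [
    "layers.1.attention.qkv.weight",
    "layers.2.attention.qkv.weight",
    "layers.24.feed_forward.w1.weight",
    "layers.25.feed_forward.w1.weight",
    "final_layer.linear.weight",
]
for name in names:
    print(name, is_trainable(name))
```

With a PyTorch model, the same predicate could drive `param.requires_grad_(is_trainable(name))` over `model.named_parameters()`.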

Captions are mainly from Gemini. Natural language only, no tags.

Not a LoRA this time?

Multi stage training. No LoRA.


Versions

v0.1 base: no distillation.

v0.1 cd tcfp8: cfg distilled, also a tensorcorefp8 version for ComfyUI.