Introduction
Neta Cat Tower is a text-to-image model fine-tuned from NetaYume Lumina.
This model was trained with the goal of enhancing anime style.
No learning was conducted regarding the addition of characters.
Model Components
Diffusion Transformers (DiT): This model
Text Encoder: Pre-trained Gemma-2-2b
AutoEncoder: Pre-trained Flux.1 dev's AE
"all_in_one" is a single model that are combined with DiT, text encoder and autoencoder.
The model uploaded to Civitai is a "all_in_one" DiT model.
(I changed the uploaded model file from all-in-one to DiT to reduce the download file size.)
If you want to get all-in-one model, please download it from my Hugging Face page.
How to Get Started with the Model
Please refer to the Neta Lumina's model card.
You need to use the webui that Lumina Image 2.0.
ComfyUI
Forge Neo
Recommended settings
Sampler: res_multistep/ euler_ancestral
Scheduler: linear_quadratic
Steps: >=30
CFG (guidance): 4 – 5.5
Resolution: 1024 × 1024, 768 × 1532, 968 × 1322, or >= 1024
Prompt
Please refer to the Neta Lumina Prompt Book
About character knowledge, please refer to the NetaYume Lumina's Civitai page
Training Information
Please refer my Hugging Face page
Acknowledgments
duongve: Thanks to duongve for sharing awesome model.


