I. Introduction
NetaYume Lumina is a text-to-image model fine-tuned from Neta Lumina, a high-quality anime-style image generation model developed by Neta.art Lab. It builds upon Lumina-Image-2.0, an open-source base model released by the Alpha-VLLM team at Shanghai AI Laboratory.
Key Features:
High-Quality Anime Generation: Generates detailed anime-style images with sharp outlines, vibrant colors, and smooth shading.
Improved Character Understanding: Better captures characters, especially those from the Danbooru dataset, resulting in more coherent and accurate character representations.
Enhanced Fine Details: Accurately generates accessories, clothing textures, hairstyles, and background elements with greater clarity.
II. Information
For version 1.0:
This model was fine-tuned from the NetaLumina model, version neta-lumina-beta-0624-raw, using a custom dataset of approximately 10 million images. Training was conducted over 3 weeks on 8× NVIDIA B200 GPUs.
For version 2.0:
This release comes in two variants:
Version 2.0:
I switched the base model to Neta Lumina v1 and trained this model on my custom dataset, which consists of images sourced from both e621 and Danbooru. The dataset is annotated with a mix of languages: 30% of the images are labeled in Japanese, 30% in Chinese (50% using Danbooru-style tags and 50% in natural language), and the remaining 40% in natural English descriptions.
For annotations, I used ChatGPT along with other models capable of prompt refinement to improve tag quality. Additionally, instead of training at a fixed resolution of 1024, I modified the code to support multiscale training, dynamically resizing images between 768 and 1536 during training.
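As a rough illustration of the multiscale resizing described above, here is a minimal sketch; the step size, bucket boundaries, and resampling filter are my assumptions, not details from the actual training code:

```python
# Minimal sketch of multiscale resizing (assumptions: the longer side is sampled
# between 768 and 1536 and both sides are snapped to multiples of 64).
import random
from PIL import Image

def resize_multiscale(img: Image.Image, lo: int = 768, hi: int = 1536, step: int = 64) -> Image.Image:
    # Sample a target length for the longer side, aligned to the step size.
    target = random.randrange(lo, hi + 1, step)
    w, h = img.size
    scale = target / max(w, h)
    # Keep the aspect ratio and round both sides to multiples of `step`.
    new_w = max(step, round(w * scale / step) * step)
    new_h = max(step, round(h * scale / step) * step)
    return img.resize((new_w, new_h), Image.LANCZOS)

# Usage: resized = resize_multiscale(Image.open("sample.png"))
```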
Notes: Currently, I've only evaluated this model using benchmark tests, so its full capabilities are still uncertain. However, based on my initial testing, the model performs quite well when generating images at a resolution of 1312x2048 (as shown in the sample images I provided).
Moreover, in this version the model can generate images at resolutions up to 2048x2048, based on my testing.
Version 2.0 plus:
This model is fine-tuned from version 2.0 on a dataset of higher-quality images, in which each image is annotated with both natural language descriptions and Danbooru-style tags.
The training procedure follows the same overall design as version 2, but is divided into three stages.
In the first two stages, the top 10 layers are frozen, and training is performed separately on the Danbooru-labeled subset and the natural language-labeled subset.
In the final stage, all layers are unfrozen and optimized jointly on the full dataset, which incorporates both Danbooru and natural language annotations.
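For readers who want to reproduce a similar schedule, here is a minimal PyTorch sketch of the staged freezing; it assumes the backbone exposes its transformer blocks as `backbone.layers` and reads "top 10 layers" as the last ten blocks, both of which are assumptions rather than details from the actual training script:

```python
# Sketch of the three-stage freezing scheme (assumption: transformer blocks are
# exposed as `backbone.layers`; "top 10" is read as the last ten blocks).
import torch.nn as nn

def configure_stage(backbone: nn.Module, stage: int, n_frozen: int = 10) -> None:
    layers = list(backbone.layers)
    for i, layer in enumerate(layers):
        # Stages 1 and 2 freeze the top `n_frozen` layers; stage 3 trains everything.
        frozen = stage < 3 and i >= len(layers) - n_frozen
        for p in layer.parameters():
            p.requires_grad = not frozen

# configure_stage(backbone, stage=1)  # Danbooru-tagged subset
# configure_stage(backbone, stage=2)  # natural-language subset
# configure_stage(backbone, stage=3)  # full dataset, all layers trainable
```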
This version reduces the tendency of generated images to look artificial or 'AI-like', while also improving spatial understanding. For instance, the model can position a character on the left or right side of the image according to the prompt (as illustrated in the example). In addition, it provides modest improvements in rendering artist-specific styles.
You can find GGUF quantizations here: https://huggingface.co/Immac/NetaYume-Lumina-Image-2.0-GGUF
III. Model Components:
Text Encoder: Pretrained Gemma-2-2B
VAE: Flux.1 dev's VAE
Image Backbone: Fine-tuned version of NetaLumina's backbone
IV. File Information
This all-in-one file includes weights for the VAE, text encoder, and image backbone. It is fully compatible with ComfyUI and other systems that support custom pipelines.
If you only want to download the image backbone, feel free to visit my Hugging Face page; it includes the separated files along with the .pth files in case you want to use them for fine-tuning.
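If you prefer to extract the individual components from the all-in-one file yourself, a sketch like the one below can work; the filename and key prefixes are assumptions based on common ComfyUI checkpoint layouts, so inspect the keys first and adjust:

```python
# Sketch: split an all-in-one checkpoint by key prefix.
# The filename and prefixes below are assumptions; print the keys to confirm them.
from safetensors.torch import load_file, save_file

state = load_file("netayume_lumina_all_in_one.safetensors")  # hypothetical filename

for key in list(state)[:10]:  # peek at how components are prefixed
    print(key)

prefixes = {
    "vae.": "vae.safetensors",
    "text_encoders.": "text_encoder.safetensors",
    "model.": "backbone.safetensors",
}

for prefix, out_path in prefixes.items():
    subset = {k[len(prefix):]: v for k, v in state.items() if k.startswith(prefix)}
    if subset:
        save_file(subset, out_path)
```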
V. Suggested Settings
For more details and to achieve better results, please refer to the Neta Lumina Prompt Book.
VI. Notes & Feedback
This is an early experimental fine-tuned release, and I’m actively working on improving it in future versions.
Your feedback, suggestions, and creative prompt ideas are always welcome — every contribution helps make this model even better!
VII. How to Run the Model on Another Platform
You can use it through the tensor.art platform. Here is the model link: https://tensor.art/models/898410886899707191
However, to run the model in an optimized way, I recommend using Comfyflow on tensor.art, because the default runner lacks configuration options and runs the model suboptimally. Here is an example workflow you can use on the platform: https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0/blob/main/Lumina_image_v2_tensorart_workflow.json
VIII. Acknowledgments
Big thanks to narugo1992 for the dataset contributions.
Credit to Alpha-VLLM and Neta.art Lab for the fantastic base model architecture.
If you'd like to support my work, you can do so through Ko-fi!