Before diving in, I want to make clear that in this field everyone has their own recipe and workflow for building datasets and training LoRAs. There is no definitive guide: experimentation is how you find your own path. With this simple guide you will end up with a complete dataset ready to train a LoRA for your favourite character. Personally, I focus on photorealistic characters, so the tutorial is oriented in that direction, but the workflow is perfectly adaptable to other styles.
Phase 1 – Choosing the Reference Image
Everything starts with a reference image. Everyone has their own method: some use commercial models, some run local models with prompts, some use dedicated tools. The important thing is to get an image of a bust or a face; it does not need to be photorealistic. In the example below I used an image generated with Artbreeder.

The image can be square or rectangular; it does not matter. This will not be the starting image for the dataset; it is purely a visual reference.
Phase 2 – Creating the Starting Image
Once you have the reference image, you feed it into a workflow that produces a high-quality photorealistic result. The workflow is based on Flux Klein 9B, a model well suited for image editing, combined with a "consistency" LoRA you can download here:
https://huggingface.co/dx8152/Flux2-Klein-9B-Consistency
From my tests, this LoRA significantly improves coherence with the reference image and reduces much of the artificial aesthetic typical of Klein, such as plastic-looking skin and overly standardised facial features. This issue is less pronounced with Flux 2 Pro, but that is a separate discussion; here we want to do everything locally.
The workflow outputs images at 1024×1536. I find a vertical format ideal: including the upper body in the frame gives the dataset better consistency when training.
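If you retouch the image elsewhere and need to bring it back to this format, a minimal Pillow sketch can crop and resize it. The filenames are placeholders; the centering value biases the crop towards the face:

```python
from PIL import Image, ImageOps

# Crop-and-resize any image to the 1024x1536 portrait format used here.
# centering=(0.5, 0.3) keeps the crop biased towards the top of the frame.
img = Image.open("retouched.png")
fitted = ImageOps.fit(img, (1024, 1536), method=Image.Resampling.LANCZOS, centering=(0.5, 0.3))
fitted.save("starting_image.png")
```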
With this workflow you get a result like the one below:

As you can see, the result maintains a good likeness to the reference. Without the consistency LoRA the outputs tend to be far too generic.
Phase 3 – Photo Retouching
If the result satisfies you, move on to the next phase. Otherwise, here are three approaches I often use:
Light retouching in Photoshop: Use Camera Raw and masks to adjust tones for individual areas (hair, eyes, mouth, and skin tone).
Liquify filter: This lets you adjust the size and spacing of facial features. I recommend keeping changes subtle; if you push things too far, the model will tend to standardise the exaggerations during training. Think of it as a small nudge rather than a transformation.
ZIT denoising: If you are not happy with the skin quality, you can run the image through ZIT with a very low denoise value (0.1). This avoids altering the character while smoothing out some of the artefacts Klein can produce. Results vary: sometimes it helps, sometimes it does not. Experiment! A scripted version of this kind of pass is sketched right after this list.
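If you prefer scripting a low-denoise pass over running it in a ComfyUI node, here is a minimal sketch with Hugging Face diffusers. The model id is a placeholder for whatever img2img-capable checkpoint you run locally; I am not assuming ZIT itself loads through diffusers:

```python
import torch
from PIL import Image
from diffusers import AutoPipelineForImage2Image

# Placeholder checkpoint: substitute the img2img model you actually run locally.
pipe = AutoPipelineForImage2Image.from_pretrained(
    "path/to/local-img2img-model", torch_dtype=torch.float16
).to("cuda")

image = Image.open("starting_image.png").convert("RGB")
result = pipe(
    prompt="photo of a woman, natural skin texture",
    image=image,
    strength=0.1,  # very low denoise: smooth artefacts without changing the face
).images[0]
result.save("starting_image_denoised.png")
```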

Phase 4 – Dataset Creation
Now that you have a solid starting image, it is time to build the dataset. The attached workflow can generate around 45 images of the subject in various poses and framings.
Inside the workflow there is a ComfyUI node called CR Prompt List (part of the Comfyroll suite). The prompts were selected after extensive testing and, in my opinion, offer a good balance between close-ups, half-body shots, and full-figure images.
At the top of the node you can add a prefix phrase that will be prepended to every prompt, for example "a tall woman with large breasts". This encourages the model to generate more consistent body proportions. As always, feel free to experiment; you can also leave it blank.
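For illustration, this is all the prefix mechanism amounts to. The prompt strings below are made-up stand-ins; the real list lives inside the node:

```python
prefix = "a tall woman"  # leave empty to disable the prefix

# Illustrative stand-ins for the node's actual prompt list.
prompts = [
    "close-up portrait, neutral expression, studio light",
    "half-body shot, arms crossed, looking at the camera",
    "full-figure shot, standing, plain background",
]

# The node simply prepends the prefix to every entry.
final_prompts = [f"{prefix}, {p}" if prefix else p for p in prompts]
```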
Out of 45 images, some will be duplicates and others will show obvious deformities. Those should be discarded. I prefer a bulk-generation approach: produce a large batch, then hand-pick the best results. Some images can also be flipped and reused (e.g., left and right profile).
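If you want to script the flipping step, a small Pillow sketch does it. The folder name and filename pattern are assumptions about your layout:

```python
from pathlib import Path
from PIL import Image, ImageOps

# Assumed layout: curated images live in dataset/; adjust the glob to your naming.
for path in Path("dataset").glob("*profile*.png"):
    mirrored = ImageOps.mirror(Image.open(path))  # horizontal flip
    mirrored.save(path.with_name(f"{path.stem}_flipped{path.suffix}"))
```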

Phase 5 – Captioning
To generate captions for the images I use TagGUI, a practical and powerful tool for creating easily editable captions. It uses JoyCaption as its engine and the results are very good.
Once the captions are generated, I edit them manually, removing anything superfluous. I generally keep the description of the pose, clothing, and background β everything else I want the model to learn on its own.
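When there are many captions to prune, a short script can strip recurring phrases in bulk. A sketch, assuming one .txt caption per image (the usual layout for LoRA training sets); the phrase list is purely illustrative:

```python
from pathlib import Path

# Illustrative phrases describing traits the model should learn on its own.
REMOVE = ["blue eyes", "long brown hair", "fair skin"]

for txt in Path("dataset").glob("*.txt"):
    caption = txt.read_text(encoding="utf-8")
    for phrase in REMOVE:
        caption = caption.replace(phrase, "")
    # Tidy the double spaces and stray commas left behind by the removals.
    caption = " ".join(caption.split()).replace(" ,", ",").replace(",,", ",")
    txt.write_text(caption, encoding="utf-8")
```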
For example, the caption for the photo below might read:

"Close-up photograph of a woman with a neutral expression. She is wearing a white t-shirt and looking to the right. Simple dark grey background."
That is all! I hope you find this guide useful; feel free to leave feedback.
Attached you will find both workflows: one for creating the starting image and one for generating the full dataset.
