Generation:
No trigger word.
Very simple tag-based prompt example:
detailed, two girls, tongue out, smile, night raveMore useful prompt words to swap out into the prompt that were common in the dataset:
tongue out, puckered lips, laying down, on back, on stomach, ring light, asian, latina, african, 3d, animationStrongly recommended settings:
I encourage you that for every prompt you like you try both the default LoRA that was trained with 3000 steps, and also the 3250 step version. Both seem good and different enough
Using
detailedin the prompt should always be betterGenerating at 1328x1328 should always be better than 1024x1024
Try euler-simple/euler ancestral-simple/lcm-simple, shift 0.5 to 4
I found the settings I personally like while focusing on a fast Qwen Image workflow with the 4 step lightning LoRA, tweaking the generation settings easily shifts around the DALL-E 3-like girl style so you should find the settings for the look you like
My settings:
I actually use the Qwen Image Edit lightning LoRA which gives much more interesting results and I think is the biggest contributor to solving the low seed variance problem of Qwen Image, albeit at a cost of usually slightly grainier image. You can also try different lightning LoRAs.
1328x1328, 4 steps, cfg 1, euler-simple, shift 2.5 (and 0.5/1/2/3.1)
I did minimal testing without lightning, 2.5 cfg 50 steps worked OK, appending the officially recommended , Ultra HD, 4K, cinematic composition. string to the 50 step workflow prompt also seems good.
Limitations:
There was slight hand body horror blur in the dataset which can bleed in.
There can also be some weird wacky clothing and tattoos concept bleed with the 3000 steps model, but I still think that one has better DALL-E 3-like styles and faces.
Concept bleed can happen depending on the prompt and settings of some liquid spilled on the body at times, particularly at 1024x1024, from some more unique images in the training dataset that I assume weren't tagged fully.
Training info:
Used ai-toolkit with this official tutorial and it's settings, with 0.0002 learning rate and 3500 steps. 3500 steps overcooks the lora, while the 3000 and 3250 checkpoints are good.
~100 images, mostly 1024x1024, very few and very simple tag-based captions.
Post in the comments below if you find an interesting generation settings setup.

