Type | |
Stats | 1,130 4,121 |
Reviews | (142) |
Published | Sep 8, 2024 |
Base Model | |
Usage Tips | Strength: 1 |
Trigger Words | Photography |
Hash | AutoV2 91CF7969C9 |
Dramatic light, realistic textures, naturalistic colors, and long depth-of-field photos
in Forge, please enable 'FP16 LoRA mode'
This will probably be this model's final form. Compared to v1.0, v1.2 has more contrast, even more (potential) sharpness in the background, more natural coloring, and more consistently dramatic lighting, while always looking extremely realistic.
Use the word 'Photography' in your prompt: This final version is prone to generate illustrations - really nice ones! - if you don't emphasize 'photography' enough. Be sure to prompt clearly for photographs!
If you want long depth of field/in-focus background: describe your background. Just write up some unique characteristics about what's back there - you'll get more focus on it.
Unlike with most other approaches taken with Flux, this won't mean you need to abandon cinematic/professional photo styles. Things can look both epic and have long depth of field. You can push this model toward snapshot or analog style images, though it's better done with a specific LoRA aimed at such.
If you want a blurred background: describe a foreground subject and refer to the background as sparsely and generically as possible. It will give you a focus on the subject and blur the background most of the time. Or reduce the strength of or don't use this LoRA, LOL. It's got positive impact outside of DOF so for photo prompts, I'm using it by default.
More then DOF: The primary motivation for the LoRA is depth and background detail - but even if you're prompting for a photo with an explicitly shallow DOF, this model brings improved realism of lighting, coloring, and texture as you can see in the comparison below (better viewed full-size at this imgur link)
I started by training Flux on the same dataset as used for the SDXL model, Eldritch Candid Photography - that didn't yield pleasing results at all. So I changed the dataset a bunch, reduced the noise texture a ton, and modified the color/tonal edits. This produced a very nice Beta model.
But I didn't think the imposed tone and grain were doing as much service to this model as to the inspiration XL model. So, I trained a new model on the same dataset (very long DOF images) without any processing to them. On its own, images output from it were kind of harsh and austere looking. So I merged that with the beta model and got a really nice v1.0
But this still wasn't quite what I was looking for. So I tried a few other approaches and merge techniques before deciding on a simple curves adjustment to my dataset to heighten contrast and achieve a consistent dramatic lighting across them all. This one all on its own was close to the final target! But I tested a bunch of simple merges with my various models and settled on what is v 1.2 - I expect to be the final state of this thing for Flux.1 D