Stats | 3,749 |
Reviews | (285) |
Published | Oct 9, 2024 |
Hash | AutoV2 DF6D7BA1A1 |
VERSION LINKS: FP8 • FP16 • NF4 • GGUF Q8_0 / Q6_K / Q5_KM / Q5_KS / Q5_0 / Q5_1 / Q4_KM / Q4_KS / Q4_0 / Q4_1 / Q3_KM / Q3_KS
V2 is an alternative to V1 that gives sharper, good-quality images starting from 4 steps. This version merges Schnell + Finetuned Dev + Hyper using the same formula of variable block ratios as V1, in a refined form. Check the comparison images below!
MAKE SURE TO RENAME YOUR FILES AFTER DOWNLOAD, CIVITAI GIVES THEM WRONG NAMES!
Tested sampler/scheduler for low steps:
ComfyUI: euler sampler, simple or beta scheduler.
Forge: euler or flux realistic sampler, KL Optimal or beta scheduler.
Like Schnell, this model doesn't take a guidance parameter.
The versions with AIO (All in one) in the name include UNET + VAE + CLIP L + T5XXL (fp8). These are also known as Checkpoint or Compact versions.
Using BNB NF4 & GGUF quants in ComfyUI requires installing custom nodes that add special model loaders:
NF4 + Lora support: https://github.com/bananasss00/ComfyUI_bitsandbytes_NF4-Lora
(outdated) NF4 UNET: https://github.com/DenkingOfficial/ComfyUI_UNet_bitsandbytes_NF4
(outdated) NF4 AIO checkpoint: https://github.com/comfyanonymous/ComfyUI_bitsandbytes_NF4
To use the UNET versions, you also need the TEXT ENCODERS and VAE.
If you don't have them, download them from here:
T5XXL - CLIP L: https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main
GGUF T5XXL: https://huggingface.co/city96/t5-v1_1-xxl-encoder-gguf
VAE: https://huggingface.co/black-forest-labs/FLUX.1-schnell/tree/main/vae
Place the model in "models/diffusion_models" or "models/unet", both text encoders in "models/clip", and the VAE in "models/vae".
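The folder layout above can be sketched as a small helper that works out where each downloaded file belongs. This is only an illustration: the function name `place_file`, the `kind` labels, and the example file names are hypothetical and not part of ComfyUI or this model's downloads. By default it only reports the target path and does not move anything.

```python
from pathlib import Path
import shutil

# Destination folders from the instructions above, relative to the ComfyUI root.
DESTINATIONS = {
    "diffusion_model": "models/diffusion_models",  # "models/unet" is the alternative
    "text_encoder": "models/clip",
    "vae": "models/vae",
}


def place_file(src, kind, comfy_root, dry_run=True):
    """Return the destination path for src; move the file only if dry_run is False."""
    if kind not in DESTINATIONS:
        raise ValueError(f"unknown kind: {kind!r}")
    target_dir = Path(comfy_root) / DESTINATIONS[kind]
    target = target_dir / Path(src).name
    if not dry_run:
        target_dir.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(target))
    return target


# Hypothetical file names, shown as a dry run (nothing is moved).
for name, kind in [
    ("my_unet_download.safetensors", "diffusion_model"),
    ("my_t5xxl_download.safetensors", "text_encoder"),
    ("my_vae_download.safetensors", "vae"),
]:
    print(name, "->", place_file(name, kind, "ComfyUI").as_posix())
```

Pass `dry_run=False` only after checking the printed paths against your own installation.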
In ComfyUI, use the standard flux workflow or add 'Load Diffusion Model', 'DualClipLoader' and 'Load VAE' nodes to replace the checkpoint loader and complete the setup.
In Forge, set the option "Diffusion in low bits" to "bnb-nf4"
Thanks to city96 for the GGUF quantization script.
Thanks to reddit user a_beautiful_rhind for the bnb quantization script.
FLUX FUSION VERSION 1
Merge of the Schnell and Dev variants of the Flux.1 model, with an irregular, smoothed ratio for each of the layers.
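The exact merge formula is not published on this page, so the following is only a toy sketch of the general idea of a per-block merge with smoothed ratios. It assumes a simple moving average for the smoothing and a linear blend per block. The function names, the window size, and the tiny lists standing in for weight tensors are all hypothetical.

```python
def smoothed_ratios(raw, window=3):
    """Moving-average smoothing of per-block ratios (edges use a shorter window)."""
    half = window // 2
    out = []
    for i in range(len(raw)):
        lo, hi = max(0, i - half), min(len(raw), i + half + 1)
        out.append(sum(raw[lo:hi]) / (hi - lo))
    return out


def merge_blocks(model_a, model_b, ratios):
    """Blend block by block: (1 - r) * a + r * b, with one ratio r per block."""
    if not (len(model_a) == len(model_b) == len(ratios)):
        raise ValueError("models and ratios must have the same number of blocks")
    merged = []
    for block_a, block_b, r in zip(model_a, model_b, ratios):
        merged.append([(1 - r) * x + r * y for x, y in zip(block_a, block_b)])
    return merged


# Toy "models": three blocks of two weights each (real blocks are large tensors).
model_a = [[0.0, 2.0], [1.0, 1.0], [4.0, 0.0]]
model_b = [[4.0, 6.0], [3.0, 5.0], [0.0, 8.0]]

# Irregular raw ratios, smoothed so neighbouring blocks change gradually.
ratios = smoothed_ratios([0.2, 0.9, 0.4])
print(merge_blocks(model_a, model_b, ratios))
```

A ratio of 0 keeps a block entirely from the first model and 1 takes it entirely from the second. Smoothing avoids abrupt jumps between neighbouring blocks.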
Quick comparison between versions. Prompts and settings at the end.
Recommended use: around 8 steps. If textures like skin look overworked, try lowering the number of steps.
Comparison of V1 QUANTS:
Test parameters: 8 Steps, CFG 3.5, 1536x1536, seed 0
Prompts:
"Extreme closeup, frog face, star crystal structure, intricate designs, glowing hues. Extreme depth of field, celestial light, shimmering details, otherworldly charm, majestic elegance."
"extremely detailed 3d render portraits of a cyber-dragon themed flaming gothic arcane tech woman, cables, arcane tech-dragon inspired design, exposed machinery. the casing is glittery transparent tinted orange, red and black, allowing to see the internals. sophisticated fantasy design. abstract thematic background. extreme depth of field. dragon behind"
"classic pokemon 3DCG illustration. thick outlines. pokemon render style. extremely dynamic composition showcasing the special ability power. flowing pose with extreme closeup on the face in the upper area of the image. intense perspective, in motion movement effects, dynamic impactful vfx, eye catching.. A rock Pokémon with earthquake powers poses dynamically, its body a twisted mass of rugged terrain and molten lava flows. The upper area zooms in on its face, a mask of stone and fury with blazing eyes. Earth shatters beneath its feet as it stomps, unleashing seismic waves that ripple through the abstract background like a fractured canvas. Intense perspective compresses space, conveying unstoppable power. Vibrant colors dance: fiery oranges, electric blues, and smoldering grays. Movement effects blur edges, blurring boundaries between rock and energy. Impactful VFX burst forth in the foreground, echoing the Pokémon's raw force.. masterpiece, professional, best quality, sharp, extreme detail, Hyper-detailed, high-resolution, intricate, vivid. "
"Ethereal female face in 4K ultra closeup, eyes radiating eerie mystical aura with crystalline composition tinted purple-blue hues. Surrounding inferno blazes with dynamic flames and motion effects, creating a vertiginous atmosphere. Extreme depth of field emphasizes surreal otherworldly presence. Glowing eyes at the focal point contribute to haunting mystique, shot from an altered viewing angle emphasizing mysticism. Use Octane and Redshift raytracing for realistic fire and light effects, achieving ultra-realistic 3D render with intense, dreamlike quality."
"Ethereal star princess, diaphanous gown, shimmering stardust, intricate halo, luminous beauty. Night sky, glowing constellations, soft light, dreamy ambiance, mesmerizing allure."
"Sci-fi landscape, derelict alien structure, holographic iridescence, massive metal arches, dark skies, damaged antennas. Ground littered with debris, scattered wreckage, distant moon, dim light."