Step-by-Step Guide Series:
ComfyUI - IMG to IMG Workflow

This article accompanies this workflow: link

Workflow description :

The aim of this workflow is to generate new images from an existing image and a text prompt, all in a single, simple window.

Prerequisites :

📂Files for "base" version :

Model : flux1-dev-fp8.safetensors or flux1-dev-fp16.safetensors
in ComfyUI\models\diffusion_models

CLIP : clip_l.safetensors
in ComfyUI\models\clip

📂Files for GGUF version :

Model : Q8, Q6, Q5, Q4, Q3
in ComfyUI\models\unet

CLIP : Q8, Q6, Q5, Q4, Q3
in ComfyUI\models\clip

📂Files for NUNCHAKU version :

Model : svdq-int4_r32-flux.1-dev.safetensors
in ComfyUI\models\diffusion_models

📂Common Files

Text encoder : t5xxl_fp8_e4m3fn.safetensors or ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF.safetensors

VAE : ae.safetensors
in ComfyUI\models\vae

ANY upscale model :

Realistic : RealESRGAN_x4plus.pth
Anime : RealESRGAN_x4plus_anime_6B.pth

in ComfyUI\models\upscale_models
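
Before opening the workflow, it can help to confirm that the files listed above are where ComfyUI expects them. The following is a minimal Python sketch, assuming the "base" version with the fp8 model and the realistic upscale model. The install location C:\ComfyUI and the helper name find_missing_files are hypothetical, so adjust them and the file list to your own setup.

```python
from pathlib import Path

# Files for the "base" version, relative to the ComfyUI root, taken from the
# list above. Swap entries for the GGUF or NUNCHAKU version as needed.
EXPECTED_FILES = [
    "models/diffusion_models/flux1-dev-fp8.safetensors",
    "models/clip/clip_l.safetensors",
    "models/vae/ae.safetensors",
    "models/upscale_models/RealESRGAN_x4plus.pth",
]


def find_missing_files(comfy_root, expected=EXPECTED_FILES):
    """Return the expected relative paths that do not exist under comfy_root."""
    root = Path(comfy_root)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    # Hypothetical install location; change it to match your machine.
    missing = find_missing_files(r"C:\ComfyUI")
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected files were found.")
```

An empty result only means the files are present in those folders; it does not check that they downloaded completely.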

📦Custom Nodes :

Don't forget to close the workflow and open it again once the nodes have been installed.

Usage :

In this new version of the workflow, everything is organized by color:

Green is what you want to create, also called the prompt,
Yellow is for all the parameters that adjust the image,
Blue is for the model files used by the workflow,
Purple is for LoRA.

We will now see how to use each node:

Write what you want in the “Prompt” node :

Select image format :

Choose the guidance level :

I recommend a value between 3.5 and 4.5. The lower the number, the more freedom the model has to interpret the prompt. The higher the number, the more strictly the image will follow what you asked for.

Choose a scheduler, number of steps and denoise level :

I recommend the normal or beta scheduler and between 20 and 30 steps. The higher the number of steps, the better the quality, but the longer it takes to get an image.

The denoise level controls how much the base image influences the new one. At 0, the output will be exactly the same as the input (useless), and at 1, the output will be completely new (so your image will be ignored).

I recommend starting around 0.7 to 0.8, then lowering it to stay closer to the base image or raising it to move further away from it.
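
To make the interaction between steps and denoise more concrete, here is a small Python sketch of the arithmetic involved. It is a simplified model, assuming that a longer schedule of about steps / denoise steps is built and that only its last steps are actually run on the noised base image, which is how ComfyUI's scheduler appears to handle a denoise below 1. The function name img2img_schedule and the returned fields are illustrative assumptions, not part of ComfyUI's API, and the real implementation works on noise levels (sigmas) rather than plain counts.

```python
def img2img_schedule(steps, denoise):
    """Simplified model of how a denoise value shortens the sampling schedule.

    Assumption: a full schedule of int(steps / denoise) steps is built and
    only the last `steps` of them are run on the noised base image.
    """
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    if denoise == 0.0:
        # Nothing is sampled: the base image comes back unchanged.
        return {"full_schedule": 0, "steps_run": 0, "steps_skipped": 0}
    full_schedule = int(steps / denoise)
    return {
        "full_schedule": full_schedule,
        "steps_run": steps,
        "steps_skipped": full_schedule - steps,
    }


if __name__ == "__main__":
    # Compare a few denoise levels for the same number of steps.
    for denoise in (1.0, 0.75, 0.5):
        print(denoise, img2img_schedule(30, denoise))
```

Under this model, 30 steps at a denoise of 0.75 correspond to a 40-step schedule whose first 10 steps are skipped, which is why lower denoise values keep more of the base image.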

Choose a sampler :