santa hat
deerdeer nosedeer glow
Sign In

Step-by-Step Guide Series: ComfyUI - IMG to IMG

Step-by-Step Guide Series: ComfyUI - IMG to IMG

Step-by-Step Guide Series:
ComfyUI - IMG to IMG Workflow

This article accompanies this workflow: link

Foreword :

English is not my mother tongue, so I apologize for any errors. Do not hesitate to send me messages if you find any.

This guide is intended to be as simple as possible, and certain terms will be simplified.

Workflow description :

The aim of this workflow is to generate images from another one and a text in a simple window.

Prerequisites :

ComfyUI

Model files :

Additionnal nodes :

Don't forget to close the workflow and open it again once the nodes have been installed.

Usage :

Write what you want in the “Prompt” node.

Choose a number of steps :

I recommend between 20 and 30. The higher the number, the better the quality, but the longer it takes to get an image.

Chose denoise level :

You must choose between 0 and 1. 0 will give you exactly the same image, 1 will give you a new image completely unrelated to the given one. I recommend starting at 0.8.

Choose the guidance level :

I recommend between 3.5 and 4.5. The lower the number, the freer you leave the model. The higher the number, the more the image will resemble what you “strictly” asked for.

Set the upscale ratio : (optional)

I recommend leaving it at 2. If you enable upscaling, your image will be recreated with the chosen factor (in this case twice as large, for example).

Choose your model:

Depending on whether you've chosen basic or gguf workflow, this setting changes. I personally use the gguf Q8_0 version.

Choose a FLUX clip encoder and a text encoder :

I personally use the GGUF Q8_0 encoder and the text encoder ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF.

Select an upscaler : (optional)

I personally use 4x_NMKD-Siax_200k.

Now let's take a look at the activation node :

Here the first option must always be activated (otherwise nothing happens ^^").
Activate the number of LoRAs you wish to use. If you don't know what a LoRA is, don't activate any.
Metadata allows you to integrate all generation information into your file. Such as the prompt, number of steps, ...
Finally, the upscaler generates a larger image. But this takes time, so you can choose to disable it.

Below, you'll find a selection of LoRAs according to the number you've chosen to activate :

The first option lets you choose the LoRA.

The second allows you to choose the “strength” of this LoRA. The higher the number, the more the LoRA will be used.

I recommend starting at 1 and reducing or increasing depending on the desired result.

Now select your base image :

The new image will be exactly the same size as the original. So be careful not to make it too big, or the generation will be very slow.

Now you're ready to create your image.

Just click on the “Queue” button to start:

Once rendering is complete, the image appears in the “image viewer” node.

If you have enabled upscaling, a slider will show the base image and the upscaled version.

This guide is now complete. If you have any questions or suggestions, don't hesitate to post a comment.

13

Comments