Sign In

Folder Image Captioner with Qwen-VL WF

Type

Workflows

Stats

41

0

Reviews

Published

Apr 2, 2026

Base Model

Qwen

Hash

AutoV2
F48CCF79C6
default creator card background decoration
bobgus39's Avatar

bobgus39

Folder Image Captioner with Qwen-VL

This ComfyUI workflow allows you to batch caption entire folders of images quickly and efficiently.

It loads images from a selected folder, resizes them if needed, generates high-quality detailed captions using Qwen-VL-Mod (Qwen3-VL-8B-Instruct-Abliterated), and saves both the original image and its corresponding caption file with the exact same filename (e.g., photo.jpg + photo.txt).

Ideal for creating training datasets for LoRAs, character fine-tuning, or any project that requires consistent captions.

Features:

  • Batch processing directly from folder

  • Saves image + caption with the same name

  • High detail and accuracy thanks to Qwen-VL

  • Maintains the same pose, camera angle, lighting, and location from the original image

Required Custom Nodes:

  • ComfyUI Custom Nodes

  • Qwen-VL-Mod (or Qwen3-VL-8B-Instruct-Abliterated)

  • Resize Image v2

  • Load Image Dataset from Folder

  • Save Image and Text Dataset to Folder

Created by: bobgus39 Original profile: https://civitai.com/user/bobgus39

Usage: Simply select your image folder and run the workflow. The captions will respect the original pose, camera angle, lighting, and background/location of each image, making them perfect for training consistent characters or scenes.