Sign In

Qwen Image Edit - Remix

Updated: Mar 29, 2026

base model

Verified:

SafeTensor

Type

Checkpoint Merge

Stats

688

0

Reviews

Published

Mar 29, 2026

Base Model

Qwen

Hash

AutoV2
10CF71B500
default creator card background decoration
FX_FeiHou's Avatar

FX_FeiHou

License:

🌟 Qwen Image Edit Remix

Qwen Image Edit Remix is a high-performance Qwen-based model designed for Image Editing, Image-to-Image, and Text-to-Image tasks.
It focuses on stability, speed, and subject consistency, while still allowing flexible and creative remix-style generation.

The model runs in FP8 precision and includes acceleration LoRA, significantly improving inference speed and reducing VRAM usage without sacrificing output quality.
This model supports NSFW content. Please ensure responsible and lawful usage.


πŸ“¦ Model Variants

πŸ”ΉAIO v2.0 (All-In-One)

The AIO version comes with baked-in CLIP and VAEβ€”ready to use right out of the box.

  • Installation: Download and place the model file into your models\checkpoints folder.

  • Usage: Simply use the Load Checkpoint node in ComfyUI to load the model.

πŸ”Ή Standard Version (without VAE / CLIP)

  • Contains only the core model weights

  • Requires users to load their own VAE and CLIP

  • Recommended for advanced users with existing pipelines or custom components

⚠️ Aside from the inclusion of VAE and CLIP, both versions are identical in structure, performance, and output quality
⚠️ Both versions run in FP8 precision and include the same acceleration LoRA


πŸŽ‰ AIO v2.0 Update Notes

This v2.0 release brings several major upgrades to visual quality and control:

  • 🧍 Enhanced Human Pose Accuracy: Significantly improves skeletal structure in complex dynamic poses. Limbs are generated much more naturally, bidding farewell to awkward anatomy.

  • πŸ§‘β€πŸ€β€πŸ§‘ Reduced Distortion in Multi-Person Scenes: Specially optimized for multi-subject interactions. Effectively minimizes limb blending, dislocations, and abnormal limb counts when generating multiple people.

  • 🎯 Increased Prompt Sensitivity: The model now understands and responds to your prompts much more precisely, keenly capturing and reproducing the specific details and styles you ask for.


✨ Core Capabilities

  • Image Editing
    Precise instruction-based editing of input images, including character, clothing, background, style, and detail adjustments.

  • Image-to-Image (I2I)
    Redraw, enhance, or stylize images while preserving the original composition and subject structure.

  • Text-to-Image (T2I)
    Generate images purely from text prompts without requiring any input image.

  • Remix-oriented Generation
    Designed for re-creation rather than full regeneration, maintaining key visual elements while introducing new creative variations.

  • Efficient Inference
    FP8 + acceleration LoRA provides a strong balance between speed, VRAM efficiency, and visual quality.


  • Sampler
    euler_ancestral

  • Scheduler
    beta

This combination offers a good balance between stability, detail preservation, and overall visual coherence, especially for image editing and remix workflows.


🎯 Use Cases

  • AI image editing and retouching

  • Image-to-image redraw and style transfer

  • Text-to-image content creation

  • Outfit, pose, and scene modification

  • Character-consistent remix and iteration

  • Posters, covers, and visual concept design

  • ComfyUI / Diffusers image generation workflows