Sign In

Z-image-hires Workflow

26

156

9

Updated: Nov 29, 2025

tool

Type

Workflows

Stats

156

0

Reviews

Published

Nov 29, 2025

Base Model

ZImageTurbo

Hash

AutoV2
E26DF5CCE4
default creator card background decoration
oosayam's Avatar

oosayam

🚀 Z-Image Turbo FP8 Hires Workflow (Low VRAM Optimized)

This is a high-efficiency ComfyUI workflow designed specifically for Low VRAM users. By utilizing FP8 Quantized Models and Latent Upscale technology, it generates high-resolution images (1024x1792) rapidly while maintaining minimal resource usage.

✨ Key Features

  • Extreme Low VRAM Usage: Full FP8 pipeline (Model & Text Encoder) to drastically reduce memory footprint.

  • Lightning Fast: Optimized for Turbo models and efficient sampling steps.

  • Hires Fix Pipeline: Utilizes Latent Upscale + 2nd Pass KSampler to ensure crisp details without heavy VRAM cost.

  • AuraFlow Architecture: Optimized using the ModelSamplingAuraFlow node.


📂 Models Required & Downloads

To ensure the workflow functions correctly, please download the following models and place them in your respective ComfyUI folders:

1. UNet Model (Place in models/unet/)

2. CLIP / Text Encoder (Place in models/clip/)


⚙️ Key Settings & Configuration

This workflow operates on a 2-Pass system. Please adhere to the following settings for the best results:

🔹 Phase 1: Base Generation

  • Latent Size: Generates at a lower initial resolution (e.g., 512x896) to save compute resources.

🔹 Phase 2: Latent Upscale

  • Upscale Method: Uses LatentUpscaleBy.

  • Scale Factor: Default is 2 (resulting in a final output of 1024x1792).

🔹 Phase 3: Hires Fix (Refiner)

This step is crucial for image clarity and detail:

  • Sampler: res_multistep (Highly Recommended).

  • Denoise: Recommended range 0.5 - 0.6.

    • < 0.5: Changes are minimal; the image may remain slightly blurry.

    • > 0.6: Adds more detail, but setting this too high may alter the image structure or cause hallucinations.


📊 Performance Benchmark

Data based on actual testing:

GPUOutput ResolutionTimeNVIDIA RTX 5070 Ti1024 x 17928 ~ 9 sec


📝 Usage Tips

  1. Memory Management: If you are extremely limited on VRAM, ensure no other large models are loaded in the background.

  2. Prompting: Since this uses the Qwen text encoder, it has strong natural language understanding. Detailed, sentence-based prompts work very well.

  3. Troubleshooting: If you notice the image details breaking or looking "burnt," try slightly lowering the denoise value in the second KSampler.