home models images videos 3D Models articles comics challenges updates shop

z_image_turbo_nvfp4

Name: z_image_turbo_nvfp4
Rating: 5 (15 reviews)
Author: Tukanazo1966

473

Updated: May 5, 2026

base model

Download

1 variant available

nf4 SafeTensor

z_image_turbo_nvfp4.safetensors

4-bit normalized (pruned) • 4.2 GB

Verified: 3 months ago

Download (4.2 GB)

Required Components

You need these files to run this model. We'll show the best match for your preferences.

Z Image Base

Text Encoder

External

Base • qwen_3_4b_fp4_mixed.safetensors

3.24 GB

Downloads your preferred variants

Details

Type

Checkpoint Trained

Stats

473

Reviews

Positive

(15)

Published

Apr 28, 2026

Base Model

ZImageTurbo

Hash

AutoV2

A553C889DB

Tensors

default creator card background decoration

46.9K

1.1K

261

Tukanazo1966

Joined Nov 28, 2023

License:

Apache 2.0

Originally Posted: https://huggingface.co/Comfy-Org/z_image_turbo

🚀 Z-Image Turbo [NVFP4] – The Speed of Light Meets High-Fidelity Realism

Welcome to the next generation of efficiency. This is a specialized quantization of Z-Image Turbo using the cutting-edge NVFP4 (4-bit Floating Point) format. Optimized specifically for NVIDIA Blackwell (RTX 50-series) architecture, this model delivers unmatched inference speeds without sacrificing the photorealistic soul of the original weights.

Why NVFP4?

Unlike standard 4-bit quantizations, NVFP4 utilizes a dual-level scaling strategy (FP8 micro-blocks + FP32 tensor scaling). This minimizes quantization errors, preserving the fine textures, skin details, and lighting accuracy that define high-end diffusion models, all while slashing VRAM usage.

✨ Key Features

Insane Speed: Generate 1.5MP to 2MP images in seconds.
VRAM Efficient: Extremely lightweight footprint (approx. 4.2GB), making it accessible for GPUs with lower memory while flying on high-end cards.
Zero-CFG Distilled: Designed for 8-10 step generations with no CFG overhead.
Bilingual Mastery: Exceptional at rendering both English and Chinese text within the image, also Spanish
Raw Realism: Maintains the "organic" look—perfect for those who hate the "AI plastic" aesthetic.

🛠 Recommended Settings

Steps: 8 to 12 (10 is the sweet spot).
CFG Scale: 1.0 (The model is distilled; higher values are not needed).
Sampler: Euler a & bong tangent.
Resolution: Optimized for 1024x1024 and 1536x1536.
Architecture: Works best in environments supporting CUDA 13 and Forge UI.

Pro Tip: For the best results, use descriptive, natural language prompts. This model excels at "unposed" photography, cinematic lighting, and complex material textures like weathered skin or textile weaves.