Sign In

z_image_turbo_nvfp4

Updated: Apr 28, 2026

base model

Download

1 variant available

nf4 SafeTensor

4-bit normalized (pruned) • 4.2 GB

Verified:

You need these files to run this model. We'll show the best match for your preferences.

Text Encoder

qwen_3_4b_fp4_mixed.safetensors • 3.24 GB

Verified:

Downloads your preferred variants

Type

Checkpoint Trained

Stats

141

Reviews

Published

Apr 28, 2026

Base Model

ZImageTurbo

Hash

AutoV2
A553C889DB

License:

Apache 2.0

i Don't own the model

🚀 Z-Image Turbo [NVFP4] – The Speed of Light Meets High-Fidelity Realism

Welcome to the next generation of efficiency. This is a specialized quantization of Z-Image Turbo using the cutting-edge NVFP4 (4-bit Floating Point) format. Optimized specifically for NVIDIA Blackwell (RTX 50-series) architecture, this model delivers unmatched inference speeds without sacrificing the photorealistic soul of the original weights.

Why NVFP4?

Unlike standard 4-bit quantizations, NVFP4 utilizes a dual-level scaling strategy (FP8 micro-blocks + FP32 tensor scaling). This minimizes quantization errors, preserving the fine textures, skin details, and lighting accuracy that define high-end diffusion models, all while slashing VRAM usage.

✨ Key Features

  • Insane Speed: Generate 1.5MP to 2MP images in seconds.

  • VRAM Efficient: Extremely lightweight footprint (approx. 4.2GB), making it accessible for GPUs with lower memory while flying on high-end cards.

  • Zero-CFG Distilled: Designed for 8-10 step generations with no CFG overhead.

  • Bilingual Mastery: Exceptional at rendering both English and Chinese text within the image, also Spanish

  • Raw Realism: Maintains the "organic" look—perfect for those who hate the "AI plastic" aesthetic.

  • Steps: 8 to 12 (10 is the sweet spot).

  • CFG Scale: 1.0 (The model is distilled; higher values are not needed).

  • Sampler: Euler a & bong tangent.

  • Resolution: Optimized for 1024x1024 and 1536x1536.

  • Architecture: Works best in environments supporting CUDA 13 and Forge UI.

Pro Tip: For the best results, use descriptive, natural language prompts. This model excels at "unposed" photography, cinematic lighting, and complex material textures like weathered skin or textile weaves.