Qwen 3_4_B Trained Text Encoder for Z-Image
FP32
Full Finetune at FP32 (Full Model Finetune - All Parameters & All layers)
FP32 Finetune of QWEN3_4b focusing on describing human features SFW/NSFW captions.
Can be run in FP32 with no time loss on most machines that use CPU offloading.
BF16
Full Finetune at BF16 (20 Layers)
Long Text descriptions 500-1000 token length focusing on describing human features.
For use with Z-Image or Z-Image Turbo
Comparison Images showing QWEN base VS Human Corpus HERE


