Long CLIP (Distilled)
Teacher/Student Distillation from 248/224 token length to projected 77
Pruned for use in SDXL, FLUX, SD 1.5, SD3, Hunyaun Video
DO NOT USE IN HI-DREAM
Some of the top onsite models built with FP32 Distilled CLIP/FP32 VAE
HiDream CLIP has been trained on a distillation set and the 248 and 224 token lengths reduced to 77 based on the pooled vision/text model output.