I was trying to create Qwen-Image Workflow ASAP ;)
get GGUF: https://huggingface.co/city96/Qwen-Image-gguf/tree/main
get VAE and CLIP: https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main/split_files
Update: there are at least two Abliterated (Uncensored to a degree) GGUF versions of CLIP text-encoder for this model:
1) https://huggingface.co/mradermacher/Qwen2.5-VL-7B-Instruct-abliterated-GGUF/tree/main
2) https://huggingface.co/mradermacher/Qwen2.5-VL-7B-Abliterated-Caption-it-GGUF/tree/main <--- this one is my personal favorite!
3) for 4-8 steps lora go to this link: https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main
P.S. The model is very sensitive to photography settings. Try to be careful with the depth of field and shallow focus in your prompts.