Evolution of Text-to-Image 2025.zip

Go to year: <a target="_blank" rel="ugc" href="https://civitai.com/articles/9188">2022</a> • <a target="_blank" rel="ugc" href="https://civitai.com/articles/9193">2023</a> • <a target="_blank" rel="ugc" href="https://civitai.com/articles/9209">2024</a> • 2025<hr /><h1 id="introduction">Introduction</h1>Most models are now good at rendering text and are less prone to mangling human anatomy. A major addition this year was the arrival of multiple models for editing images with text prompts.<br />Prompts and project details can be found at the bottom of the article. High-resolution versions of the comparison image grids are in this article's attachment.<h1 id="the-models">The Models</h1><h2 id="lumina-2.0">Lumina 2.0</h2>January 2025<br />Lumina is a more lightweight release than most of those that follow it this year. It seems like a marginal improvement over base SDXL in some ways, but it can't compete with most current models. It is better than SDXL at following instructions, but it still struggles with anatomy and text.<edge-media url="a4facbc4-cda1-437b-bcd7-64ef8bc64476" type="image" filename="250100_Lumina Image 2_small.jpg"></edge-media>Lumina 2: ✅ free download (<a target="_blank" rel="ugc" href="https://github.com/Alpha-VLLM/Lumina-Image-2.0">GitHub</a> | Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/Lumina_Image_2.0_Repackaged/tree/main/all_in_one">all-in-one</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/Lumina_Image_2.0_Repackaged/tree/main/split_files/diffusion_models">diffusion variant</a>)<h2 id="firefly-image-4">Firefly Image 4</h2>April 2025<br />Adobe released Firefly Image 4 and Firefly Image 4 Ultra. Based on my tests, the Ultra version seems to heavily favor realism over stylized images. 
Overall, it seems to be a big improvement over Firefly 3, but it's probably not worth choosing unless you're already in the Adobe ecosystem.<edge-media url="cf4f86d3-cf6f-413f-b4d9-03e417dc4aa7" type="image" filename="250400_FFLY4_Firefly Image 4_small.jpg"></edge-media>Firefly Image 4: ❌ free download<edge-media url="efa9ad81-d3a3-4f74-9d88-c44b25976189" type="image" filename="250400_FFLY4_Firefly Image 4 Ultra_small.jpg"></edge-media>Firefly Image 4 Ultra: ❌ free download<h2 id="gpt1-image">GPT Image 1</h2>April 2025<br />GPT Image 1 has great prompt adherence, is good at text, and can make aesthetically pleasing images even from minimal prompts.<edge-media url="11db653d-a811-4923-afb4-7056e478ab2f" type="image" filename="250400_GPT1_GPT1 Image Medium_small.jpg"></edge-media>GPT Image 1: ❌ free download<h2 id="hidream-i1">HiDream-I1</h2>April 2025<br />HiDream has great prompt adherence, is good with text, and leans toward stylized images. Both the Dev and Full versions are available to run locally.<edge-media url="22e9f839-565a-45f9-89b5-ee79671ea375" type="image" filename="250400_HIDR_HiDream-I1 Dev_small.jpg"></edge-media>HiDream-I1 Dev: ✅ free download (<a target="_blank" rel="ugc" href="https://civitai.com/models/1562709/hidream">CivitAI</a> | Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/tree/main/split_files/diffusion_models">fp8/bf16</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/city96/HiDream-I1-Dev-gguf/tree/main">gguf</a>)<edge-media url="0af6a1db-c161-419c-825b-1913e8a58c85" type="image" filename="250400_HIDR_HiDream-I1 Full_small.jpg"></edge-media>HiDream-I1 Full: ✅ free download (<a target="_blank" rel="ugc" href="https://civitai.com/models/1562709/hidream">CivitAI</a> | Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/tree/main/split_files/diffusion_models">fp8/bf16</a>)<h2 id="midjourney-7">Midjourney 7</h2>April 2025<br />Midjourney's new model emphasizes 
personalization that adapts to the user's preferences. This means the same prompts might give you images that look very different from mine.<edge-media url="d5d69194-0469-47d0-bbde-19ddb93992af" type="image" filename="250400_MJD7_Midjourney 7_small.jpg"></edge-media>Midjourney 7: ❌ free download<h2 id="seedream-3">Seedream 3</h2>April 2025<br />Seedream 3 is ByteDance's text-to-image model. It's good with text and aesthetics, but it sometimes seemed to prioritize style over following my instructions. This version is pretty good, but ByteDance followed it with two more updates within a few months.<edge-media url="cc1dcd37-8a19-46af-b5c4-2c2d4488aca7" type="image" filename="250400_SEED3_Seedream 3_small.jpg"></edge-media>Seedream 3: ❌ free download<h2 id="google-imagen-4-(reg-and-ultra)">Google Imagen 4 (Regular and Ultra)</h2>August 2025<br />This model from Google seems to favor stylized images. It's good at prompt adherence and great with text.<edge-media url="68c9d470-da68-48e9-86cb-50e8b3ab69e6" type="image" filename="250800_GI4_Google Imagen 4_small.jpg"></edge-media>Google Imagen 4: ❌ free download<edge-media url="73594037-acb1-416d-b4aa-cdf49af66782" type="image" filename="250800_GI4_Google Imagen 4 Ultra_small.jpg"></edge-media>Google Imagen 4 Ultra: ❌ free download<h2 id="nano-banana">Nano Banana</h2>August 2025<br />Also known as Gemini 2.5 Flash Image/Gemini 3 Pro Image, this is another model under the Google umbrella. It's good at prompt adherence, text, and aesthetics. It can be used both to generate images directly and to edit existing images.<edge-media url="6458ee28-839d-456b-8183-5c803c9466ec" type="image" filename="250800_NB_Nano Banana_small.jpg"></edge-media>Nano Banana: ❌ free download<h2 id="qwen-image">Qwen Image</h2>August 2025<br />Qwen Image is currently a community favorite and is released under an Apache license. It's great at prompt adherence, text, anatomy, and styles. 
It's resource-intensive, but variants have been released that can run on graphics cards with less VRAM.<edge-media url="b7565a29-d317-495e-b4f0-fdfe064419ff" type="image" filename="250800_Qwen Image_small.jpg"></edge-media>✅ free download (<a target="_blank" rel="ugc" href="https://civitai.com/models/1864281/qwen-image">CivitAI</a> | Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main/split_files/diffusion_models">fp8/bf16</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/city96/Qwen-Image-gguf/tree/main">gguf</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/nunchaku-tech/nunchaku-qwen-image/tree/main">nunchaku</a>)<h2 id="seedream-4.0">Seedream 4.0</h2>September 2025<br />This is ByteDance's second Seedream release this year, but not its last.<edge-media url="32e69a39-c145-4437-9a0f-fea2d1ae1783" type="image" filename="250900_SEED3_Seedream 4_small.jpg"></edge-media>Seedream 4.0: ❌ free download<h2 id="hunyuan-image-2.1">Hunyuan Image 2.1</h2>September 2025<br />Hunyuan can be run locally, but it is resource-intensive. It is good with text and prompt adherence. It can create aesthetically pleasing images, but they tended to be fairly basic without extra guidance. Without a refiner, the output tends to be blurry and lack fine details.<edge-media url="e6644e51-eef1-4943-93f3-8e5606bb35e1" type="image" filename="250900_Hunyuan Image 2-1_small.jpg"></edge-media>✅ free download (Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/tencent/HunyuanImage-2.1">official</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/QuantStack/HunyuanImage-2.1-Distilled-GGUF/tree/main">gguf</a>)<h2 id="kandinsky-5">Kandinsky 5</h2>November 2025<br />Kandinsky is a Russian model, and if you use it long enough that may become obvious, since it sometimes randomly adds Russian text to images. 
It's one of the more inconsistent models I tested; sometimes it's great at prompt adherence, and other times it ignores basic requests and does its own thing.<edge-media url="5a8e4842-8867-45fc-9af7-54636785bba5" type="image" filename="251100_Kandinsky 5_small.jpg"></edge-media>✅ free download (<a target="_blank" rel="ugc" href="https://huggingface.co/collections/kandinskylab/kandinsky-50-image-lite">Hugging Face</a>)<h2 id="z-image-turbo">Z-Image Turbo</h2>November 2025<br />Z-Image Turbo is one of the few new models released this year that can run on less VRAM and still offer significant improvements over the older Stable Diffusion models. It's good with text and realism and is fairly good at prompt adherence (in my experience, good but not as good as some of the other current models).<edge-media url="53e34ad4-a411-493f-be67-266a82a1e9c1" type="image" filename="251100_Z Image Turbo_small.jpg"></edge-media>✅ free download (<a target="_blank" rel="ugc" href="https://civitai.com/models/2168935/z-image-turbo">CivitAI</a> | Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_files/diffusion_models">bf16</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/silveroxides/Z-Image-Turbo-quants-plus/blob/main/Z-Image-Turbo-Plateau-fp8mixed.safetensors">fp8</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/tree/main">gguf</a>)<h2 id="flux-2">Flux 2</h2>November 2025<br />The latest in the Flux lineup has been released, and it's quite resource-hungry. 
I haven't had much time to experiment with it yet.<edge-media url="5db169f5-ff44-40de-a968-a35863bc8db7" type="image" filename="251100_Flux Dev 2-0_small.jpg"></edge-media>Flux Dev 2.0: ✅ free download (<a target="_blank" rel="ugc" href="https://civitai.com/models/2165902/flux2">CivitAI</a> | Hugging Face: <a target="_blank" rel="ugc" href="https://huggingface.co/silveroxides/FLUX.2-dev-fp8_scaled/tree/main">fp8</a>, <a target="_blank" rel="ugc" href="https://huggingface.co/city96/FLUX.2-dev-gguf/tree/main">gguf</a>)<edge-media url="b6e57a2e-e61f-455f-abaa-bcbe45d07e2f" type="image" filename="251100_Flux Pro 2-0_small.jpg"></edge-media>Flux Pro 2.0: ❌ free download<h2 id="seedream-4.5">Seedream 4.5</h2>November 2025<br />And yet another Seedream release. I haven't used it much, but it appears to focus on creating stylized images.<edge-media url="340fbdbb-6ed9-4b2b-8ba6-c475585c37c4" type="image" filename="251200_Seedream 4-5_small.jpg"></edge-media>Seedream 4.5: ❌ free download<h1 id="project-details">Project Details</h1><h2 id="disclaimer">Disclaimer</h2>I'm not an insider with special access to anything, nor a programmer who understands how all this works under the hood. I took some time to research, but the information comes from what I found online, and I can't guarantee everything is accurate. This is a work in progress; I'm still filling in missing information.<br />Also note that this is only a comparison of base models. Some models can produce significantly better images by using trained checkpoints, styles, presets, or detail enhancers.<h2 id="criteria">Criteria</h2><ul><li>Must still be publicly accessible in 2025 without a complicated setup.</li><li>I'm trying to include no more than two versions of each release: a "top-of-the-line" version and the smaller version released for consumers. 
I've left out variants such as turbo, editing, and less resource-intensive versions.</li></ul><h2 id="process">Process</h2><ul><li>I chose 15 prompts that cover photorealism, art styles, people, animals, objects, specific instructions, open-ended short prompts, text, and abstract concepts.</li><li>All images come from the first generation set, and I picked from no more than four images per prompt.</li><li>When possible, I used the same seed, which can show differences between minor versions of the same model.</li><li>I used the recommended settings for each model or the default offered online.</li><li>I didn't use additional styles or presets.</li></ul><h2 id="prompts">Prompts</h2><ul><li>african hydropunk princess</li><li>artificial intelligence</li><li>astronaut exploring an alien planet</li><li>overhead view of a breakfast plate with eggs, toast, strawberries, coffee, and a fork</li><li>exterior of a cafe watercolor painting</li><li>person wearing cyberpunk accessories in a high tech neon city</li><li>druid man character design</li><li>ethereal fairy in the style of oil painting</li><li>graphic design logo with fennec fox and succulents and text "Desert Design"</li><li>man and a woman in love</li><li>photo of a deer in an enchanted forest with cinematic lighting</li><li>Photo portrait of a woman with long black curly hair in natural light. She's wearing a fashionable purple blouse, a gold necklace with a locket, and hoop earrings. 
Bokeh background.</li><li>pixel art city street scene with shops and pedestrians at night</li><li>red potion bottle with text "health" on the left, blue potion bottle with text "mana" in the middle, green potion bottle with text "poison" on the right, on a wooden table in a dark alchemist's laboratory, in the style of a detailed digital painting</li><li>woman lying on the grass</li></ul><h2 id="article-updates">Article Updates</h2><hr />Go to year: <a target="_blank" rel="ugc" href="https://civitai.com/articles/9188">2022</a> • <a target="_blank" rel="ugc" href="https://civitai.com/articles/9193">2023</a> • <a target="_blank" rel="ugc" href="https://civitai.com/articles/9209">2024</a> • 2025

Evolution of Text-to-Image: 2025
