<h1 id="sd-3.5-will-surpass-flux-vttj9tf1c">SD 3.5 will surpass FLUX</h1>FLUX has a much larger UNET even compared to SD 3.5 Large but its ability to do something simple like draw a different face is abysmal.FLUX uses a large natural language model to tokenize with a 160MB CLIP (CLIP-L) that came out years ago.SD3.5 uses that same natural language model (T5xxl) and CLIP-L also but has the benefit of CLIP-GIf you want to try out SD 3.5 with Google FLAN T5xxl:<ul><li><a target="_blank" rel="ugc" href="https://civitai.com/models/900327/sd-35-medium-google-flan">Medium</a></li><li><a target="_blank" rel="ugc" href="https://civitai.com/models/882666/sd35-large-google-flan">Large</a></li></ul><h3 id="compare-the-same-prompt:-j0lldukp6">Compare the same prompt:</h3>"a 4k photo with every possible detail of the most beautiful female in the world, combine ever ethnicity and skin color but only ages 18-26yo and female with feminine features, recreate Eve from the bibles account in genesis of the first female"<img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/11551bc9-75e6-4a7c-a72a-2e756cb06766/width=525/11551bc9-75e6-4a7c-a72a-2e756cb06766.jpeg" /><img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/5671739a-87a1-4540-aa4c-96d9956ed00d/width=525/5671739a-87a1-4540-aa4c-96d9956ed00d.jpeg" /><img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/d517b35e-8b0e-47a3-8a95-055221d467cb/width=525/d517b35e-8b0e-47a3-8a95-055221d467cb.jpeg" /><img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/b25b53fa-a8ea-412d-a079-2cf81e6a0c65/width=525/b25b53fa-a8ea-412d-a079-2cf81e6a0c65.jpeg" /><img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/00d0c747-8d81-4c5b-9879-2f66626070cd/width=525/00d0c747-8d81-4c5b-9879-2f66626070cd.jpeg" />Same prompt but great variation. Ask that of FLUX and you get the same face over and over.<h3 id="once-sd3.5-hits-onetrainer-we-will-make-something-great-m2006pfyx">Once SD3.5 hits onetrainer we will make something great</h3>Even if SD 3.5 is just XL with a natural language element the benefit is allowing multiple language to interact with a tokenizer in their native language will benefit the AI art community.Not having to "speak" to the clip in tokens also allows for more variation just based on the random and unknown nature of the text encoder/tokenizer interaction.

ComfyUI_temp_nitez_00023_.png

SD 3.5 will surpass FLUX

sexual situations

physical violence

disturbing

male nudity

hanging

hate symbols

nazi party

weapon violence

female swimwear or underwear

male swimwear or underwear

partial nudity

white supremacy

adult toys

graphic male nudity

illustrated explicit nudity

nudity

pg-13

emaciated bodies

exposed female nipple

female nudity

incest

sexy

revealing clothes

graphic violence or gore

graphic female nudity

convenient censoring

blowjob

sexual activity

sexual intent

suggestive

corpses

wide hips

peeing

oral invitation

undressed

male underwear

female swimwear

genitals

female underwear

thick thighs

breasts out

strapless leotard

vore

breast out

one breast out

huge breasts

gigantic breasts

huge butt

covered nipples

hair over breasts

no panties

sitting on face

anal

dildo riding

downblouse

oral

porn

futanari

hentai

nude

lingerie

nsfw

child on child

self injury

extremist

hate speech

diapers

urine

scat

latex clothing

swimwear

bukkake

fellatio

cumshot

implied fellatio

eat_cum

cumdrip

cum in pussy

cum on face

after fellatio

cum on hair

cum on body

cum on tongue

cum on hands

cum in mouth

triple fellatio

autofellatio

fucked silly

cum on pussy

pov fellatio

SD 3.5 will surpass FLUX

SD 3.5 will surpass FLUX

Compare the same prompt:

Once SD3.5 hits onetrainer we will make something great

Comments