
From Noise to Illustrations: How I Generate AI Pictures with Stable Diffusion


Introduction

Hi everyone! I've been posting on CivitAI for a while now (since February of this year), and one of the questions I get all the time is: "Why do my pictures look different from the examples?" or "I can't replicate your example pictures." In this article I want to explain my workflow and why I create pictures this way.

Why?

tl;dr: I have fun creating my artworks, and I want to show the full capabilities of my models.

The purpose of this post is to share how my AI art creation process has evolved over time. AI art has undergone many changes since its inception, with new tools and techniques emerging constantly. My artworks reflect this evolution, as you can see from the difference between my first and latest models. In the beginning, I used to generate simple images at 512x768 resolution, without any further editing. Since then, I have created thousands of images and developed a more complex workflow, which I will explain in the following paragraphs.

Many creators publish models with unedited pictures for the sake of transparency. I respect their choice, but I think this does not reveal the full potential of a model. I know that many SD users prefer to prompt without using any image editing software, but if you compare the basic generations, you will see that they are very similar and do not show the full range of the model.

I want to clarify that this is only my opinion. It is not a universal truth and I do not expect everyone to follow it.

The generation process

When I generate a picture, there are different approaches I take:

  • I start from a handwritten prompt (this is something I do only if I am working on a specific artwork)

  • I use my Ranbooru extension. This is what I mostly do, as it is the fastest way to create some interesting pictures to showcase my models.

  • I use ControlNets combined with the Ranbooru extension. One of my favorite ways of starting a picture is to use an IP-Adapter just to create the basic 512x768 generation

Let's have a look at 3 different examples generated with these approaches:

Approach #1

[Image: 00297-551863209.png]

The thumbnail for my Mistoon_Amethyst v2 model has been generated using a manual prompt:

girl,(hotify:1.2),sitting,chair,kitchen,food,table,short_hair,bob cut,hair clip,freckles,suspenders,red_hair,cleavage ,(masterpiece,detailed,highres:1.4)

You'll find these mostly on my older samples.

Approach #2

[Image: 00003-733341738.png]

This is one of the samples available for my Mistoon_Anime model, and here's the prompt:

detailed_background,indoors,aqua_hair,holding_phone,hairclip,hair_between_eyes,guitar,beanie,straight-on,looking_at_viewer,eyes_visible_through_hair,electric_guitar,closed_mouth,effects_pedal,headphones,upper_body,hat,hoodie,instrument,solo,aqua_eyes,sticker,hood_down,phone,parental_advisory,nail_polish,smartphone,cellphone,jewelry,1girl,white_headwear,bandaid_on_nose,patch,bandaid_on_face,short_hair,original,bandaid,highres,sidelocks,holding,grey_hoodie,circuit_board,long_sleeves,aqua_nails,necklace,standing,print_hoodie,ring,hair_ornament,yorugata_mao,hood,bandaid_on_cheek

As you can see, the prompt contains a lot of tags that don't even appear in the picture. This usually means I've used Ranbooru to pull them from some booru (usually Gelbooru).
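To give you an idea of what that means in practice, here's a minimal sketch of the idea behind Ranbooru (not the extension's actual code): grab a random post from Gelbooru's public JSON API and turn its tag list into an SD prompt. The search tags and limit below are just example values.

import random
import requests

# Minimal sketch of the Ranbooru idea (not the extension's real code):
# fetch a batch of posts from Gelbooru's public JSON API, pick one at
# random, and turn its space-separated tags into a comma-separated prompt.
GELBOORU_API = "https://gelbooru.com/index.php"

def random_booru_prompt(search_tags="1girl solo", limit=50):
    params = {
        "page": "dapi", "s": "post", "q": "index",
        "json": 1, "limit": limit, "tags": search_tags,
    }
    data = requests.get(GELBOORU_API, params=params, timeout=30).json()
    # Depending on the API version, the posts are either the whole payload
    # or nested under the "post" key.
    posts = data["post"] if isinstance(data, dict) else data
    post = random.choice(posts)
    return ",".join(post["tags"].split())

print(random_booru_prompt())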

Approach #3

[Image: 00173.png]

This is the thumbnail for my Mistoon_Diamond model. This artwork uses the 3rd approach and took me about 6-7 hours to make. This is the original prompt:

hotify,from below,black hair,pink panties,wings,underwear only,multiple girls,white wings,underwear,highres,yuigahama yui,panties,yukinoshita yukino,blue bra,medium hair,pink bra,looking at another,2girls,pink choker,female focus,brown hair,blue panties,ahoge,cleavage,choker,matching outfits,medium breasts,eye contact,blue choker,see-through,yahari ore no seishun lovecome wa machigatteiru.,bra,angel wings,breasts,lingerie,ass,long hair

If you try to use the same prompt with the Mistoon_Diamond model, however, you won't get anything similar to this picture. What you get is this:

So how did I generate the actual picture? The answer is that I used the IP-Adapter combined with a beautiful artwork I found on Gelbooru. If you pass that picture to the ControlNet, you'll get something like this:

That result is similar in composition to the one I originally made.
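I do this inside the UI with the ControlNet extension, but if you prefer code, here's roughly the same idea as a diffusers sketch. It's not my actual setup: the checkpoint path and file names are placeholders, and the IP-Adapter scale is just a starting point.

import torch
from diffusers import StableDiffusionPipeline
from diffusers.utils import load_image

# Rough sketch of "let an IP-Adapter reference drive the composition".
# The checkpoint path and image file names are placeholders.
pipe = StableDiffusionPipeline.from_pretrained(
    "path/to/your-sd15-checkpoint", torch_dtype=torch.float16
).to("cuda")
pipe.load_ip_adapter(
    "h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin"
)
pipe.set_ip_adapter_scale(0.6)  # how strongly the reference steers the result

reference = load_image("reference_from_gelbooru.png")  # the artwork used as reference

image = pipe(
    prompt="hotify, 2girls, angel wings, lingerie, looking at another, highres",
    ip_adapter_image=reference,
    width=512, height=768,
    num_inference_steps=25,
).images[0]
image.save("base_composition.png")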

Different Levels of Effort

The pictures you'll find in my samples can usually be divided into 3 different "tiers" of effort:

  • The Lazy pictures: these are made using the usual SD poses, hiding hands, removing details and keeping plain backgrounds. These are the ones anybody can easily make just by following my workflow.

  • The "I'm trying" pictures: these are made with small edits and corrections using a digital drawing software (Krita). Nothing too complex, usually just fixing hands or strange details.

  • The "gud" pictures: these are made with actual effort and take multiple hours to complete.

Let's have a look at the different levels:

Lazy Pictures

[Image: 00000-508004103.png]

This is an example of a picture that is easy for the SD model to generate without making obvious mistakes. The prompt was:

<lora:potato:0.6>,hotify,1girl,stuffed toy,cleavage,blue hair,solo,cameo,collarbone,blue shirt,jewelry,smile,stuffed animal,parted bangs,yellow eyes,looking at viewer,bangs pinned back,teddy bear,bare shoulders, short hair

As you can see, there are no hands in the picture, the clothes are incredibly simple, and the background has no major details that could create issues.

"I'm trying" pictures

[Image: 00067-694445704.png]

Here's an example of a picture that needed some fixes. The prompt was:

mizuki apple,long sleeves,earrings,blonde hair,lipstick,mature female,blue nails,standing,makeup,jewelry,1girl,thick eyebrows,long hair,looking at viewer,solo,full body,yellow background,pandora party project, skirt

In this case there were a lot of issues with the arms and various facial details, so I had to take the picture and edit it in Krita. Afterwards, I inpainted a few details (like the faces).

gud pictures

[Image: 00095-1029534685.png]

This is an example of a picture that took a long time and actual effort to make. To explain how I made this one: I was gifted a vinyl figure of Itsuki from The Quintessential Quintuplets series, so I took a horrible photo of it:

Then I removed the background, roughly drew a new one, passed it through img2img, and repeated these steps until I was completely satisfied with the result.

These pictures are the ones which I think really show the potential of SD.

The workflow

If you want to learn about my workflow in a detailed step-by-step guide you can do it here:

Stable Diffusion Ultimate Guide pt. 6: Workflow | by Umberto Grando | Medium

I'll try to explain how I usually generate the 3 different tiers of pictures explained above:

Lazy Workflow

The quickest workflow I use to generate a picture is:

  • Generate a prompt using Ranbooru

  • Generate 6 pictures

  • Choose the best (or rerun until satisfied)

  • Copy the picture into the img2img panel

  • Run the prompt again in img2img with 1.5x the resolution (576x1152 in this case)

  • Run it again at the max resolution I'm capable of running (960x1920 in this case)
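If you like to script things, the same chain can be driven through the web UI's API (launch it with the --api flag). This is only a sketch: the step count and denoising strength are ballpark values, not the exact settings I use every time.

import base64
import requests

# Sketch of the "lazy" chain through the AUTOMATIC1111 web UI API.
URL = "http://127.0.0.1:7860"

def txt2img(prompt, width=512, height=768, batch_size=6):
    r = requests.post(f"{URL}/sdapi/v1/txt2img", json={
        "prompt": prompt, "width": width, "height": height,
        "batch_size": batch_size, "steps": 25,
    })
    return r.json()["images"]  # list of base64-encoded PNGs

def img2img(prompt, image_b64, width, height, denoising_strength=0.5):
    r = requests.post(f"{URL}/sdapi/v1/img2img", json={
        "prompt": prompt, "init_images": [image_b64],
        "width": width, "height": height,
        "denoising_strength": denoising_strength, "steps": 25,
    })
    return r.json()["images"][0]

prompt = "hotify,1girl,blue hair,solo,smile,short hair,simple background"
best = txt2img(prompt)[0]                # in practice I pick the best one by eye
mid = img2img(prompt, best, 576, 1152)   # 1.5x pass
final = img2img(prompt, mid, 960, 1920)  # last pass at my max resolution
with open("final.png", "wb") as f:
    f.write(base64.b64decode(final))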

"I'm Trying" Workflow

The "I'm Trying" workflow follows these steps:

  • Generate a prompt using Ranbooru (or manually)

  • Generate 6 pictures

  • Choose the best (or rerun until satisfied)

  • Remove or change details I don't like using Krita (check out the blood on her face, the ribbon, and the fingers)

  • Copy the picture into the img2img panel

  • Run the prompt again in img2img with 1.5x the resolution (576x1152 in this case)

  • Fix the details again (Krita/inpainting)

  • Run it again at the max resolution I'm capable of running (960x1920 in this case)
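The inpainting passes can also be scripted through the same API if you prefer: you send the Krita-edited picture plus a white-on-black mask of the area to redraw. The file names below are hypothetical, and the fill/padding values are just the defaults I'd start from.

import base64
import requests

# Sketch of a scripted inpainting pass through the AUTOMATIC1111 img2img API.
URL = "http://127.0.0.1:7860"

def b64(path):
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode()

payload = {
    "prompt": "mizuki apple, detailed face, looking at viewer",
    "init_images": [b64("edited_in_krita.png")],  # hypothetical file names
    "mask": b64("face_mask.png"),                 # white = area to redraw
    "mask_blur": 4,
    "inpainting_fill": 1,            # 1 = start from the original content
    "inpaint_full_res": True,        # work at full resolution on the masked area
    "inpaint_full_res_padding": 32,
    "denoising_strength": 0.4,       # low enough to keep the overall look
    "width": 576, "height": 1152,
    "steps": 25,
}
result = requests.post(f"{URL}/sdapi/v1/img2img", json=payload).json()
with open("inpainted.png", "wb") as f:
    f.write(base64.b64decode(result["images"][0]))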

gud Workflow

[Image: Mortal Kombat - Mileena]

The gud workflow is really similar to the previous one, with the difference that I run and fix the picture multiple times until I'm completely satisfied. The above artwork of Mileena took multiple hours to complete.

Software/Hardware

The pictures you see on my pages have been created using the following tools:

  • AUTOMATIC1111 UI: this is my go-to SD UI. I've also used ComfyUI in the past, but it was too "mechanical" for my process

  • Krita: I started learning this when I started posting pictures here on CivitAI. It has quickly become my favorite drawing tool, and I'm also using the awesome krita-ai-diffusion extension to edit pictures assisted by real-time SD

  • RTX 4080: The most overpriced piece of hardware I've ever bought, but still incredibly powerful

  • Samsung Galaxy Book Flex: This was the notebook I originally used to edit and publish all my artworks, until I replaced it with:

  • Surface Studio Laptop: This laptop is insane. I also have a MacBook M1 Pro which I use for music production, but the Surface is my favorite by a large margin.

Conclusions

I hope this "essay" gives you an idea of the amount of effort you'll need to put into one of my models to get results similar to the ones I show in my samples.

Support Me

I started developing custom models for myself a few months ago just to check out how SD worked, but over the last few months it has become a new hobby I like to practice in my free time. All my checkpoints and LoRAs will always be released for free on Patreon or CivitAI, but if you want to support my work and get early access to all my models, feel free to check out my Patreon:

https://www.patreon.com/Inzaniak

If you want to support my work for free, you can also check out my music/art here:

