Sign In

illustrious prompting guide | v0.1 | generate anime art with AI

illustrious prompting guide | v0.1 | generate anime art with AI

tl;dr

Positive: 1girl, general, danbooru tags go here, highres, masterpiece, newest

Negative: lowres, worst quality, multiple views, comic, text, signature, watermark

Dataset

Illustrious v0.1 uses the danbooru2023 dataset. Trained at 1024x1024 with danbooru tags separated by commas. Besides danbooru tags it has some custom tags.

The danbooru dataset is just danbooru. It isn't other boorus. It isn't a certain furry booru used by PonyXL. It's just that one site called danbooru so if you plan to use it you better familiarize yourself with danbooru.

Danbooru has a lot of NSFW stuff in it and it's all in this dataset. Some of it is disgusting and the model can generate it sometimes because it just happens. If you're reading this guide because you want to teach minors how to generate drawings you should stop what you're doing right now and go buy them some pencils instead.

If you don't know what a tag means just check the danbooru wiki

Best illustrious model

The official illustrious fine tuned model is Illustrious XL. It sucks.

Don't use it.

Use other fine tunes like Hassaku XL because it's more consistent so if you change artist for example you get similar pose and composition.

So below is what Illustrious XL with 12 steps does

Prompt: close up, 1boy, kanzaki hideri, holding green apple, smug, frown, licking, half-closed eyes, traditional media, watercolor \(medium\), pastel \(medium\), watercolor \(medium\), from side, looking at viewer, forehead, green stile uniform, halftone background, highres, masterpiece

Negative: lowres, worst quality, multiple views, text, watermark, signature, username, wallpaper, dutch angle, vanishing point, comic, anime screenshot, game cg, sketch
And now Hassaku XL with 12 steps, same prompt same seedThis is illustrious XL with 60 steps

Some fine tunes add images/concepts that aren't in the v0.1 model so other models won't have it. Some loras have different results from model to model and some pony loras work in some illustrious models.

My V-pred model/lora doesn't work

If you're using a v-prediction model locally you need something that supports vpred. A1111 doesn't support vpred yet. Download the eps/epsilon version instead

Tag order

person count, character names, rating, general tags, artist, score range based rating, year modifier

  1. person count = 1girl, 1boy, 1other, 2boys, 2girls

  2. character name = not necessary

  3. rating = not necessary, see below

  4. artist = not necessary but probably good idea

  5. general tags = everything else

  6. score = probably a good idea

  7. year modifier = probably not necessary

So for example you are supposed to type it like this

1boy, kinoshita hideyoshi, general, building, itomugi-kun, masterpiece, newest

So that is

  1. person count = 1boy

  2. character name = kinoshita hideyoshi

  3. rating = general

  4. everything else = building

  5. artist = itomugi-kun

  6. score = masterpiece

  7. year modifier = newest

This doesn't matter a lot but changing the order of the tags will change what image is rendered because the thing was trained mainly to render prompts in this order.

btw you obviously can add as many tags as you want and you don't even character and artist tags if you don't want

1boy, building

Best prompt size

So when you type a prompt it's turned into tokens and SDXL can only do 75 tokens. If it's more than 75 it gets broken up into batches of 75 and that can really affect quality

If you want to render something very complex you'll need to inpaint it. civitai doesn't have this feature I think so you need to do it locally. and if you inpaint you want to soft inpaint the masked area instead of the whole image and tag only the things in that area like if you're inpainting the face you just tag the eye color and expression and maybe hair color

Are commas needed in prompts?

The model was trained with commas so yeah they are necessary because the commas are converted into tokens too

so instead of sleeveless striped collared shirt you need the tags sleeveless, striped shirt, collared shirt because striped collared shirt isn't a danbooru tag but striped shirt and collared shirt are.

Name order

Danbooru character names are in japanese order so it's kinoshita hideyoshi NOT hideyoshi kinoshita. If you don't know what that means go watch more anime

Upper case or lower case

Danbooru tags are in lower case so it's masterpiece not Masterpiece, no cap.

Illustrious tags NOT from Danbooru

These special tags added in illustrious v0.1 training don't exist in danbooru

Synonyms

Woman instead of 1girl. Don't use these. Just use 1girl like everyone else.

Quality tags

worst quality = 8% quality score.

bad quality = 20%.

average quality = 60%.

good quality = 92%.

masterpiece = 100%.

What is the score criteria? idk it doesn't say. Two possibilities. First is some AI rated them. Second is these are users ratings who clicked on the upvote in the site. Both sound terrible. But everyone puts worst quality in the negative prompt and masterpiece in the positive prompt so just do that i guess

Newest/Oldest tags

oldest = ~2017 and this physically hurts me

old = ~2019

modern = ~2020

recent = ~2022

newest = ~2023

Underscores in tags

On danbooru tags have underscores like open_mouth. It works without underscores like open mouth but sometimes you need them like +_+.

Emoji \:\) tags

On danbooru some tags are emoji like

:) = happy

:( = sad

:p = tongue

:d = :D because all tags are lower case on danbooru remember

If you use A1111 to prompt then prompts like (1boy:1.5) and [1boy:1girl:5] are special. So (, ), [, ], and : are special.

To use :p you need to write it like \:p

\:\) = :)

btw if you see ? in a prompt that's actually a tag on danbooru not a special thing. ! is a tag in too. v is a tag. w is a tag. seriously just try them sometime

Parentheses in tags

Same rule for parentheses in names of artists, characters, etc.

watercolor \(medium\)

astolfo \(fate\)

Rating tags

Posts on danbooru have a rating that tags it as nsfw or not except the tag isn't nsfw or sfw and it was changed in 2022

general = new SFW for all audiences

sensitive = everything that was rated "safe" before 2022 became this

questionable = sexually suggestive

explicit = adult audiences / NSFW

also on danbooru to search these you type like rating:questionable but I don't think you need the rating part in illustrious

Basically you could use general in your positive prompt and explicit in the negative to avoid rendering NSFW but because general was added in 2022 and illustrious uses the danbooru2023 dataset using general excludes like 99% of the whole thing.

absurdres, highres, lowres tags

Posts on danbooru are tagged according to their size

lowres = 500×500 or smaller

no resolution tag = larger than 500x500 and smaller than 1600x1200

highres = at least 1600×1200

absurdres = at least 3200×2400

incredibly absurdres = any dimension over 10000

Illustrious was trained for 1024x1024 remember so how did they use lowres and absurdres images in training if the size didn't match? idk but they probably stretched or cropped them and that's bad. so if that's bad why everyone uses absurdres?

so I did some tests and basically the tl;dr is don't use them at all because lowres will make your gen worse but absurdres won't make it better and any improvement you see is just the AI being randomalso remember you only got 75 tokens so you don't want to waste 2 tokens on a tag that is unpredictable

Prompt writing tips

Avoid negative prompts

Negative prompts are kind of a black box because you don't know how the AI learned a concept so if you tell it to not generate the concept you never know what is actually going to get removed

like some tags can mess the background detail or change the saturation and if you have 30 negative tags you won't know what is doing what

There's a lot of tags in danbooru. Every single thing on danbooru affects your gens a bit. If you tried to exclude every thing you don't want in your gen the prompt would be immense.

Just write better positive prompts instead

Good negative tags

you'll probably want these tags if you are generating a bunch of pics

monochrome and/or greyscale so you don't get black and white images. it's spelled grey with an e on danbooru btw.

multiple views so you don't get the same character in multiple angles

text, watermark, signature, translation request so you don't get text

comic, 4koma, 2koma so you don't get panels

Prompt: 1boy, general, otoko no ko, monochrome, greyscale, multiple views, text, watermark, signature, translation request, comic, 4koma, 2koma, space, holding weapon, drill sidelocks, blue hair, aqua eyes, serious, full body

in case you're wondering yeah the letters look right but they're just placed randomly so it doesn't mean anything. if you put random japanese text in your gens and you can't read it it's probably going to say something you don't want

Use style tags

If you want to make some good gens and not just generic AI slop you need to use style tags

so on danbooru most posts are digital art so digital art is generic art

to change the style you need either an artist tag or a medium tag

if you don't know any artists go to this page type a tag you care about like landscape, artist category order by frequency. It's going to show a number next to the artist. That's how many images they got on the booru. If the number was more than 100 in 2023 you can probably use their style

Prompt: 1boy, full body, fog, cable, building, mountain, outdoors, solo, standing, tree, utility pole|asteroid ill

medium tags are tags like

traditional media

watercolor (medium)

graphite_(medium)

colored pencil (medium)

millipen (medium)

photo (medium)

watercolor (medium)

painting (medium)

marker (medium)

there's also 1980s (style), 1980s (style), 2000s (style), retro artstyle, pc-98 (style), game gc, anime screenshot

ofc the number of images varies on danbooru so the styles with more images are stronger

You don't need to make your gen 100% a style. Like you can use (traditional media:0.5) to make it only half traditional media.

Also you can combine multiple styles in one prompt like combining two artist styles together or two media

Prompt: 1boy, otoko no ko, artist's hand in frame, kirigami, photo \(medium\), pen \(medium\), looking at viewer, full body, ?

Use background/border styles

Danbooru has a lot of background/border tags you can use like

stripped background

argyle background

white background

ornate frame

lace border

So basically the reason your gens look so fake and generic is that 99% of art that gets posted has a white background but AI slop always has ultra detailed interiors/exteriors. it looks fake because a manga artist wants to draw cute anime boys they would never waste so much time drawing furniture

Prompt: 1boy, ciel phantomhive, watercolor \(medium\), averting eyes, ornate border, white background, eyepatch,

Use camera tags

if you want to control the camera

straight-on = front view

from side = side view

from above = top view

from below = tbh crotch view 9 out of 10 times

three-quarter view

isometric

symmetry

vanishing point

fisheye

full body

head out of frame

close-up

Prompt: 1boy, kinoshita hideyoshi, otoko no ko, standing, isometric, chibi, halftone background, grass

Use pose tags

you don't need control net to pose your characters just these tags like these and inpainting

sitting

standing

jumping

kneeling

on back

on side

knee up

leg up

standing on one leg

arms up

arms behind head

arms behind back

hand on another's head

hand on own hip

thumbs up

clenched hand

head tilt

Prompt: 1boy, otoko no ko, wariza, hand on own knee, v, smile, one eye closed, head tilt, from above, fisheye, graphite \(medium\), monochrome

Use effect tags

Some cool effects you can use

blurry foreground

blurry background

depth of field

glitch

bloom

spot color

chromatic aberration

Prompt: 1boy, otoko no ko, wariza, mechanical eye, glitch, chromatic aberration, blurry blurry foreground, blurry background, depth of field, anime screenshot,

Best boy tags

Always use these tags to make your gens better

otoko no ko = femboy

toned = muscular

collarbone = self explanatory

flat chest = male cleavage

shark teeth = ^wwww^

skin fang = self explanatory

groin = V lines

midriff

bare shoulders

fingerless gloves

very short hair

hair over one eye

Bad tags illustrious doesn't understand

Cargo cult tags

Basically if the tag isn't in danbooru and isn't a illustrious one illustrious won't understand it.

score_9, 8k, 4k, hdr aren't tags illustrious understands. Even high quality isn't a tag. Detailed isn't a tag. Many isn't a tag.

That doesn't mean they won't have an effect. Many does work sometimes but that's because multiple is a tag in danbooru I think. But basically assume that if it isn't a tag in danbooru it won't work even if it worked in other models like PonyXL

Low quantity tags

Since danbooru has like a billion images if a tag isn't used a lot illustrious doesn't know what it is but it does a lot better than other models which is why it's great.

Basically if a tag has less than 100 images on danbooru assume it isn't going to work. The more it has the better it will work.

Your favorite artist only has 3 images on danbooru? Too bad. You can't use them as a tag. Go make a lora.

You wish you could render something specific but those dweebs at danbooru don't appreciate it like you do so they never spent hours tagging hundreds of images with the tag so now illustrious doesn't know what you are talking about? Too bad. Create a lora. Better yet maybe you should go to danbooru create an account yourself and go on a tagging rampage so maybe the next model trained on danbooru will have your contributions in it and that benefits everybody.

But if a tag has more than 100 images on danbooru it's probably going to work. Unless those images were tagged after 2023 so it's not in the dataset.

So basically most artists don't work as tags because most artists don't even have 100 posts on their pixiv

But some artists will have even over 1k pics so you can use their style.

They don't post hundreds of screenshots of anime on danbooru so even if an anime exists on danbooru most of its characters probably won't work specially the less popular male characters.

Sometimes a tag works but it doesn't look like what you want because what's posted on danbooru is only what people will draw

Redesigns

Like bridget (guilty gear) has 3500 images but it got a redesign for strive and it's the same tag for both strive and xx. I scoured 2 boorus and the entire pixiv to make a lora. I found like 260 pics of the xx design. This is all pics of og bridget, ever. So like 90% of the pics are the new design. You can actually tell when strive was released just by browsing the booru. Start from the last page and go back a few pages then it's all the new bridget and you're like wait that's it?

Ambiguous tags

Some tags are used a lot but it's not obvious what it means or they are confused with more common tags.

Like contortion is used in all sorts of contortion poses so illustrious can't figure out what contortion images have in common

Danbooru has a tag for halo behind head but you get one over the head if you try to use it because just halo is more common

Conclusion

Now you know how to prompt

Go gen some cute anime boys

40

Comments