
Queratogray Sketch (eddiemauro-mix)

Verified: SafeTensor
Type: Checkpoint Trained
Stats: 4,168
Published: May 30, 2023
Base Model: SD 1.5
Usage Tips: Clip Skip 2
Trigger Words: sketch artstyle
Hash (AutoV2): 3DF958B49A

Before use

  • You need to know how Stable Diffusion works. I recommend using Automatic1111 as the interface to run the model.

  • This model was trained on the SD 1.5 base model, so keep in mind that it is not perfect. I had to run a lot of tests before arriving at stable generation. I will improve the model when a better base model arrives (such as the new SD XL).

  • This is a Checkpoint model.

  • I recommend following me on my Instagram account, where I explain AI image generation: https://www.instagram.com/eddiemauro.design/

Intro

QUERATOGRAY SKETCH (eddiemauro-mix) CHECKPOINT: Hi, I’m a product and car designer, and I’m excited to experiment with AI; I think it is a good tool for designing. I decided to collaborate with a friend, Joell Martínez Tenjo, who is a product designer but focuses on animation and illustration. We took more than 50 sketch-style illustrations from his grayscale/monochrome "Sketchbook" series; I then trained a model on them and mixed it with some other models to regularize the training, obtaining approx. 50% of his final style. You can check his profile here: https://www.behance.net/queratoilustracion

The style focuses only on generating images of people, but you can combine it with LoRAs to extend it to other kinds of subjects.

If you want to support my work and help me upload more models (with better quality), you can donate here; I would greatly appreciate it: https://ko-fi.com/eddiemauro

Installation

  • I use Automatic1111, which I consider the best UI for Stable Diffusion image generation, so I recommend installing it locally or using it online through Colab or another hosting service. You can find instructions or videos online for that. If you are going to install it locally, you can watch this tutorial online. I recommend an NVIDIA graphics card with at least 6-8 GB of VRAM for a stable interface, and launching it in “Microsoft Edge”, because you will have problems in “Google Chrome”. Also try enabling the “medvram” or “lowvram” option together with “xformers” (search online for how to do it).

  • You have to install the Checkpoint model to use it.

  • Please follow all my recommendations for image creation; if you don't, it is impossible to generate good image quality. Also keep in mind that, as of today, AI image generation is not fully consistent or perfect: you have to invest time and run plenty of tests to get good results.
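This page asks you to install several files: the checkpoint itself and, further down, a VAE, an optional embedding and an upscaler model. The following is a minimal sketch of where each kind of file usually goes, assuming the default folder layout of a local Automatic1111 installation. The helper name `destination_for`, the `kind` labels and the example file names are hypothetical and only serve as illustration; check the folders of your own installation before copying anything.

```python
from pathlib import Path

# Assumed default sub-folders of a local Automatic1111 installation.
DESTINATIONS = {
    "checkpoint": Path("models") / "Stable-diffusion",
    "vae": Path("models") / "VAE",
    "embedding": Path("embeddings"),
    "upscaler": Path("models") / "ESRGAN",
}


def destination_for(webui_root, kind, file_name):
    """Return the path where a downloaded file of the given kind should be placed."""
    if kind not in DESTINATIONS:
        raise ValueError(f"unknown kind {kind!r}, expected one of {sorted(DESTINATIONS)}")
    return Path(webui_root) / DESTINATIONS[kind] / file_name


# Example with hypothetical file names: print where each download belongs.
if __name__ == "__main__":
    for kind, name in [
        ("checkpoint", "queratograySketch.safetensors"),
        ("vae", "kl-f8-anime2.ckpt"),
        ("embedding", "EasyNegative.safetensors"),
        ("upscaler", "4x-AnimeSharp.pth"),
    ]:
        print(kind, "->", destination_for("stable-diffusion-webui", kind, name))
```

The function only computes paths; it does not move or download any file.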

Recommendations for image generation

  • Activation token/caption: Inside the prompt box, the first words have to be “sketch artstyle” to activate the style. This is mandatory; if you don't do it, the model will not work properly.

  • Other recommended prompting: These words enhance the image generation. In the positive prompt: “grayscale, monochrome, ((solo))”; in the negative prompt: “out of frame, multiple people, missing fingers, extra digit, fewer digits, (((many people))), blurry, color”. You can also look at the metadata of the example images here and reuse their prompts.

  • Textual inversion/embedding or LoRA tools: Don't use negative embeddings if you want to preserve the style; if you use one, the result turns into a normal grayscale sketch style. Keep the negative prompt simple (see the image examples), because a complex one will destroy the style. The model is good at generating faces and eyes, so "face restoration" or other embeddings are not really necessary. If you do use a negative embedding, I consider “EasyNegative” one of the best textual inversions for the negative prompt. Download it here and install it by putting the file inside the “embeddings” folder.

  • VAE: For the sketch style it is mandatory to use "kl-f8-anime2". Download and install it if you don't have it.

  • Clip Skip: Use 2.

  • Steps and CFG: I recommend 20-40 steps and a CFG scale of 7-8; the ideal is 30 steps and CFG 7. For future models these values could change.

  • Sampler: I mostly use “EulerA” or “DPM++SDE Karras”. EulerA tends to be simpler and more creative. Experiment with other samplers if you like.

  • Batch: In txt2img, set a value of 4 to generate more than one image and compare the results. If you have a good graphics card, use “Batch size”, which creates the 4 images at the same time (each run takes longer); if your computer cannot handle this, switch to “Batch count”, which creates the 4 images one after another (not at the same time) and takes longer overall.

  • Image size: Use these dimensions: 512x512, 768x512 or 512x768. Don't generate bigger images directly because the style could be lost. If you want a bigger image, use hires.fix in txt2img mode, the img2img upscaling method, the Ultimate SD Upscale script extension + ControlNet, or plain upscaling with GAN models.

  • Create bigger images: There are 4 different methods to create large images in Stable Diffusion; you can check online how each one works. For the first method, “txt2img hires.fix”, I recommend the upscale model called “4x-AnimeSharp”: download just the “.pth” file here and install it by putting it inside the “ESRGAN” folder. In the hires.fix options, choose any “upscale by” value and a “denoise strength” of “0.5-0.7”. For the second method, take the image generated in txt2img, send it to img2img mode and increase the dimensions by at least “1.5 times” with a “denoise strength” of “0.3-0.5”. For the third method, use the same img2img configuration, but activate the “tile” mode of the “ControlNet” extension together with the “Ultimate SD Upscale” script; for that, I recommend watching a tutorial here. For the last method, send the image generated in txt2img to “extras”, select a GAN model and scale it; you can also use the “4x-UltraSharp” model.

  • Get more control over your creation: Use the “ControlNet” extension to control the shape of what you want more precisely; you can even test it with sketches. Use the “Scribble” or “Lineart” modes. I recommend installing this extension and then learning how to use it. There are plenty of online videos about it.

  • Copy the prompt from image metadata: You can download my example images here and drop them into the “PNG info” tab of Automatic1111.
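The settings recommended in the list above can be collected in one place. The following is a minimal sketch, assuming you drive Automatic1111 through its optional web API (started with `--api`) and its `/sdapi/v1/txt2img` endpoint. The function name `build_txt2img_payload`, the VAE file name and the upscaler label are assumptions for illustration and must match what your own installation shows. The code only builds the request body; it does not contact any server.

```python
# Image sizes recommended on this page (width, height).
ALLOWED_SIZES = {(512, 512), (768, 512), (512, 768)}


def build_txt2img_payload(prompt, negative_prompt, steps=30, cfg_scale=7.0,
                          width=512, height=512, batch_size=4,
                          hires_scale=None, hires_denoise=0.5):
    """Build a txt2img request body using the values recommended on this page."""
    if not 20 <= steps <= 40:
        raise ValueError("steps should stay between 20 and 40")
    if not 7 <= cfg_scale <= 8:
        raise ValueError("cfg_scale should stay between 7 and 8")
    if (width, height) not in ALLOWED_SIZES:
        raise ValueError("use 512x512, 768x512 or 512x768 and upscale afterwards")

    payload = {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "steps": steps,
        "cfg_scale": cfg_scale,
        "sampler_name": "Euler a",
        "width": width,
        "height": height,
        "batch_size": batch_size,  # images generated at the same time
        "n_iter": 1,               # "Batch count": runs executed one after another
        "override_settings": {
            "CLIP_stop_at_last_layers": 2,   # Clip Skip 2
            "sd_vae": "kl-f8-anime2.ckpt",   # assumed file name of the VAE
        },
    }

    if hires_scale is not None:
        # First upscaling method: hires.fix with a denoise strength of 0.5-0.7.
        if not 0.5 <= hires_denoise <= 0.7:
            raise ValueError("hires denoise strength should stay between 0.5 and 0.7")
        payload.update({
            "enable_hr": True,
            "hr_scale": hires_scale,
            "hr_upscaler": "4x-AnimeSharp",  # assumed label, as listed in your UI
            "denoising_strength": hires_denoise,
        })
    return payload
```

If your computer cannot handle a batch size of 4, passing `batch_size=1` and setting `n_iter` to 4 in the returned dictionary corresponds to the “Batch count” option.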

Example Prompting:

Positive prompt: 

A young man, sketch artstyle, grayscale, monochrome, ((solo))

Positive prompt (loses the style): 

A young man, sketch artstyle, grayscale, monochrome, ((solo)), ((masterpiece)), HDR, highly detailed, professional

Negative prompt: 

out of frame, multiple people, missing fingers, extra digit, fewer digits, (((many people))), blurry, color

Negative prompt (loses the style): 

EasyNegative, (worst quality:2), (low quality:2), (normal quality:2), out of frame, multiple people, missing fingers, extra digit, fewer digits, (((many people))), blurry, color

Steps: 20-40 (20 is enough for EulerA; you can also use DPM++SDE Karras, but EulerA is usually better).

CFG scale: 7-8 (7 is ideal).
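The prompting rules can also be written down as a small helper. This is a minimal sketch, not part of the model: the names `build_prompts` and `find_style_breakers` are hypothetical, and the list of style-breaking terms is simply inferred from the difference between the example prompts that keep the style and those that lose it. It places the activation token first, as the recommendations ask.

```python
ACTIVATION_TOKEN = "sketch artstyle"
STYLE_TAGS = ["grayscale", "monochrome", "((solo))"]
NEGATIVE_TAGS = [
    "out of frame", "multiple people", "missing fingers", "extra digit",
    "fewer digits", "(((many people)))", "blurry", "color",
]
# Terms that appear only in the example prompts that lose the style.
STYLE_BREAKERS = [
    "easynegative", "masterpiece", "hdr", "highly detailed", "professional",
    "worst quality", "low quality", "normal quality",
]


def build_prompts(subject):
    """Return (positive, negative) prompts with the activation token first."""
    positive = ", ".join([ACTIVATION_TOKEN, subject, *STYLE_TAGS])
    negative = ", ".join(NEGATIVE_TAGS)
    return positive, negative


def find_style_breakers(prompt_text):
    """List the terms in a prompt that are likely to wash out the sketch style."""
    lowered = prompt_text.lower()
    return [term for term in STYLE_BREAKERS if term in lowered]


if __name__ == "__main__":
    positive, negative = build_prompts("A young man")
    print(positive)
    print(negative)
    print(find_style_breakers("EasyNegative, (worst quality:2), blurry, color"))
```

Running `find_style_breakers` on a prompt before generating is a quick way to notice that a copied prompt contains quality tags or a negative embedding.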

What comes for the future

I’m already trying to improve the model. This version was trained at 512 resolution, so I will try 768 (a bigger one) and also other configurations (changing captions, steps, epochs, etc.). If you would like a better version of this model, please keep supporting me on Ko-fi; if more people support me, I can invest more time in training and improving models, but otherwise I cannot.

I launched my first private model for my Ko-fi membership lv.1, called "eddiemauro scene", for minimalistic scenery creation for rendering. If you want access to private models, you can support me by subscribing to this membership. I will also start uploading more models here, centered on product and car design.


License

See the Stable Diffusion license here. This specific model is for experimentation only. It is prohibited to:

  • Upload this model to any server or public online site without my permission.

  • Share this model online without my permission, use my exact model under a different name, or upload this model and run it on services that generate images for money.

  • Merge it with a checkpoint or a LoRA and then publish or share it online; just talk to me first. In the future,

  • Sell this model or merges using this model.


Supporting

You can follow me on my social networks, where I show my process as well as design tips and tools. You can also check my webpage; in case you need a design service, I work as a freelancer.

http://eddiemauro.design/

https://www.facebook.com/eddiemauro.design

https://www.instagram.com/eddiemauro.design

https://www.linkedin.com/in/eddiemauro

https://www.behance.net/eadesign1