Published | Nov 25, 2024 |
Hash | AutoV2 2F10E40C33 |
Very, very alpha version of an SD3.5M finetune. I'm posting it to keep a record of training settings, give others a base to start from, and serve as a proof of concept. The first version is in diffusers format.

New workflow (you'll want to grab it from one of my images below) to solve the corruption issue: if you put too many tokens into CLIP, SD3.5M starts corrupting your image, so breaking the prompt up and feeding the parts directly into CLIP L, CLIP G, and T5 can fix this. Use tags or a barebones prompt for CLIP (under 77 tokens) and give T5 the full prompt.

Moving forward, the focus will shift from art styles to character recognition (although I might still incidentally pick up some styles while training characters) and to specific poses (starting with danbooru tag poses and an equivalent natural-language caption for each).

All images are straight from the model, with no LoRA or inpainting. Use "Art by __________" as the first tag. For photographic images, use "photograph of a __________, real life, __________". Can do SFW or NSFW. A list of artist styles for Dynamic Prompt can be found at https://files.catbox.moe/w413ru.txt.
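For anyone scripting this instead of using the workflow, here is a minimal Python sketch of the prompt split described above: a short tag list for the two CLIP encoders and the full prompt for T5. The names `split_prompt` and `rough_token_count` are hypothetical, and the token count is only a crude approximation (words and punctuation), not the real CLIP tokenizer, so treat the budget as a safety margin rather than an exact limit.

```python
import re

CLIP_TOKEN_LIMIT = 77  # from the description: keep CLIP input under 77 tokens


def rough_token_count(text: str) -> int:
    """Crude estimate: each word and each punctuation mark counts as one token.

    Assumption: the real CLIP tokenizer may split rare words into several
    tokens, so this can undercount. Use the actual tokenizer for exact numbers.
    """
    return len(re.findall(r"\w+|[^\w\s]", text))


def split_prompt(tags: list[str], full_prompt: str,
                 budget: int = CLIP_TOKEN_LIMIT - 2) -> dict[str, str]:
    """Return a short tag prompt for CLIP L/G and the full prompt for T5.

    The default budget leaves room for the start/end tokens CLIP adds.
    Tags are kept in order, so put the "Art by ..." tag first.
    """
    kept: list[str] = []
    for tag in tags:
        candidate = ", ".join(kept + [tag])
        if rough_token_count(candidate) > budget:
            break  # stop before the CLIP prompt gets too long
        kept.append(tag)
    clip_prompt = ", ".join(kept)
    return {"clip_l": clip_prompt, "clip_g": clip_prompt, "t5": full_prompt}


if __name__ == "__main__":
    parts = split_prompt(
        ["Art by example artist", "1girl", "standing"],
        "Art by example artist, a girl standing in a field of flowers at sunset",
    )
    for encoder, text in parts.items():
        print(f"{encoder}: {text}")
```

If you run the model through diffusers rather than a node workflow, my understanding is that `StableDiffusion3Pipeline` accepts `prompt`, `prompt_2`, and `prompt_3` for its three text encoders, so the three strings above could be passed there. I have not verified that this reproduces the workflow's results exactly.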