Type | |
Stats | 1,824 3,149 |
Reviews | (313) |
Published | Apr 27, 2023 |
Base Model | |
Training | Steps: 1,700 Epochs: 1 |
Trigger Words | warhammer 40k commissar |
Hash | AutoV2 8CFB1F82A8 |
I got tired of trying to use 1940s Germans to generate commissar art for Warhammer 40K games. So, I've invested some time and learned how to make LoRAs.
It's my first time making a LoRA, but I think this was somewhat of a success.
This LoRA is generally good at generating the uniform itself, but I wasn't able to get it to produce proper insignias or aquilas.
Version 1.0
This was my first attempt. I really didn't know what I was doing, so I made some rookie mistakes.
First, the training wasn't set up to focus on the uniform, so the LoRA learned the general vibe of the training images, which were mostly Warhammer 40K commissar art. As a result, it affects the whole image, which may not be what you want in more specific situations.
The second mistake was using a multi-word activation phrase: "warhammer 40k commissar". In theory, a phrase like this could affect more of the image and conflict with other parts of the prompt.
Still, this version reproduces some details that version 2.0 ignores. Also, because it learned the vibe of Warhammer 40K art, it carries that style into the images it generates.
Version 2.0
This time I actually included the painfully crafted captions for the training images in the training process, which should have helped focus the LoRA on the uniform. By extension, that should have limited the LoRA's effect on the rest of the image, making it more versatile. I also used regularization images this time, hopefully improving the end result.
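For anyone curious what per-image captions can look like in practice, here is a minimal Python sketch. It assumes a trainer that reads a `.txt` caption file with the same base name as each image, which is a common convention but not necessarily the setup used for this LoRA. The image names, the tags, and the `build_caption` and `write_captions` helpers are all hypothetical.

```python
from pathlib import Path


def build_caption(tags, trigger):
    """Join the trigger phrase and descriptive tags into one comma-separated caption."""
    cleaned = [t.strip() for t in tags if t.strip()]  # drop empty or blank tags
    return ", ".join([trigger] + cleaned)


def write_captions(image_tags, folder, trigger):
    """Write one .txt caption per image name and return the written paths.

    image_tags maps an image file name to a list of tags describing it.
    """
    folder = Path(folder)
    folder.mkdir(parents=True, exist_ok=True)
    written = []
    for image_name, tags in image_tags.items():
        # Assumed convention: "img_001.png" gets its caption in "img_001.txt".
        caption_path = folder / (Path(image_name).stem + ".txt")
        caption_path.write_text(build_caption(tags, trigger), encoding="utf-8")
        written.append(caption_path)
    return written


if __name__ == "__main__":
    # Hypothetical example data; the trigger is the phrase listed on this page.
    example = {"img_001.png": ["black coat", "peaked cap", "red sash"]}
    for path in write_captions(example, "captions_demo", "warhammer 40k commissar"):
        print(path, "->", path.read_text(encoding="utf-8"))
```

Which tags to include or leave out of each caption is a judgment call, and this sketch does not take a position on it.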
The result was that the LoRA didn't learn the overall style of Warhammer 40K art (somewhat to my disappointment) but now creates more consistent commissar uniforms. Some details that were accidentally learned in version 1.0, like the horizontal golden cords on the commissar shirt or the unique red sash, are rarer, though I found that prompting for the sash makes it show up more often.
Version Comparison
The prompt was a simple concept: "girl, black uniform, arms crossed". Although V2 still affects more of the image than just the uniform, its impact is far smaller than that of V1, which didn't connect "black uniform" in the prompt to the commissar uniform.
The ideal result would be to completely replace the military uniform on the right with a commissar uniform while leaving the rest of the character untouched. Alas, both V1 and V2 affect more of the image than just the character.
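A like-for-like comparison of this kind is usually set up by keeping the prompt, negative prompt, and seed fixed and changing only the LoRA. Below is a minimal sketch, assuming the `<lora:name:weight>` prompt syntax of the AUTOMATIC1111 web UI. The LoRA file names, the weight, the seed, and the `comparison_jobs` helper are placeholders, not the settings used for the images on this page.

```python
def lora_tag(name, weight=1.0):
    """Format a LoRA reference in the <lora:name:weight> prompt syntax."""
    return f"<lora:{name}:{weight:g}>"


def comparison_jobs(base_prompt, loras, seed, negative_prompt=""):
    """Return one job per variant: a baseline without a LoRA, then one per LoRA.

    loras maps a label (e.g. "V1") to a (file name, weight) pair.
    Every job shares the same prompt text, negative prompt, and seed,
    so the LoRA is the only thing that changes between them.
    """
    jobs = [{
        "label": "no LoRA",
        "prompt": base_prompt,
        "negative_prompt": negative_prompt,
        "seed": seed,
    }]
    for label, (name, weight) in loras.items():
        jobs.append({
            "label": label,
            "prompt": f"{base_prompt}, {lora_tag(name, weight)}",
            "negative_prompt": negative_prompt,
            "seed": seed,
        })
    return jobs


if __name__ == "__main__":
    # Hypothetical file names, weight, and seed.
    jobs = comparison_jobs(
        "girl, black uniform, arms crossed",
        {"V1": ("commissar_v1", 0.8), "V2": ("commissar_v2", 0.8)},
        seed=1234,
        negative_prompt="badhandv4",
    )
    for job in jobs:
        print(job["label"], "|", job["prompt"], "| seed", job["seed"])
```

Each job dictionary can then be passed to whatever front end or script does the actual generation.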
Future Plans
I am a bit miffed that some of the details are missing in the current LoRA, and I am thinking of ways to bring them out. This will most likely involve a lot of trial and error. If I am successful, I will post a new version here.
Although I think V2 understands the concept of the commissar uniform better than V1, the LoRA still affects the rest of the image significantly. I think better captioning and more carefully processed training data would result in a better version.
Negative textual inversion embeddings (TIs) used: Deep Negative V1.x, badhandv4