Type | |
Stats | 458 1,307 |
Reviews | (58) |
Published | Dec 5, 2023 |
Base Model | |
Training | Steps: 7,870 Epochs: 10 |
Usage Tips | Clip Skip: 2 Strength: 0.9 |
Trigger Words | n3wp1r4t3 |
Training Images | Download |
Hash | AutoV2 3CC7F6DBB2 |
Work in progress. i was really dissatisfied with the current models for pirates. The inferences always seemed to include a comic skull and crossbones painted on a hat. And the tricorn hats always looked wrong. So I am attempting to make a more aesthetically rewarding models for pirates. I posting four versions along with their datasets. I hope someone can give me some guidance on how to better elaborate details like the shape of a cutlass, the shape of a tricorn or bicorn hat.
Version 1
First out the of the gate. Came out a little too Johnny Deppy, if you know what I mean. Still tends to consistently deliver credible looking pirate captains, mates and women pirates. Made a mistake and baked this with a dimension of 9 and an alpha of 1. But I like the results. I'm going to experiment with using lower dimensions and alphas in the future. Works with the trigger keyword p1r4t3s, but you can omit it for different effects. The other keywords are "captain", "mate", and "female". Don't use the word "pirate" in your prompt or you will get a bunch of dorky cheesy pirate costume dreck that is in the base models.
Version 2
Jacked up the network alpha and dimensions to 128, but didn't make many changes to the captions or weighting in the data set. Same trigger and keywords as the first version. Aslo changed clip skip to 2, just to see what happens.
Version 3
Lowered the network dimension and alpha to 16. Changed clip skip to 2. I did a lot of reweighting of the data set so that Johnny Depp was weighted a lot less and female images were weighted more, especially the character of Anne Bonny from Black Sails. This improved the originality of the images produced.
Version 4
Kept dimension and alpha at 16 and clip skip at 2, Added a lot more images to the dataset of particular items such as the tricorn hat, cutlass, bicorn hat and slouch hat. The hope that was that more detailed data on these features would lead to better inferences. I'm not sure that was the case. On this version the learning rate was a lot lower than earlier versions. I don't see a huge difference. One thing I noticed is that the items that I added images for are rendered in the inferences in the scale that they appeared in the data images, not at the scale that would mesh with the rest of the scene. Perhaps I should try making them relatively smaller in the training set so that they are approximately the same size as they will appear in the finished inference.