Sign In

An extensive comparison of checkpoints based on embeddings for female faces

An extensive comparison of checkpoints based on embeddings for female faces

1. Introduction

An analysis was made to compare a selection of 92 StableDiffusion 1.5 checkpoints based on the impact of 30 different embeddings of women available on civitai. The goal was not necessarily to find the best resemblance of the character that the embedding is supposed to represent. Rather it was to give an indication of what combinations result in beautiful female faces, as well as to discover the extent to which embeddings impact the chosen models.

2. Settings

A number of preliminary test runs were made in an attempt to find most suitable settings to achieve as similar results on all checkpoints as possible, which were then used for every render. These settings were as follows:

Prompt: detailed portrait of extremely beautiful <embedding>, 20yo, flawless skin, detailed eyes, sultry expression, perfectly lit face
Negative: BadDream, easynegative, pale skin, painted lips, nude, blushing, upper body, wide shot, hands, arms, lying, sitting, breasts, (black and white image)
Sampling method: Euler a
Sampling steps: 50
Width: 512
Height: 512
CFG Scale: 7.5
Clip skip: 2
SD VAE: vae-ft-mse-840000-ema-pruned.ckpt

The X\Y\Z plot script was then used to generate a table of images over checkpoints and embeddings. In the prompt the name of an embedding was placed on where it says <embedding>, which in turn would be replaced by others from the type Prompt S/R on the X axis, and the Y axis received checkpoint name, while Z was unused.

A VAE was chosen explicitly to assist checkpoints that do not have VAE baked in.

3. Categorizing and batching results

92 checkpoints were used against 30 embeddings. That means a total of 2,760 faces were rendered. In order to make this amount manageable the images were split up into batches ranging from A1 to E5. The letters A to E represent groups of checkpoints and the numbers one to five represent groups of embeddings. The checkpoints were grouped based on similarity, while the embeddings where only grouped alphabetically.

Checkpoints were categorized as follows:

  • 2D: images resemble cartoon, manga or drawing,

  • 3D: images resemble 3D models,

  • 2.5D: images that are in between 2D and 3D/realistic images,

  • 2.9D: images that are close to - but not entirely - realistic,

  • P: images that resemble photoshop realism. That is to say images that resemble photos that have been brushed up to make them look more beautiful.

Note: checkpoints were subjectively categorized based on on a small number of renders. Therefore the level of realism of checkpoints within a category may vary and you may find some checkpoints more suitable belonging to an adjacent category instead.

The checkpoints within categories were sorted alphabetically. After that they were grouped as follows (the size of the group is given in brackets):

  • A: 2D and 3D checkpoints (9)

  • B: 2.5D checkpoints (18)

  • C: 2.9D checkpoints part 1 (14)

  • D: 2.9D checkpoints part 2 (14)

  • E: P checkpoints (17)

For a complete list of checkpoints and embeddings used - as well as links to their pages on civitai.com - please scroll down to the bottom of this article.

4. Results

Images are downscaled for this article. Click on them to view the full sized version on Github, or [visit the repository directly](https://github.com/aiartsy77/checkpoints_against_embeddings).

4.1. Group A: 2D & 3D checkpoints

Batch A1

Batch A2

Batch A3

Batch A4

Batch A5

4.2. Group B: 2.5D checkpoints

Batch B1

Batch B2

Batch B3

Batch B4

Batch B5

4.3. Group C: 2.9D checkpoints part 1

Batch C1

Batch C2

Batch C3

Batch C4

Batch C5

4.4. Group D: 2.9D checkpoints part 2

Batch D1

Batch D2

Batch D3

Batch D4

Batch D5

4.5. Group E: P checkpoints

Batch E1

Batch E2

Batch E3

Batch E4

Batch E5

Note: the model deliberate_v2 was accidentally in group C as well as E.

5. Conclusions

Some embeddings which I feel were struggling a bit more to achieve good looking results than others are: j3nn1f3rl0v3h3w1tt, alexistxass.

In retrospect I find some results of 2.9D checkpoints more belonging to the 2.5D category, particularly beautyfool_v20, the two fcradianceapexRadiance checkpoints and madtoons_v10 contain some images that are less realistic.

The embeddings do not seem to have much effect on the first two models in group A: abyssorangemix2_Hard and abussorangemix3AOM3_aom3a1b . Therefore, in order to achieve distinct faces, it would not make a much sense to use embeddings given the settings at the beginning of this article. To a lesser extent the same seems to be true for meinaunreal_v41, and some other checkpoints seem to influence the outcome of the face, you could call it a bias. One checkpoint I found to have a particularly strong bias towards a specific face apart from the embeddings is meichidarkMix. To a lesser extent the same is true for realdosmix_ , forgottenmixCartoon2_v10 and hello25dvintageanime_v20.

Embeddings that stood out to me for producing recognizable women across most checkpoints are Daisy Ridley, Cameron Diaz and Candice Owens, which goes to the credit of those embeddings. Another embedding that stood out to me as particularly distinct was AdeliaChz.

Please keep in mind that tweaking the settings can impact on the outcome, and that the results for a given model and embedding may improve.

6. References

6.1. List of checkpoints

6.1.1 Group A - 2D and 3D

Category 2D

Category 3D

6.1.2 Group B - 2.5D

6.1.3. Group C - 2.9D part 1

6.1.4. Group D - 2.9D part 2

6.1.5. Group E - P

6.2. List of embeddings

6.2.1. Group 1

6.2.2. Group 2

6.2.3. Group 3

6.2.4. Group 4

6.2.4. Group 5

27

Comments