Sign In

[SD1.5] OpenSolera - Anime Mix

69
1.3k
11
Updated: Jan 2, 2025
base modelanime2dgeneral use
Verified:
SafeTensor
Type
Checkpoint Merge
Stats
153
Reviews
Published
Dec 31, 2024
Base Model
SD 1.5
Usage Tips
Clip Skip: 1
Hash
AutoV2
09A9112BF3

(Make sure to check the comments for each model in Discussion before choosing a model. Some models are better at certain thing than others, and a2 [Pastel] is currently the most stylistically distinct model of this series.)

Personal model ranking (for image gen): a4=a2>=a5>=a7>a1>a3>a6

Personal model ranking (for img2img): a3=a2>=a7>a2>a5>a1>a6

What is this model?

For people familiar with my previous model: This model is a remake of my original Solera model for commercial use, based on many of the models used in it and additional models for more functionalities. Currently, a6 [Fleur] is the last version of the model until conditions are better. Until then, a1 and a4 are decent starting points, with a4 being closest in style

For people unfamiliar with my previous model: This is a merged Stable Diffusion 1.5 model meant for generating 2D-2.5D style anime pictures made from the following models:

Checkpoints:

LoRAs:

  • AIOMonsterGirls - LoRA/LyCORIS (CivitAI)

  • CluelessC's Hololive - v6.3-eps-resized-512 (HuggingFace)

  • (a3) Ken Sugimori style LoRA - v3_offset (CivitAI)

  • (a4 onward) Ken Sugimori style LoRA - v3 offset - ChatGPT caption (CivitAI)

  • (a4 onward) Better Half-closed Eyes LyCORIS (CivitAI)

The main goal of this model is to create a "jack-of-all-trades, master of one" anime model that can do almost anything. This model can output SFW and NSFW images, but current testing indicates that this model tends towards SFW more, with decent NSFW outputs.

All models have a baked-in VAE, Check the model details for what VAE was used.

How to use this model?

Prompting

Same as most other anime models, this model mainly uses booru-style prompts with no underscores (see Danbooru's Tag Groups wiki for more details and NovelAI's Basics page alongside looking at the tags and AI metadata on booru sites (e.g: AIBooru, Gelbooru, Danbooru,...) for usage), but can also use natural language in some cases.

When prompting this model, be specific and mention every detail you want to include. Basically, prompt it like Anything V5.

Also note the prompt parser used as well. This model is mainly tested against SD.Next's "Full attention" parser (as of a3), reForge's defaults (from a4 onward) and ComfyUI's default parser.

This model can generate good results using minimal positive quality prompts and negative quality prompts. To get a better idea on this, look at the example images.

  • Positive (optional): best quality, absurdres

  • Negative: (worst quality:1.4), (low quality:1.4) + any additional issues

Note: Most of the strengths listed here are suggestions. Do what you like, I don't take responsibility for your bad prompting or what you generate with this thing.

Recommendation: Do not use any negative embeddings with this model. Not only does it completely destroy the style that it has, it also makes prompting accuracy plummets off a cliff.

Sampler

This model is mainly tested against DPM++ 2M Karras, but DDIM, Euler ancestral, UniPC (with ComfyUI's "simple" scheduler) are other great options.

Step count is dependent on sampler, but 20 steps is a good start for most. For furry generations, lower steps works better.

CLIP Skip works best at 1-2, depending on each version.

Resolution

This model supports all three normal SD1.5 resolutions: 512x512, 512x768, 768x512

Each resolution deals with things differently, with prompt accuracy being best at 512x512, followed by 512x768, then 768x512. Details-wise, 768x512 is the best (imo) and 512x768 can generate at similar quality.

Post-processing

For best images, Hi-res fix (or latent upscaling) is recommended. Current recommendation is bicubic 2x upscaling.

For model upscaling, these are my personal recommendations:

x2: AniScale2 Omni + Refiner, x2 AniSD SPAN 190k + DeBleed

x4: 4x Remacri and x4-AnimeSharp

Q&A - Lightning Round:

Which model should I use? This or the original Solera?

Use this one. The original Solera has a couple of differences, but not significant. For legacy and testing purposes, I still keep SoleraV7 around for comparison. In most cases, I think this model would out-perform my original by quite a bit, if you can tolerate losing some contrast.

Why OpenSolera?

It's Solera, but open to everyone (including commercial uses). Pretty simple, really. Also differentiate it from the original Solera model. Internally, this model has another name: SoleraV9.

What happened to previous versions of Solera?

  • V1-V3: Lost to time. They're all very rudimentary merges of AbyssOrangeMix3 (specifically AOM3, Anything V5, and Hassaku.

  • V4: Don't like it that much, still exists but not really usable or useful.

  • V5: Absorbed too much of Pastel-Mix, completely destroyed the background. Was my main driver in 2023.

  • V6 (internal name SoleraRemix): Don't like it that much. It was a decent foundation for future merge ideas, but not much remains.

  • V7: Uploaded to CivitAI as SoleraV7-a9.

  • V8: Went nowhere. Like, actually nowhere. It was the same situation as V6

  • V9: Uploaded to CivitAi as OpenSolera. Incremental updates are marked as "Alpha" versions

  • V10: Not officially called SoleraV10 by me, but Alpha 4 [Noto] is basically that. The OpenSolera line continues from a4 onwards with new codename-based versioning. Currently stopped updating at a6 [Fleur].

Where did the name "Solera" came from?

From the Infinity Bottle: "An infinity bottle, also referred to as a "solera", "fractional", or "living" bottle, is your own personal blend of whiskeys that is ever changing. By simply combining a bit of your favorite whiskeys in an empty bottle (or by adding to a base whiskey) you get a personal, never-ending blend of your own creation. This blend will likely have never been possessed or enjoyed by anyone else, ever. As you continue to add more, the flavor profile of the bottle is always new, evolving, and taking on a life of its own."

I don't actually drink myself, but I find the concept interesting, so I adapted the name for my merges.

I want to use your model for...

See the license near my profile card for more information. For any other purpose, contact me on Discord: @azeluslightvale

If any of the original model owners have issues with this model (due to licensing or otherwise), please contact me there. I try to be as thorough as possible, but I might miss something.

Please also note that I don't check my Discord very often, if at all. If I take a few weeks to respond, please forgive me, I don't use social media. (T_T)

Where's the merge recipe?

Check the zip file labelled "Training Data" on the version you're downloading. The actual merge takes too long to list now.

Basically, it's 13+ steps with a lot of long fractional numbers. Can't really copy them over properly.

Do you actually use this model or are you just making them for clout?

Yes, I do use the models I create for making things (see my Chub.ai page for an example, all images comes from a version of Solera (see above) or OpenSolera for examples). I rarely generate things, what with lack of motivation or ideas. I don't want to post all the time, only when I want to.

Is this similar to SD-Silicon?

No. This model series does not use any form of automation (except XYZ Grid for testing) in merging. Goal-wise, though, they are quite similar.

Is this similar to @AnonymousM Basedxx series of models?

Weirdly enough, yes. It's not an attempt to clone their work or anything, but I did get inspired by them when making this series of models (especially in using the Hololive LoRA). Other than that, I'm not sure if there's any other similarities.