Type | |
Stats | 747 |
Reviews | (94) |
Published | Oct 27, 2024 |
Base Model | |
Hash | AutoV2 A43CFD37A2 |
Deep under a mountain lives a sleeping giant, capable of either helping humanity or bringing destruction...
A Colossus arises...
After my SDXL series, it's time for the FLUX series of this project... This time I trained this thing from the ground up. For training I used my own images, which I created with my schnell Flux model DemonFlux/Colossus Project schnell plus my SDXL Colossus Project 12 as refiner.
This SD Flux checkpoint can produce nearly anything.. Colossus is very good at creating extremely realistic pictures, anime and art.
If you like it, feel free to give me some feedback. If you want to support me, you can do so here. I have spent some money just to build a computer that is capable of actually training this model.
https://ko-fi.com/afroman4peace
Version 2.1_de-distilled_experimental (MERGE)
This version is completely different and actually works differently from a normal Flux model!
It's an experimental merge between my version 2.0 and a de-distilled version: https://huggingface.co/nyanko7/flux-dev-de-distill. This happened a bit by accident, but the results are mind-blowing. You will get an extreme amount of detail, and it also follows prompts extremely well... So the next thing I am going to do is train on the de-distilled model directly. I have already done some test LoRAs with it. This is highly experimental, so please let me know if you find errors that are not listed below. If you have good images, post them.. and post the bad ones too, this can help improve things :-). Maybe also try version 2.0 and tell me which type of checkpoint suits you best.
!Attention!
The normal Flux workflow doesn't work with this version. YOU NEED to download my workflow for it!
You can also figure something out yourself, but please don't blame me for bad images. Also, this is a highly experimental model... check the downsides below..
Upsides and downsides of this checkpoint:
Well, this checkpoint can create extreme details.. This comes at a price.. It's slow compared to the normal Flux checkpoints. The upside is that you often don't need an additional upscale anymore. Instead of the Flux Guidance, this model uses the cfg scale, which also means that it will not work with standard workflows.
You can use negative prompts! This helps to keep stuff you don't want out of the image.
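For anyone wondering why the cfg scale makes negative prompts possible: here is a minimal sketch of the standard classifier-free guidance formula. It assumes this checkpoint is sampled the usual way, with one model evaluation for the positive prompt and one for the negative prompt per step. The function name and the toy numbers are hypothetical and not taken from my workflow.

```python
def apply_cfg(cond, uncond, cfg_scale):
    """Combine two noise predictions with the standard CFG formula.

    cond   -- prediction for the positive prompt
    uncond -- prediction for the negative (or empty) prompt
    """
    # Push the result away from the negative prediction, towards the positive one.
    return [u + cfg_scale * (c - u) for c, u in zip(cond, uncond)]


# Toy values, purely illustrative (real predictions are large tensors).
positive = [0.5, -0.25, 1.0]
negative = [0.25, 0.0, 1.0]

guided = apply_cfg(positive, negative, cfg_scale=3.0)
same_as_positive = apply_cfg(positive, negative, cfg_scale=1.0)
```

In this formula a cfg of 1 gives back the positive prediction unchanged, so the negative prompt only has an effect above 1. The second model evaluation per step is probably also one reason for the slower speed.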
Sometimes artifacts can appear.. You can solve this with a small and simple upscale (I am working on this). Here is an example.. strangely, this doesn't happen with every seed.. UPDATE: This is not an issue with the model itself.. it's more of a workflow one.. I am working on a fix for it. If this happens, you can try setting the first upscale to 1.14 instead of 1.2.
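To get a feeling for what the smaller upscale factor means in pixels, here is a small sketch. The 1024x1024 base size and the simple rounding are assumptions for illustration only; the workflow may use a different base resolution or rounding.

```python
def upscaled_size(width, height, factor):
    """Return the image size after upscaling by the given factor."""
    return round(width * factor), round(height * factor)


BASE = (1024, 1024)  # assumed base resolution, not taken from the workflow

default_size = upscaled_size(*BASE, 1.2)    # the workflow's usual first upscale
fallback_size = upscaled_size(*BASE, 1.14)  # the value to try when artifacts appear
print(default_size, fallback_size)
```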
Settings and Workflow V2.1:
Here you can find the workflow for it: https://civitai.com/articles/8419
Settings: unlike normal Flux, it doesn't need the Flux Guidance scale. Use the cfg instead. I mostly use a cfg of 3 for the workflow.. Some images may require lower cfg scales.
The most important thing is probably to shut off the Flux Guidance scale..
Without the workflow I have tested it with 30 steps and a cfg of 1-3. These are probably also the settings for Forge. Try experimenting here.
I recommend using the word "blurry" in the negatives
Sampler and scheduler:
You can pick from a range of working samplers:
Euler, Heun, DPM++2M, deis and DDIM are working great.
I mostly used "simple" as scheduler
If you find better settings tell me.. :-)
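The V2.1 settings above can be collected in one place. This is only a sketch: the class, the field names and the `check` helper are hypothetical, and the sampler identifiers are spelled the way I assume ComfyUI names them.

```python
from dataclasses import dataclass

# Samplers reported as working; identifier spellings are an assumption.
WORKING_SAMPLERS = {"euler", "heun", "dpmpp_2m", "deis", "ddim"}


@dataclass
class V21Settings:
    steps: int = 30
    cfg: float = 3.0
    sampler: str = "euler"
    scheduler: str = "simple"
    negative_prompt: str = "blurry"
    use_flux_guidance: bool = False  # must stay off for this version


def check(settings):
    """Return a list of hints for settings that differ from the tested ones."""
    notes = []
    if settings.use_flux_guidance:
        notes.append("turn off the Flux Guidance scale and use cfg instead")
    if not 1.0 <= settings.cfg <= 3.0:
        notes.append("cfg is outside the tested 1-3 range")
    if settings.sampler not in WORKING_SAMPLERS:
        notes.append("sampler is not in the list reported as working")
    if "blurry" not in settings.negative_prompt.lower():
        notes.append('consider adding "blurry" to the negatives')
    return notes


defaults_ok = check(V21Settings())
flux_style = check(V21Settings(cfg=7.0, use_flux_guidance=True, negative_prompt=""))
```

The defaults produce no hints. The second example shows a Flux-style setup, which triggers three of them.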
For Forge I recommend using the AIO model.. here is an example setting for Forge
Version 2.0_dev_experimental
Well.. this is an experimental version.. The goal was to create a more coherent and faster model. I trained some additional LoRAs of my own, merged them in, and then merged the resulting models in a special way (tensor merge). It has a custom T5xxl which I have modified with "Attention Seeker". To gain speed and additional quality I have merged in the Hyper Flux LoRA from ByteDance. This means that the working range has shifted.. Let me show you what this means.. Here is the main title image..
16 steps V 2.0
30 steps V 1.0
Downsides:
Well, first.. this version is a bit bigger than the last one.. Second, I still have to create the UNET-only version. I will update this when it's done..
Settings and Workflow V2.0:
You can now run the model with fewer steps.. 16 steps equal 30 steps of the old model.
I still recommend using around 20-30 steps because it will get you more quality in most cases.
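As a quick back-of-the-envelope sketch of what that means: the two step counts are the ones quoted above, and everything else (variable names, the idea that the time per step stays the same) is an assumption.

```python
V1_STEPS = 30  # steps used for the V1.0 comparison image
V2_STEPS = 16  # steps V2.0 needs for a comparable result

# Fraction of sampling steps saved, assuming a similar time per step.
saving = 1 - V2_STEPS / V1_STEPS
print(f"{saving:.0%} fewer steps")
```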
Sampler: I prefer Euler with Simple as scheduler. The guidance can be set from 1.5-3 (feel free to test outside this range, of course). A guidance of 1.8 still works well for realistic images. You can also try other samplers: DPM++2M and Heun also work great.
Workflow 2.0:
I have created a new workflow for V2.0 and V1.0. It includes the new Flux Prompt Generator. Additionally, I got the second upscaler stage working. https://civitai.com/articles/7946
Forge:
I have also tested this model with Forge and it worked very well.. The images may differ between ComfyUI and Forge, though..
Version 1.0_dev_beta:
This model is my first entry in the series, so please give me some feedback and post some images. This helps me improve the project further. There are several versions to choose from. The best model in terms of quality is the FP16 version. However, the FP16 version is huge and needs a beefy graphics card and lots of RAM. The FP8 version is the one I consider a good compromise between quality and performance. If you want a GGUF version, download the Q8_0. The GGUF Q4_0/4.1 versions were a request. They are small in size, but you will lose some quality.
There are basically two types of my models. The "All in one" models need only one file to download: the Clip_l, T5xxl fp8 and the VAE are baked in (see below). Place the file inside your checkpoints folder.
The other versions are the UNET-only ones. Here you need to load all files separately.
In any case you need to download my Clip_L for those to get them working right..
It is also important to choose the right T5xxl clip. For the FP8 version it is the fp8_e4m3fn T5xxl clip; for the FP16 version it is the FP16 clip. Make sure to select the default weight type (below is an example image for the fp8 version).
For the GGUF version you need the GGUF loader!
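The rules above for which files go with which version can be written down as a small lookup. This is just a sketch: the function name, the dictionary keys and the "loader" labels are hypothetical. The description doesn't say which T5xxl belongs to the GGUF versions, so that entry is left empty.

```python
def companion_files(variant):
    """Return what has to be loaded next to the main model file."""
    key = variant.strip().lower()
    if key == "aio":
        # One file in the checkpoints folder; everything else is baked in.
        return {"clip_l": "baked in", "t5xxl": "baked in (fp8)",
                "vae": "baked in", "loader": "checkpoint"}

    # UNET-only versions: all files are loaded separately.
    files = {"clip_l": "author's Clip_L (240MB file)",
             "vae": "separate", "loader": "unet"}
    t5_by_variant = {"fp8": "fp8_e4m3fn", "fp16": "fp16"}
    if key in t5_by_variant:
        files["t5xxl"] = t5_by_variant[key]
    elif key.startswith("q"):  # GGUF quantisations such as Q8_0
        files["loader"] = "gguf"  # needs the GGUF loader
        files["t5xxl"] = None     # not specified in the description
    else:
        raise ValueError(f"unknown variant: {variant!r}")
    return files


fp8_files = companion_files("FP8")
gguf_files = companion_files("Q8_0")
```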
Some known things for now regarding V1.0:
This is just the first model of the series, so at the moment it might struggle with some prompts or styles like art. The next version will receive more training. Let me know about things the model can't do..
Settings and Workflow:
I have tested it with around 30 steps, Euler with Simple as scheduler. The guidance can be set from 1.5-3 (feel free to test outside this range, of course).
A guidance of 1.8 works well for realistic images.
Feel free to experiment with those settings.. If you get good results, please post them.
I have added the showcase images as training data.. The workflow for Comfy is included inside. Here is the workflow for download: https://civitai.com/articles/7946
"All in one" model:
UNET_only:
You need to download the clip_L as well. It's the 240MB file.