Type | |
Stats | 1,485 |
Reviews | (124) |
Published | May 14, 2024 |
Base Model | |
Hash | AutoV2 12EB7EFDDD |
==========================================
Prompt Suggestion
Movie Still Generation
Positive Prompt:
upperbody/fullbody realistic photo of
Negative Prompt:
anime, cartoon, graphic, text, painting, crayon, graphite, abstract, glitch, deformed, mutated, ugly, disfigured, noise background, worst quality, worst anatomy, distortion, low quality
cfg: 4
sampler: dpm++ 3m sde
steps: 30
Text Generation
Positive Prompt:
blurry foreground with text "{text}" {main subject}
Negative Prompt:
worst quality, worst anatomy, distortion, low quality
cfg: 4
sampler: dpm++ 3m sde
steps: 30
==========================================
20240515 version 3.0
Trained with movie still images, manually picked up aesthetic images.
Improve "Chinese", "Traditional cloth", etc
This version finally achieve the shadow and lighting effect of what I want.
So, version jumped to 3.0
Something got nerf due to this training:
text ability
hands
eyes
Might fix above with new fixing in further version.
20240510 version 2.6
This version is trained on generated images by 2 pass workflow, pixart-sigma2openxl2.5
Mainly improved shadow and light
Maintain the same level of text generation as before
Fixed "borning" standing pose due to version 2.5 training
20240504 version 2.5
Creative photo was added as a stylish tag.
The following version would continue improve this tag.
20240502 version 2.4b
Slightly improve text accurate. Most of time would be improved compare to 2.4a. But a few time the 2.4a still better than 2.4b.
Adjust photorealistic generation.
please read the suggestion of 2.4a for text generation.
2.4c might be a dpo on top of 2.4b.
20240428 version 2.4a
Focus on text generation, suggested prompt for text generation:
Positive Prompt:
blurry foreground with text "{text}" {main subject}
Negative Prompt:
worst quality, worst anatomy, distortion
cfg: 3.5
sampler: dpm++ 3m sde
using align your steps: 10
not using align your steps: 30
Reminder: version 2.4a is alpha of openxl2.4. It might have many version based on 2.4.
20240425 version 2.3e
Improve shadow and light
Improve face detail
20240423 version 2.3c
restore clip to version 2.2 which perform better
20240422 version 2.3
Trained with PAG generated images from version 2.2
Improve structure, anatomy, skin color etc
Might slightly impact the text generation.
20240417 version 2.2
mainly improve fingers
slightly improve shadow
20240415 version 2.1
Adjusted skin and shadow
slightly improved anatomy
20240412 version 2.0
Fully retrained from sdxl base, multi round training
dataset:
a few anime images, fashion images, filtered pickscore dataset, 4k video captures, cosplay photo, nvidia inthewild dataset, etc
Trigger words:
anime artwork, fashion photo, cosplay photo, raw photo, cotton doll, woman, man, etc
To achieve realistic images, please use raw photo of at the beginning and don't use something like unity, cg, etc
To achieve cute image, might try to add cotton doll to get a shape of cotton doll
To generate woman, please use woman rather than 1girl. It would usually generate a girl when using 1girl.
Merged list:
sdxl dpo lora
openxlv1.4
--kohaku alpha and beta
No animagine v3 and pony diffusion in merge
Please beware, chinese woman, chinese traditional cloth, something related to chinese race extended weird sdxl chinese biases. It would be improved in further version. But now, please don't use this tag to generate realistic image.
20240323 version 2.0 beta
20231229 Version 1.4 Human Preference Improvement
Finally, before 2024 version 1.4 is made.
Trained with pickapicv2 dataset with 4000 filtered dataset.
Aims to improvement the aesthetic, realistic, pupil, shadow and light, composition etc.
It is a overall improvement compared to old version.
If any want to use turbo version, I suggest use the turbo lora or lcm lora with is more efficent than I merge with the lora or model.
Appreciate comment or image post. Thank you.
20231201 Version 1.3 Turbo Merge And Female Faces Adjustment
Merge with SDXL Turbo to provide quality output with 10 steps fast generation.
Adjust female face details such as shadow, lips, contour, etc
Openxl v1.3 turbo suggested generation config:
Steps: 10
Cfg: 1~5 suggested 2
Sampler: dpmpp_3m_sde
Scheduler: sgm_uniform
Full version output would be slightly different than the turbo.
It is suggested to use turbo version as a fast generation and full version for the quality.
20231128 Version 1.2 Realistic Shadow and Eyes Generation Improved Version
Mainly adjusted the realistic shadow and improved realistic eyes generation. Reduce the affect of mixing anime model.
20231127 Version 1.1 Hands and Anime Improved Version
Version 1.1 is the first version merged with anime model aims to improved anime style.
All merged checkpoints would be added at end of description.
Aside of anime model, another big improvement is the hands generation.
It trained with a few of hands dataset using llm for captioning.
Carefully fine tune and tested with various checkpoint and
Merged with a lora using LECO tech from their recent paper.
Test result:
70% exactly 5 fingers in 100 generation of waving hands test.
Test prompts:
Positive:
good hands, photograph of a beautiful woman waving hands for her boyfriend
Negative:
pool drawing hands, unfinished drawing hands, sketch, abstraction, anime
Road map:
Finished:
Hands Generation v1.1
Anime Style v1.1
Realistic Shadow v1.2
Eyes Generation v1.2
SDXL Turbo Merge v1.3
Female Face Adjustment v1.3
Further Development:
Faces
Pose
Expression
Age group
Specific Anime Character
Cosplay Costume
Artstyle
===========================================================
Training Method:
The newest update has used various training method, including:
Quality training from Meta emu
Descriptive caption from Openai Dalle3
Direct fine tune
etc
The training dataset didn't include any image from nijijourney. I don't like the niji style much.
This checkpoint aims to as an improved version of SDXL which could provide various style.
User Instruction:
Aspect Ratio:
SDXL standard aspect ratio, please avoid to use 512*512, 512*768 those SD1.5 width height to generate images.
Prompt Style:
[Style word] [description] [supporting word]
It is recommanded to use above format to generate image in certain style.
Because SDXL is capable to generate in various style, it should state the style before your subject to control the image style.
If it is not enough to generate certain style, please use neg prompt to state the style you don't want.
For example:
Pos:
photo of an anime pikachu playing basketball in a realistic wordon, a closed laptop on a desk, detailed background
Neg:
white background, 3d render
It is not suggested to use a huge combination of negative prompt which used in SD1.5.
You might want to try with or without the negative prompt to see the different.
Classifier Free Guidance (CFG):
It is recommended to use 2.5~5.5 cfg.
Sampler:
It is recommanded 3m sde gpu.
Scheduler:
It is recommanded karras.
Steps:
25~40
Just try it for various prompts and please share the image🖼️ and feedback📓 if you like it.
Thank you❤️.
Contact Method:
Wechat:
fkdeai
===========================================================
Merge List:
20231127 version 1.1
Kohaku-XL beta 6.9
https://civitai.com/models/162577?modelVersionId=203416
Kohaku-XL alpha nyan
https://civitai.com/models/136389/kohaku-xl-alpha
SDXL Cross Style Hand Fixing Lora
https://civitai.com/models/211577/sdxl-cross-style-hand-fixing-lora?modelVersionId=238349