Sign In

[Anima] preference optimization RL (hands, text, aesthetic etc.)

0

Updated: Jun 24, 2026

concept

Download

1 variant available

SafeTensor

8.37 MB

Verified:

Type

LoRA

Stats

198

0

Reviews

Published

Jun 22, 2026

Base Model

Anima

Hash

AutoV2
0FBD22D370
default creator card background decoration
Acolyte Badge
levzzz's Avatar

levzzz

License:

Anima

The Anima Model is licensed by CircleStone Labs LLC. Copyright CircleStone Labs LLC. IN NO EVENT SHALL CIRCLESTONE LABS LLC BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.

Built on NVIDIA Cosmos

first time trying my hand at preference optimization RL... this one was trained at 25 samples (i tried to get more, it's painful to pick by hand). should have slightly improved anatomy, text and aesthetics. will implement proper DPO and make a new version later, for now this is the best i could do.
works with flash as well

update: turns out the quality of this lora is limited by the dataset, as of now. Better loss didn't produce better results. So, unfortunately i have to do a bunch more work.