
[Anima] preference optimization RL (hands, text, aesthetic etc.)

Rating: 5 (50 reviews)
Author: levzzz

Updated: Jun 24, 2026

concept

File: SafeTensor, 8.37 MB (1 variant available)
Verified: a month ago

Details

Type: LoRA
Stats: 335, 257, 224
Reviews: Very Positive (50)
Published: Jun 22, 2026
Base Model: Anima
Training: Steps: 100, Epochs: 4
Hash (AutoV2): 0FBD22D370

Creator: levzzz, joined Feb 15, 2024

License: Anima

The Anima Model is licensed by CircleStone Labs LLC. Copyright CircleStone Labs LLC. IN NO EVENT SHALL CIRCLESTONE LABS LLC BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.

Built on NVIDIA Cosmos

This is my first attempt at preference optimization RL. This version was trained on only 25 samples (I tried to collect more, but picking them by hand is painful). It should slightly improve anatomy, text and aesthetics. I will implement proper DPO and release a new version later; for now this is the best I could do.
Works with flash as well.

Update: it turns out the quality of this LoRA is currently limited by the dataset. A better loss didn't produce better results, so unfortunately I have a lot more work to do.
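
For readers unfamiliar with the DPO objective mentioned in the description, here is a minimal sketch of the generic DPO loss for a single preference pair (one picked sample, one rejected sample). It is not the training code used for this LoRA. The function names, the `beta` value and the example numbers are all illustrative assumptions. For diffusion models the log-probabilities are usually replaced by a proxy such as a negative denoising error, and that substitution is not shown here.

```python
import math


def softplus(x: float) -> float:
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_loss(
    policy_logp_chosen: float,
    policy_logp_rejected: float,
    ref_logp_chosen: float,
    ref_logp_rejected: float,
    beta: float = 0.1,  # assumed value; controls how far the policy may drift from the reference
) -> float:
    """Generic DPO loss for one (chosen, rejected) pair."""
    # How much more likely each sample is under the trained model than under the frozen reference.
    chosen_margin = policy_logp_chosen - ref_logp_chosen
    rejected_margin = policy_logp_rejected - ref_logp_rejected
    logits = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(logits)) == softplus(-logits)
    return softplus(-logits)


if __name__ == "__main__":
    # Hypothetical log-probabilities, purely for illustration.
    untrained = dpo_loss(-10.0, -10.0, -10.0, -10.0)
    improved = dpo_loss(-8.0, -12.0, -10.0, -10.0)
    print(untrained, improved)
```

The loss starts at log 2 when the trained model matches the reference and falls as the model assigns relatively more probability to the picked sample than to the rejected one. With a dataset as small as 25 hand-picked samples, the data can easily be the bottleneck regardless of the loss, which is consistent with the update above.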