home models images videos posts articles challenges events updates shop

Seiichi Kinoshita - LTX2 (SHIROBAKO)

Name: Seiichi Kinoshita - LTX2 (SHIROBAKO)
Rating: 5 (14 reviews)
Author: tosermepls

Updated: Feb 14, 2026

character

audio chubby ltx2 i2v t2v

Verified: 2 months ago

SafeTensor

Seiichi Kinoshita LORAs

Collection - 5 items

Details

Type	LoRA
Stats	92 0
Reviews	Positive (14)
Published	Feb 14, 2026
Base Model	LTXV2
Usage Tips	Strength: 1
Trigger Words	A chubby man with glasses. He is wearing a blue polo shirt. Anime style.
Hash	AutoV2 2109387711

1 File

About this version

My first attempt at a LTX2 character LoRA. Trained on Musubi Tuner LTX2 Fork. Big Thanks to AkaneTendo25 for the amazing fork.

This is a beta release. I already have ideas for further improvements but I am quite happy with results so far.

The Lora can do I2V but it was trained with default I2V support (ltx2_first_frame_conditioning_p 0.1) which means I2V isn't great. But this is also because LTX2 in general isn't currently work great with 2D flat animation.

Source I2V images come from my Seiichi Illustrious LoRA (on my profile).

default creator card background decoration

tosermepls

Main prompt structure: A chubby man with glasses. He is wearing a blue polo shirt. He says in Japanese "Kon'nichiwa, hajimemashite". Medium shot. Anime style.

Note: You can further enhance the prompt, LTX2 likes long detailed prompts.

Recommended models:
- Lora tested on LTX2-19b-DEV-FP8 + LTX2 Distilled Lora + abliterated Gemma3 text encoder but it should work on all versions of LTX2/Gemma3

Workflow examples for I2V and T2V + explanations on parameters can be found here: https://civitai.com/articles/26140/

Recommended generation settings:
- Sampler: (1st and 2nd stage): Euler
- Resolution: wide aspect ratio (e.g. 1280x704 / 1920x1024) is preferred, other resolutions like 1:1 square (1024x1024) or portrait (768x1024) will also work but may cause problems
- Frame rate: I recommend 25 by default, however, using higher frame rate can increase motion and reduce blurry artifacts
- CFG: 1 Steps: 8

Note: CFG/Steps will depend on what kind of workflows, models and Loras you use (e.g. Distilled). Refer to gallery videos comments for examples and parameter recommendation from the linked article.

Prompt tips (green - positive, red - negative):
- Anime style - required to keep the style, you can make him fully realistic if you remove this tag or use Realistic instead
- For correct Japanese dialogue use He says in Japanese " " and put text as romaji in the quotes
- Shot types: Close-up shot, Medium close-up shot, Medium wide shot, Wide shot
- He is wearing a grey cap, a white and green striped polo shirt over a blue undershirt - alternative clothes

I2V support:

- The Lora works with I2V, however, do not expect amazing results. It was not trained with I2V focus and LTX2 right now does not perform great with flat 2D styles either.

Most of the I2V source images can be found on my Seiichi Kinoshita Illustrious page.

Known issues:

The Lora is currently biased towards close-up/medium shots by default. Solution: provide more details about what the character is wearing (e.g. trousers, shoes) to force wider shots or change resolution to square/portrait
Will occasionally generate in realistic style. Solution: either change seed or add "Anime style animation" or "Animated in Anime style" in the prompt
In weird/unusual angles will sometimes lose character consistency and become skinny Solution: add fat/obese to describe him

Training info:

Trained on 82 videos and 8 images using the Musubi LTX2 fork: https://github.com/AkaneTendo25/musubi-tuner
Training buckets used - images: 960x512 / 864x864 / 1152x640 || videos: 736x416 / 800x448
Frame length: between 25 and 121 frame videos
Training time: 7000 steps total, Lora taken from 5300 epoch
Training parameters below extracted from Lora metadata:

"ss_logit_mean": "0.0",
"ss_steps": "5300",
"ss_logit_std": "1.0",
"ss_max_grad_norm": "1.0",
"ss_discrete_flow_shift": "1.0",
"ss_fp8_base": "True",
"ss_base_model_version": "ltx2_v1",
"ss_optimizer": "bitsandbytes.optim.adamw.AdamW8bit(weight_decay=0.0001)",
"ss_max_train_steps": "7000",
"ss_network_dropout": "0.1",
"ss_gradient_accumulation_steps": "1",
"ss_mixed_precision": "bf16",
"ss_sd_model_name": "ltx-2-19b-dev.safetensors",
"ss_guidance_scale": "1.0",
"ss_full_bf16": "False",
"ss_vae_name": "ltx-2-19b-dev.safetensors",
"ss_gradient_checkpointing_cpu_offload": "False",
"ss_network_dim": "32",
"ss_network_alpha": "32.0",
"ss_lr_scheduler": "constant_with_warmup",
"ss_timestep_sampling": "sigmoid",
"ss_full_fp16": "False",
"ss_epoch": "58",
"ss_sigmoid_scale": "1.0",
"ss_lr_warmup_steps": "100",
"modelspec.title": "NewEraV14Audio",
"ss_learning_rate": "0.0001",
"ss_gradient_checkpointing": "True",