Um... Strictly speaking it's not stable (two-person image).
The main training set images come from the anime screenshots of the theatrical version.
Maybe there will be a new version in the future?
I think using the following prompt words as negative may improve the stability of the two-person image.
(blonde hair:0.5),(yellow eyes:0.5),
style
anime screenshot