Mildly organised template for doing Stable Diffusion txt2img, then sending that to Stable Video Diffusion, then sending that to RiFE.
Groups are placed less than ideally because the wires display over the text instead of under, so I was trying to avoid that. If that didn't happen I would have made the groups placement more efficient and logical.
Set settings for
Stable Video Diffusion,
Video Output. Since
Stable Video Diffusiondoesn't accept text inputs, the image needs to come from somewhere else, or it needs to be generated with another model like
Stable Diffusion v1.5.
Send latent to
Send decoded latent to
Stable Video Diffusion img2vid Conditioning
Send conditioned latent to
Send decoded latents to