Why Kandinsky for Text-to-Video?
Kandinsky turns written prompts into short video clips using a keyframe-plus-interpolation approach. Because it’s built on a strong text-to-image backbone, frames have solid composition and style before motion is added. It’s well-suited for quick, stylized scenes, fast prototyping, and smooth illustrative motion without heavy compute.
The workflow JSON is attached below, or you can try it directly on Floyo for free. No setup is needed: everything is pre-installed and ready to run.
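To make the keyframe-plus-interpolation idea concrete, here is a toy sketch in Python. It is only an illustration, not Kandinsky's actual pipeline: real in-between frames come from a learned model, while this example uses plain linear blending as a stand-in. The frame representation (a flat list of numbers) and the names interpolate and expand_keyframes are hypothetical and chosen for illustration.

```python
from typing import List

# Toy stand-in for an image: a flat list of pixel intensities.
Frame = List[float]


def interpolate(a: Frame, b: Frame, steps: int) -> List[Frame]:
    """Return `steps` in-between frames, blended linearly from a toward b.

    Assumption: linear blending replaces the learned interpolation
    a real text-to-video model would use.
    """
    frames = []
    for i in range(1, steps + 1):
        t = i / (steps + 1)  # blend weight, strictly between 0 and 1
        frames.append([(1 - t) * x + t * y for x, y in zip(a, b)])
    return frames


def expand_keyframes(keyframes: List[Frame], steps: int) -> List[Frame]:
    """Build a full clip: every keyframe plus `steps` frames between neighbors."""
    if not keyframes:
        return []
    clip = [keyframes[0]]
    for prev, nxt in zip(keyframes, keyframes[1:]):
        clip.extend(interpolate(prev, nxt, steps))
        clip.append(nxt)
    return clip


if __name__ == "__main__":
    # Two one-pixel keyframes with three in-between frames.
    print(expand_keyframes([[0.0], [1.0]], steps=3))
```

The point of the sketch is the division of labor: composition and style are fixed at the keyframes, and motion is filled in between them afterward.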
Key Inputs
Prompt – Describe the subject, action, and environment
Clip Length – Typically 4–12 seconds
Output Resolution – Usually 480p–720p, depending on the model variant
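If you want to sanity-check these inputs before queuing a run, a small helper like the one below can flag values outside the typical ranges listed above. This is a minimal sketch and not part of the workflow itself: the ClipRequest class, the field names, and the default of 24 frames per second are all assumptions made for illustration, so check the actual model variant for its real frame rate and limits.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClipRequest:
    """Hypothetical container for the three key inputs."""
    prompt: str          # subject, action, and environment
    clip_seconds: float  # clip length in seconds
    height: int          # vertical resolution, e.g. 480 for 480p


def check_request(req: ClipRequest) -> List[str]:
    """Return notes for inputs outside the typical ranges (empty list if none)."""
    notes = []
    if not req.prompt.strip():
        notes.append("prompt is empty")
    if not 4 <= req.clip_seconds <= 12:
        notes.append("clip length is outside the typical 4-12 second range")
    if not 480 <= req.height <= 720:
        notes.append("resolution is outside the usual 480p-720p range")
    return notes


def frame_count(req: ClipRequest, fps: int = 24) -> int:
    """Estimate total frames. The 24 fps default is an assumption."""
    return round(req.clip_seconds * fps)


if __name__ == "__main__":
    req = ClipRequest("a red fox running through fresh snow at dawn", 6, 720)
    print(check_request(req), frame_count(req))
```

The helper returns notes rather than raising errors because the ranges above are typical values, not hard limits.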
Use Cases
Short stylized scenes or B‑roll from text descriptions, especially where an illustrative look is acceptable or desired.
Prototyping shot ideas before committing to heavier models: prompts stay the same while you refine actions, pacing, and framing.
Educational or explainer snippets that benefit from colorful, smooth motion rather than strict photorealism.


