Updated: Jun 12, 2026
toolTurns your videos into smart identity-aware masterpieces.
Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: one-click results with zero tuning - you still choose inputs, prompts, and settings.
Open preloaded workflow on RunComfy
Open preloaded workflow on RunComfy (browser)
Why RunComfy first
- Fewer missing-node surprises - run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout - useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON - the zip follows the same runnable workflow you can open on RunComfy.
When downloading for local ComfyUI makes sense - you want full control over models on disk, batch scripting, or offline runs.
How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.
Expectations - First run may pull large weights; cloud runs may require a free RunComfy account.
Overview
Create identity-preserving, reference-guided videos with the Bernini workflow for ComfyUI. Blend video, image, and prompt inputs for seamless editing and transformation. Achieve motion-consistent results while keeping subject integrity. Perfect for creators seeking detailed, camera-aware short video outputs. This workflow merges text and visual cues for intelligent, adaptable video generation.
Important nodes:
Key nodes in Comfyui Bernini multimodal video generation and editing workflow
BerniniConditioning (#34)
Builds Bernini‑native conditioning by combining your text encodings, VAE, source video, and reference imagery. It also prepares the starting latent volume and handles spatial and temporal sizing. Tune width and height to match your target resolution and use length to control the number of frames generated. If the reference subject is small in the image, increase ref_max_size so the model better perceives identity details. This node is part of the Bernini custom pack: ComfyUI‑RH‑Bernini.
LoraLoaderModelOnly (#11)
Applies the LightX2V high_noise LoRA to the HIGH backbone. Raising its strength_model increases adherence to the reference at the structural stage, useful when the subject’s silhouette or coarse features do not match the source video. Lower it if the edit becomes too rigid or suppresses natural motion. Use in tandem with the LOW‑stage LoRA to balance fidelity and fluidity.
LoraLoaderModelOnly (#29)
Applies the LightX2V low_noise LoRA to the LOW backbone. This LoRA refines textures like hair, skin, and clothing while keeping the motion set by the HIGH stage. If identity details drift between frames, increase strength slightly; if textures oversharpen or look overfit, reduce it. Together with the HIGH‑stage LoRA it forms a complementary pair.
SplitSigmas (#17)
Divides the denoising schedule into HIGH and LOW ranges. Moving the split earlier yields gentler edits that keep more of the original video, while moving it later grants the HIGH stage more influence for stronger replacements. Adjust the split when you change prompts or LoRA strengths so both stages remain balanced. This control is especially helpful for camera‑locked, motion‑preserving edits.
…
Notes
Bernini Multimodal Video Generation in ComfyUI | Video Edit + Identity - see RunComfy page for the latest node requirements.

