Sign In

LTX-2 First Last Frame: Controllable Video & Audio Generation

Updated: Mar 23, 2026

toolltx-2

Type

Workflows

Stats

83

0

Reviews

Published

Mar 23, 2026

Base Model

LTXV2

Hash

AutoV2
1FAB5CA7B3
default creator card background decoration
RunComfy's Avatar

RunComfy

🚀 Create flawless, audio-synced cinematic video transitions from just a start and end frame.

▶️ Run Directly in Cloud:
https://www.runcomfy.com/comfyui-workflows/ltx-2-first-last-frame-in-comfyui-audio-visual-motion-control?utm_source=civitai


💡 Overview

LTX-2 First Last Frame is a powerful ComfyUI workflow tailored for creators who demand precise cinematic control. Define your starting frame and your ending frame, and the pipeline will seamlessly generate the motion between them—complete with synchronized audio and visuals in a single pass.

By conditioning on both boundaries (with an optional guiding middle frame), the workflow perfectly preserves your subject's identity, framing, and lighting. It’s the ultimate tool for executing narrative beats, flawless scene transitions, and complex camera movements where temporal continuity and audio sync are an absolute must.

✨ Key Features

  • Absolute Motion Control: Lock in your first and last frames; the workflow handles the smooth transition in between without identity loss.

  • 1-Pass Audio & Video: Utilizes the LTXV Audio VAE to generate perfectly synchronized sound effects, dialogue, or ambience alongside your visual action.

  • Dynamic Camera Trajectories: Fully compatible with camera LoRAs, allowing you to easily execute Dolly In/Out, Jib Up/Down, and Static shots.

  • Integrated 2X Upscale: Features a built-in spatial upscaling pass to cleanly resolve complex lighting, refine background elements, and deliver crisp, high-fidelity micro-details.

🚀 Getting Started

  1. Model Setup: The core engine is LTX-2 19B (dev). (Note: For machines under 2x Large specs, please ensure the fp8 safetensors model is selected to avoid out-of-memory errors.)

  2. Prompting: Describe the scene action in your positive prompt. List any unwanted characteristics in the negative prompt.

  3. Configure Control: Upload your start and end images. Fine-tune the first_strength and last_strength nodes to dictate how strictly the workflow adheres to your frame references.

  4. Generate: Execute the prompt. The workflow will base sample an AV latent, run a targeted upscale, and automatically mux the decoded frames into a polished, ready-to-use MP4 video.


Click the "Run Directly" link above to bypass local setup and test this workflow immediately in your browser.