LTX2.3 All in one [SFW / NSFW] - Prompt Relay + ID LoRA + ControlNet + Detailer + Upscaler + Custom Audio + Keyframes

Rating: 5 (193 reviews)
Author: LatentHeart

Updated: May 18, 2026

Tags: tool, workflow, ltx2, ltx2.3

Details

Type: Workflows
Stats: 1,356
Reviews: Very Positive (56)
Published: May 15, 2026
Base Model: LTXV 2.3
Hash (AutoV2): EA0469AF72

About this version

This workflow currently supports three types of models:

Standard LTX 2.3 distilled
LTX 2.3 distilled GGUF
10Eros

💡 The model groups are self-contained. You can safely delete the entire group for any model you don't use without breaking the workflow. The remaining model groups work independently, with no additional changes needed.

This workflow is a modular and flexible text/image/audio-to-video generation system built in ComfyUI, designed to give full control over video creation using LTX-based models. It allows you to easily mix and match multiple generation modes such as text-to-video, image-to-video, lipsync, and fully guided animation by enabling or disabling grouped nodes.

📝 Personal notes:

The 10Eros model is better for NSFW content, and the standard model is better for SFW generations. The body movement of the 10Eros model can occasionally help with SFW content too, but as a general rule, stick to that split.
Try to always use 2-phase sampling (half res + 2x upscaler); it yields the best quality and character consistency. LTX is poor at preserving character ID, so don't make it worse with a single-pass generation. The upscaler model adds extra detail and improves character consistency, which is why I recommend using it.
Don't use the detailer when generating "amateur look" videos. It adds a light layer of detail to the final result, which most of the time looks too "polished" for a real amateur recording; amateur-style videos look more real when they look low quality.
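
To make the 2-phase recommendation concrete, here is a minimal sketch of the size arithmetic: the first pass samples at half the target resolution, and the 2x upscaler brings it back up. The function name and the rounding to a multiple of 32 are assumptions made for illustration (the workflow's own nodes may snap sizes differently); only the half-res + 2x relationship comes from the notes above.

```python
def two_phase_sizes(target_width, target_height, multiple=32):
    """Return ((first_pass_w, first_pass_h), (final_w, final_h)).

    Hypothetical helper: `multiple=32` is an assumed constraint,
    not a value taken from the workflow.
    """
    def snap(value):
        # Round down to the nearest multiple, but never below one multiple.
        return max(multiple, (value // multiple) * multiple)

    half_w = snap(target_width // 2)
    half_h = snap(target_height // 2)
    # The 2x upscaler doubles each side of the first-pass result.
    return (half_w, half_h), (half_w * 2, half_h * 2)


first_pass, final = two_phase_sizes(1280, 720)
print(first_pass)  # (640, 352)
print(final)       # (1280, 704)
```

The first pass covers a quarter of the pixels of the final frame, which is where the speed-up of half-res sampling comes from.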

Main features

GGUF support
Prompt relay for segmented prompts
NSFW prompt enhancer
Text, image, audio, and ControlNet-driven video generation
LoRA support (character, style, and voice via ID LoRA)
Custom or AI-generated audio with automatic syncing
Reference image + up to 7 keyframes (FFLF animation control)
ControlNet video guidance with hybrid reference support
Half-res sampling + 2× upscaling for faster high-quality results
LTX detailer for enhanced final output

Common Setups

Text to video:
All bypassers disabled + Prompt + Default audio
Image to video:
Prompt + Reference image + Default audio
Lipsync:
Prompt + Reference image + Custom audio
Audio to video:
Prompt + Custom audio only
Character LoRA + voice cloning:
Prompt + Character LoRA + ID LoRA + Default audio
Voice reference to video:
Prompt + ID LoRA + Default audio
OR
Prompt + ID LoRA + Reference image + Default audio
Character animation:
Prompt + ControlNet + Reference image + (Custom or Default audio)
First frame → last frame:
Prompt + Keyframe 1 + Keyframe 2 + (Custom or Default audio)
First → middle → last frame:
Prompt + Keyframe 1 + Keyframe 2 + Keyframe 3 + (Custom or Default audio)
Character animation with custom voice:
Prompt + Reference image + ID LoRA + ControlNet + Default audio
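
The setups above can also be read as a lookup table: each one is a set of enabled groups plus an audio source. The sketch below encodes that list in Python purely as an illustration. The group identifiers (`reference_image`, `id_lora`, and so on) and the `match_setups` helper are hypothetical names, not anything exposed by the workflow or by ComfyUI, and the prompt is left out because every setup uses it.

```python
from typing import NamedTuple


class Setup(NamedTuple):
    groups: frozenset  # enabled optional groups (the prompt is always present)
    audio: str         # "default", "custom", or "either"


# Transcribed from the Common Setups list; identifiers are illustrative only.
COMMON_SETUPS = {
    "text to video": Setup(frozenset(), "default"),
    "image to video": Setup(frozenset({"reference_image"}), "default"),
    "lipsync": Setup(frozenset({"reference_image"}), "custom"),
    "audio to video": Setup(frozenset(), "custom"),
    "character LoRA + voice cloning": Setup(
        frozenset({"character_lora", "id_lora"}), "default"),
    "voice reference to video": Setup(frozenset({"id_lora"}), "default"),
    "voice reference to video (with image)": Setup(
        frozenset({"id_lora", "reference_image"}), "default"),
    "character animation": Setup(
        frozenset({"controlnet", "reference_image"}), "either"),
    "first frame -> last frame": Setup(
        frozenset({"keyframe_1", "keyframe_2"}), "either"),
    "first -> middle -> last frame": Setup(
        frozenset({"keyframe_1", "keyframe_2", "keyframe_3"}), "either"),
    "character animation with custom voice": Setup(
        frozenset({"reference_image", "id_lora", "controlnet"}), "default"),
}


def match_setups(enabled_groups, audio):
    """Return the names of listed setups matching the enabled groups and audio."""
    enabled = frozenset(enabled_groups)
    return [
        name
        for name, setup in COMMON_SETUPS.items()
        if setup.groups == enabled and setup.audio in (audio, "either")
    ]


print(match_setups({"reference_image"}, "custom"))  # ['lipsync']
print(match_setups(set(), "default"))               # ['text to video']
```

A combination that returns an empty list is simply not one of the listed setups; it may still work in the workflow.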

Detailed instructions are contained in the workflow itself:

Red nodes are instructions and useful notes.
Yellow nodes are configurable elements you can adjust to your needs.