Lightricks LTXV

Verified: SafeTensor
Type: Checkpoint Trained
Stats: 1,994
Published: Nov 26, 2024
Base Model: LTXV
Hash (AutoV2): 6311126410
Posted by: theally (moderator)

Originally posted on Hugging Face.

Read our Lightricks LTXV Quickstart Guide on the Education Hub!

This model card describes the LTX-Video model (codebase available here).

LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched. Trained on a large-scale dataset of diverse videos, the model generates high-resolution videos with realistic and varied content. We provide a model for both text-to-video and image+text-to-video use cases.

General tips:

  • The model works on resolutions whose width and height are divisible by 32, and on frame counts of the form 8k + 1, that is, a multiple of 8 plus 1 (e.g. 257). If the resolution or the number of frames does not satisfy these constraints, the input is padded with -1 and then cropped to the desired resolution and number of frames.

  • The model works best at resolutions under 720 x 1280 and with fewer than 257 frames.

  • Prompts should be in English, and the more elaborate the better. A good prompt looks like: "The turquoise waves crash against the dark, jagged rocks of the shore, sending white foam spraying into the air. The scene is dominated by the stark contrast between the bright blue water and the dark, almost black rocks. The water is a clear, turquoise color, and the waves are capped with white foam. The rocks are dark and jagged, and they are covered in patches of green moss. The shore is lined with lush green vegetation, including trees and bushes. In the background, there are rolling hills covered in dense forest. The sky is cloudy, and the light is dim."
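
The size constraints in the first tip can be checked before a request is sent. The following is a minimal sketch, not part of the LTX-Video codebase: the function names are hypothetical, and rounding up (rather than down) is an assumption based on the tip's statement that non-conforming inputs are padded and then cropped. It computes the nearest valid height, width, and frame count at or above the requested values.

```python
def round_up_resolution(size: int, multiple: int = 32) -> int:
    """Return the smallest multiple of `multiple` that is >= size."""
    return ((size + multiple - 1) // multiple) * multiple


def round_up_num_frames(num_frames: int, stride: int = 8) -> int:
    """Return the smallest value of the form stride * k + 1 that is >= num_frames."""
    if num_frames <= 1:
        return 1
    return ((num_frames - 1 + stride - 1) // stride) * stride + 1


def padded_shape(height: int, width: int, num_frames: int) -> tuple[int, int, int]:
    """Hypothetical helper: the (height, width, frames) shape after padding
    a request up to the constraints described in the tips (divisible by 32,
    frame count of the form 8k + 1)."""
    return (
        round_up_resolution(height),
        round_up_resolution(width),
        round_up_num_frames(num_frames),
    )


if __name__ == "__main__":
    # A request that already satisfies both constraints is returned unchanged.
    print(padded_shape(512, 768, 257))
    # A request that does not is rounded up to the next valid shape.
    print(padded_shape(720, 1280, 100))
```

Choosing a conforming shape up front avoids relying on the pad-and-crop step at all. Note that 720 is not divisible by 32, so a 720 x 1280 request falls into the padded case.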