Updated: Feb 1, 2026
base modelDownload
1 variant available
Join LUXED AI, the best AI community: https://discord.gg/HxfP9TnctJ
💚 ChronoEdit | 🖥️ GitHub | 🤗 Hugging Face | 🤖 Gradio Demo | 📑 Paper
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
ChronoEdit-14B enables physics-aware image editing and action-conditioned world simulation through temporal reasoning. It distills priors from a 14B-parameter pretrained video generative model and separates inference into (i) a video reasoning stage for latent trajectory denoising, and (ii) an in-context editing stage for pruning trajectory tokens. ChronoEdit-14B was developed by NVIDIA as part of the ChronoEdit family of multimodal foundation models. This model is ready for commercial use.
Overview of the ChronoEdit pipeline. From right to left, the denoising process begins in the temporal reasoning stage, where the model imagines and denoises a short trajectory of intermediate frames. These intermediate frames act as reasoning tokens, guiding how the edit should unfold in a physically consistent manner. For efficiency, the reasoning tokens are discarded in the subsequent editing frame generation stage, where the target frame is further refined into the final edited image.

