Powerful unified model for image generation and editing
Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: one-click results with zero tuning — you still choose inputs, prompts, and settings.
Open preloaded workflow on RunComfy
Why RunComfy first
- Fewer missing-node surprises — run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout — useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON — the zip follows the same runnable workflow you can open on RunComfy.
When downloading for local ComfyUI makes sense — you want full control over models on disk, batch scripting, or offline runs.
How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.
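For batch scripting, the steps above can also be driven through ComfyUI's HTTP API instead of the UI. The following is a minimal sketch, assuming a local server at the default address 127.0.0.1:8188 and a copy of this workflow exported in API format. The helper names and the file name `omnigen2_api.json` are hypothetical and not part of the published workflow.

```python
import json
import urllib.request


def set_inputs(workflow, class_type, **inputs):
    """Update inputs on every node of the given class_type; return matching node ids."""
    matched = []
    for node_id, node in workflow.items():
        if node.get("class_type") == class_type:
            node["inputs"].update(inputs)
            matched.append(node_id)
    return matched


def build_prompt_request(workflow, server="http://127.0.0.1:8188"):
    """Build (but do not send) a POST request for ComfyUI's /prompt endpoint."""
    body = json.dumps({"prompt": workflow}).encode("utf-8")
    return urllib.request.Request(
        f"{server}/prompt",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def queue_workflow(path="omnigen2_api.json", server="http://127.0.0.1:8188"):
    """Load an API-format export (hypothetical file name), shrink it, and queue it."""
    with open(path, encoding="utf-8") as f:
        workflow = json.load(f)
    # Short test run: small resolution, single image.
    set_inputs(workflow, "EmptySD3LatentImage", width=512, height=512, batch_size=1)
    with urllib.request.urlopen(build_prompt_request(workflow, server)) as resp:
        return json.load(resp)
```

Calling `queue_workflow()` with the server running submits one job. The sketch only changes the latent size because rewriting every CLIP Text Encode node at once would overwrite any negative prompt as well; set prompt text per node id instead.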
Expectations — First run may pull large weights; cloud runs may require a free RunComfy account.
Overview
This workflow brings OmniGen2's unified multimodal generation to ComfyUI. It uses a 7B-parameter model with a dual-path Transformer architecture for text-to-image generation and text-guided image editing. Built on the Qwen2.5-VL foundation, OmniGen2 is strong at compositional understanding, following long prompts, and making precise image modifications while preserving visual quality and consistency.
Important nodes:
EmptySD3LatentImage
CLIP Text Encode (Prompt)
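As a rough illustration of how these two nodes appear in an API-format export, the fragment below is a sketch only. The node ids, the prompt text, and the link target `["4", 0]` are hypothetical. The shape helper assumes the node creates a 16-channel latent at 1/8 of the pixel resolution with dimensions in steps of 16, as ComfyUI core does at the time of writing; check your installed version.

```python
# Hypothetical node ids; in API format a link is [source_node_id, output_index].
OMNIGEN2_NODE_FRAGMENT = {
    "5": {
        "class_type": "EmptySD3LatentImage",
        "inputs": {"width": 1024, "height": 1024, "batch_size": 1},
    },
    "6": {
        "class_type": "CLIPTextEncode",  # shown in the UI as "CLIP Text Encode (Prompt)"
        "inputs": {"text": "a red fox reading a newspaper", "clip": ["4", 0]},
    },
}


def sd3_latent_shape(width, height, batch_size=1):
    """Assumed latent shape: (batch, 16 channels, height / 8, width / 8)."""
    if width % 16 or height % 16:
        raise ValueError("width and height are assumed to be multiples of 16")
    return (batch_size, 16, height // 8, width // 8)
```

Under that assumption, halving the width and height cuts the number of latent elements to a quarter, which is why a small test resolution is a cheap first run.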
Notes
See the RunComfy page "OmniGen2 ComfyUI Workflow | Unified Text-to-Image Generation" for the latest node requirements.

