Build lifelike voices quickly and easily, across languages and styles.
Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: anyone expecting one-click results with zero tuning; you still choose inputs, prompts, and settings.
Open preloaded workflow on RunComfy (browser)
Why RunComfy first
- Fewer missing-node surprises — run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout — useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON — the zip follows the same runnable workflow you can open on RunComfy.
When downloading for local ComfyUI makes sense — you want full control over models on disk, batch scripting, or offline runs.
How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.
Expectations: the first run may download large model weights; cloud runs may require a free RunComfy account.
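For local batch runs, step 2 can also be scripted. The following is a minimal sketch, assuming the workflow has been exported from ComfyUI in API format and that a local ComfyUI server is listening on its default address (127.0.0.1:8188). It overrides every literal seed input and builds the request for the server's /prompt endpoint. The input name `seed` is an assumption for illustration; check the actual input names in the exported JSON for this workflow.

```python
import copy
import json
import urllib.request

# Default address of a locally running ComfyUI server; adjust if yours differs.
COMFY_URL = "http://127.0.0.1:8188/prompt"


def with_seed(workflow: dict, seed: int) -> dict:
    """Return a copy of an API-format workflow with every literal `seed` input replaced.

    Assumes nodes expose the seed under an input named "seed" (an assumption,
    not confirmed for the nodes in this workflow).
    """
    updated = copy.deepcopy(workflow)
    for node in updated.values():
        inputs = node.get("inputs", {})
        # Linked inputs are stored as [source_node_id, output_index]; leave those alone.
        if isinstance(inputs.get("seed"), int):
            inputs["seed"] = seed
    return updated


def build_request(workflow: dict, url: str = COMFY_URL) -> urllib.request.Request:
    """Wrap the workflow in the JSON body that the /prompt endpoint expects."""
    body = json.dumps({"prompt": workflow}).encode("utf-8")
    return urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )


def queue(workflow: dict, url: str = COMFY_URL) -> dict:
    """Submit the workflow to a running ComfyUI server and return its JSON reply."""
    with urllib.request.urlopen(build_request(workflow, url)) as response:
        return json.load(response)
```

Calling `queue(with_seed(workflow, 42))` on the loaded JSON submits one short test run; looping over a handful of seeds gives a simple batch. Outputs still come from the Save / Write nodes in the graph.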
Overview
With this voice synthesis workflow, you can design natural speech, multilingual dialog, and cloned voices in a single setup. The audio node suite offers both standard and Turbo TTS generation with reference-guided voice control. You can switch between speech modes and compare the results quickly, which makes the workflow well suited to prototyping narration, virtual character voices, and AI performance projects. The settings give creators control over tone, accent, and pacing, so the workflow also suits voice design experiments and creative storytelling through sound.
Notes
ChatterBox TTS ComfyUI Workflow | Multilingual Voice & Dialog: see the RunComfy page for the latest node requirements.

