Sign In

Comfy UI F5 TTS - Text To speech

34
596
30
Updated: Jan 11, 2025
assetsaudiottstext to speech
Type
Workflows
Stats
596
0
Reviews
Published
Jan 9, 2025
Base Model
Other
Hash
AutoV2
80F14C4AB1
default creator card background decoration
Silver Assets Badge
rocky533's Avatar
rocky533

Using

https://github.com/niknah/ComfyUI-F5-TTS

Text to speech in comfy UI

Features

  • generates audio from a text prompt

  • custom voice from 15 seconds of talking audio.

Included simple flow with just TTS saving audio

Included advanced flow with lipsync video loader and face restore after.

Included 2 face video samples to test on

Included 3 voice samples

pokimane - Streamer

https://www.instagram.com/pokimanelol/?hl=en

Ruby - ASMR Artist

https://www.youtube.com/@rubybubyruby

Voice samples must go in the input directory directly. No subfolders.

To make your own sample just get a .wav of the voice you want. Cut it to 15-50 seconds

Put it in the input directory, then make a empty .txt file with same name as the .wav and it should populate in comfy node when you refresh. Thats it.

Notes

Audio can be saved as sets

voice.wav

voice.txt

voice.emotion.wav

voice.emotion.txt

Using just the voice.wav as sample, These can be called in the prompt with

{main} or {emotion} before the text allowing you to change tones.

This can be used to store many people in 1 set, allowing talking between people.

talkset.jenny.wav

talkset.jenny.txt

talkset.molly.wav

talkset.molly.txt

Called with {jenny} or {molly} before the text allowing you to change people.