Hey Ghosties! Geeky Ghost just dropping in to explain the workflow a little bit. I use Geeky Kokoro TTS, Sonic, Flux Schnell, Live Portrait, and Hunyuan i2v to make short videos of people talking with some additional animation. It's a work in progress, but currently makes some cool results. Let's just dive right in shall we? Step 1) We generate an image to start with, I personally use Flux Schnell, well a merged and customized checkpoint version of Flux Schnell. You are free to use what ever you like. If you want, you can find my flux checkpoint here. <a target="_blank" rel="ugc" href="https://civitai.com/models/1324705/geeky-flux-schnell-tweaked-and-merged-model">https://civitai.com/models/1324705/geeky-flux-schnell-tweaked-and-merged-model</a><img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/5159a243-5811-40e2-8bfc-b8e02ee3f2d1/width=525/5159a243-5811-40e2-8bfc-b8e02ee3f2d1.jpeg" />Now we have our Starting Image using the prompt Side view close-up of a woman in a business suit walking while looking directly at you. Woman with brown hair and brown eyes in a blue business suit and looking directly at you. She has her hair up in a high ponytail. She's in her 30's. She's on the left side of the scene walking to the right. The background is a long side view of a hallway in an office building. The hallway goes from left to right. Half body shot. Cinematic, detailed, crisp and clear photo quality image. <img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/5e22924d-b88a-47af-b5d8-b8fa8f291daf/width=525/5e22924d-b88a-47af-b5d8-b8fa8f291daf.jpeg" />Step 2) Generating a video using Hunyuan Image2Vid model. <img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/3da21d49-a8fd-4479-b0a9-f7eb5c65c475/width=525/3da21d49-a8fd-4479-b0a9-f7eb5c65c475.jpeg" />Now we have our video using the promptSide view close-up of a woman in a business suit walking while looking directly at you. Woman with brown hair and brown eyes in a blue business suit and looking directly at you. She has her hair up in a high ponytail. She's in her 30's. She's on the left side of the scene walking to the right. The background is a long side view of a hallway in an office building. The hallway goes from left to right. Half body shot. Cinematic, detailed, crisp and clear photo quality image. <a target="_blank" rel="ugc" href="https://civitai.com/images/62318812">https://civitai.com/images/62318812</a> Step 3) Next we generate the voice. I like doing a mix of Heart and Nicole at a 0.5 blend Ratio. <img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/f9b62336-2427-4637-9a87-aad2bc62fc23/width=525/f9b62336-2427-4637-9a87-aad2bc62fc23.jpeg" /><img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/fbaac369-2dad-460f-88eb-90d6268215e0/width=525/fbaac369-2dad-460f-88eb-90d6268215e0.jpeg" />Now we have our voice. Step 4) Now we make the lip sync video. <img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/860f3f5e-6cab-4971-89c7-cd710efd3e3b/width=525/860f3f5e-6cab-4971-89c7-cd710efd3e3b.jpeg" />I use Sonic to make the lip sync, loading the audio and image we created in previous steps. <a target="_blank" rel="ugc" href="https://civitai.com/images/62318784">https://civitai.com/images/62318784</a>Step 5) Putting it all together with Live Portrait. Using the Sonic Video as the driver and the Hunyuan video as the source, we run it through live portrait to merge the two. We have to cap the frame count of the source video to that of the driver for live portrait to work. It also doesn't like distant faces, so keep that in mind. <img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/1ae2cbf7-067b-4fa6-bd07-35e6d1f4aecc/width=525/1ae2cbf7-067b-4fa6-bd07-35e6d1f4aecc.jpeg" /><a target="_blank" rel="ugc" href="https://civitai.com/images/62320624">https://civitai.com/images/62320624</a>Step 6) Full Video We concatenate the Live portrait video with the left over frames from the Hunyuan video to create the full video. <img src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/14df5173-c195-45d7-a239-8615a521e614/width=525/14df5173-c195-45d7-a239-8615a521e614.jpeg" />The Full Workflowhttps://civitai.com/models/1325787?modelVersionId=1508281

workflow.png

Geeky Kokoro TTS Animation Station

large_banner_image.webp

physical violence

weapon violence

wide hips

revealing clothes

downblouse

convenient censoring

pg-13

corpses

suggestive

oral invitation

pg13

sexy

huge breasts

thick thighs

sexual situations

male nudity

disturbing

male swimwear or underwear

female swimwear or underwear

partial nudity

undressed

female nudity

breasts out

exposed female nipple

breast out

lingerie

male underwear

hair over breasts

female swimwear

gigantic breasts

no panties

graphic violence or gore

covered nipples

huge butt

strapless leotard

sitting on face

emaciated bodies

one breast out

female underwear

nude

nsfw

graphic male nudity

adult toys

illustrated explicit nudity

nudity

graphic female nudity

hentai

futanari

porn

sexual intent

genitals

peeing

vore

oral

sexual activity

anal

blowjob

dildo riding

incest

hanging

hate symbols

nazi party

white supremacy

diapers

scat

self injury

hate speech

urine

extremist

child on child

latex clothing

swimwear

bukkake

fellatio

cumshot

implied fellatio

eat_cum

cumdrip

cum in pussy

cum on face

after fellatio

cum on hair

cum on body

cum on tongue

cum on hands

cum in mouth

triple fellatio

autofellatio

fucked silly

cum on pussy