
Ace Step 1.5 Turbo and SFT - TEXT to AUDIO model with Ollama

Type: Workflows
Stats: 204, 0
Published: Feb 13, 2026
Base Model: Other
Hash: AutoV2 AEC182C10B
Author: tremolo28

V1.2 Ace Step 1.5 Turbo and SFT models with Ollama, Text to Audio/Song (examples below)

  • Small update to the GUI, the system prompts and the SFT sampler "engine".

  • To revert to the v1.0 "engine", bypass the "ConditioningZeroOut" node in the "sampler SFT" subgraph.

Credits to Reddit user Fornax_460 for the system prompt.


V1.0 Ace Step 1.5 Turbo and SFT models with Ollama, Text to Audio/Song

Ace Step uses TAGS and LYRICS to create a song. These can be generated by Ollama or written as your own prompts.

  • Can use any song or artist as a reference, or any other description, to generate tags and lyrics.

  • Outputs up to two songs: one generated by the Turbo model, the other by the SFT model (experimental).

  • Key scale, BPM and song duration can be randomized.

  • Supports dynamic prompts.

  • Creates a suitable song title and filename with Ollama.

  • LoRA loader included. Hope to see some LoRAs soon!
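The randomization of key scale, BPM and duration mentioned above can be pictured with a small standalone sketch. It is not the workflow's actual node logic: the list of keys, the two modes and the BPM and duration ranges are assumptions chosen for illustration, and `random_song_settings` is a hypothetical helper name.

```python
import random

# Assumed value pools, for illustration only; the workflow's real options may differ.
KEYS = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]
MODES = ["major", "minor"]


def random_song_settings(seed=None, bpm_range=(60, 180), duration_range=(60, 240)):
    """Pick a random key scale, BPM and duration in seconds (hypothetical helper)."""
    rng = random.Random(seed)  # a fixed seed makes the pick reproducible
    return {
        "keyscale": f"{rng.choice(KEYS)} {rng.choice(MODES)}",
        "bpm": rng.randint(*bpm_range),            # both bounds inclusive
        "duration": rng.randint(*duration_range),  # both bounds inclusive
    }


if __name__ == "__main__":
    print(random_song_settings(seed=42))
```

Passing the same seed twice returns the same settings, which helps when comparing the Turbo and SFT outputs for one set of parameters.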

Important: Do not use sage attention in your ComfyUI start parameters, and avoid the --lowvram setting, as this might force the text encoder to run very slowly on the CPU instead of the GPU. Toggling link visibility to hide the wires is also recommended.
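As a quick sanity check for the advice above, the following sketch scans a ComfyUI launch command for the two discouraged options. It assumes sage attention is enabled through the `--use-sage-attention` flag; if your launcher uses a different switch, adjust the set. `discouraged_flags` is a hypothetical helper, not part of ComfyUI.

```python
# Flags this workflow advises against. "--use-sage-attention" is an assumption
# about how sage attention is enabled in your ComfyUI launch command.
DISCOURAGED_FLAGS = {"--use-sage-attention", "--lowvram"}


def discouraged_flags(argv):
    """Return the discouraged flags found in a list of launch arguments."""
    return sorted(flag for flag in argv if flag in DISCOURAGED_FLAGS)


if __name__ == "__main__":
    launch = ["main.py", "--lowvram", "--port", "8188"]
    found = discouraged_flags(launch)
    if found:
        print("Consider removing:", ", ".join(found))
```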


Download Files:

Ollama models are required for tags, lyrics and song title. You can choose one, two or three different models: tags and lyrics might need a bigger model (>7b), while the song title can use a smaller one.
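To illustrate how separate models could serve the three roles, here is a minimal sketch that talks to a local Ollama server through its `/api/generate` HTTP endpoint. It runs outside ComfyUI and is not how the workflow's nodes are implemented. The model names in `ROLE_MODELS` are placeholders, not recommendations, and the system prompts are left for you to supply.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local address

# Placeholder model names (assumptions): replace with models you have pulled.
ROLE_MODELS = {
    "tags": "your-larger-model",
    "lyrics": "your-larger-model",
    "songtitle": "your-smaller-model",
}


def build_request(role, system_prompt, user_prompt):
    """Build the JSON payload for one non-streaming generate call."""
    return {
        "model": ROLE_MODELS[role],
        "system": system_prompt,
        "prompt": user_prompt,
        "stream": False,  # ask for a single JSON object instead of a stream
    }


def generate(role, system_prompt, user_prompt, url=OLLAMA_URL):
    """Send the request to Ollama and return the generated text."""
    data = json.dumps(build_request(role, system_prompt, user_prompt)).encode("utf-8")
    req = urllib.request.Request(url, data=data, headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With a running server, `generate("songtitle", "<your system prompt>", "a slow synthwave ballad")` would return the model's text reply. Using one model for all three roles only requires pointing every entry in `ROLE_MODELS` at the same name.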


Update, 9 Feb 2026: Alternative Turbo and SFT models:


Which models to start with? Turbo, SFT-Shift1 and Llama3-NeuralDaredevil for Ollama.

  • My current choice: Turbo-SFT merge_ta_0.5, SFT-Shift1 and Llama3-NeuralDaredevil for Ollama, with these settings:

    • Turbo-SFT_merge model: 22 steps, sampler: er_sde, scheduler: beta57

    • SFT-Shift1 model: 100-140 steps, sampler: euler, scheduler: normal
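For reference, the settings listed above can be written down as a small lookup table. This is only a note-taking sketch: the dictionary layout and the `steps_for` helper are hypothetical, while the values come from the list above. The beta57 scheduler is only available with the optional RES4LYF custom nodes.

```python
# Sampler settings per model, copied from the list above.
# A (low, high) tuple means any step count in that range is suggested.
SAMPLER_SETTINGS = {
    "Turbo-SFT_merge": {"steps": 22, "sampler": "er_sde", "scheduler": "beta57"},
    "SFT-Shift1": {"steps": (100, 140), "sampler": "euler", "scheduler": "normal"},
}


def steps_for(model, position=0.0):
    """Return a step count; position 0.0..1.0 picks a point inside a range."""
    steps = SAMPLER_SETTINGS[model]["steps"]
    if isinstance(steps, int):
        return steps
    low, high = steps
    return round(low + position * (high - low))


if __name__ == "__main__":
    print(steps_for("Turbo-SFT_merge"))  # fixed step count
    print(steps_for("SFT-Shift1", 0.5))  # midpoint of the suggested range
```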


Save Location:

📂 ComfyUI/
├── 📂 models/
│   ├── 📂 diffusion_models/
│   │   └── acestep_v1.5_turbo.safetensors
│   ├── 📂 text_encoders/
│   │   ├── qwen_0.6b_ace15.safetensors
│   │   └── qwen_4b_ace15.safetensors (or 1.7b)
│   └── 📂 vae/
│       └── ace_1.5_vae.safetensors
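A short script can confirm that the files sit where the layout above expects them. This is a minimal sketch: `missing_files` is a hypothetical helper, and the ComfyUI root path is something you pass in yourself. If you use the 1.7b text encoder instead of the 4b one, replace that entry with your actual filename, since the script would otherwise report it as missing.

```python
from pathlib import Path

# Paths relative to the ComfyUI root, taken from the layout above.
EXPECTED_FILES = [
    "models/diffusion_models/acestep_v1.5_turbo.safetensors",
    "models/text_encoders/qwen_0.6b_ace15.safetensors",
    "models/text_encoders/qwen_4b_ace15.safetensors",
    "models/vae/ace_1.5_vae.safetensors",
]


def missing_files(comfy_root):
    """Return the expected files that are not present under comfy_root."""
    root = Path(comfy_root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    import sys

    root = sys.argv[1] if len(sys.argv) > 1 else "ComfyUI"
    for rel in missing_files(root):
        print("missing:", rel)
```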


Custom Nodes used:

Optional: RES4LYF, which provides the Beta57 scheduler for a bit more punch: https://github.com/ClownsharkBatwing/RES4LYF


Examples in various styles: