Run this workflow on InstaSD
Get started in minutes! Run this ComfyUI workflow online - no setup required.
Description
Generate realistic AI voiceovers directly inside ComfyUI using this Kokoro TTS Workflow. This setup allows you to create free, offline text-to-speech audio, blending multiple speaker models for custom voices. Ideal for narration, storytelling, and character dialogue generationβall without cloud services.
π― Features
- Free & Local β No online services required; fully runs offline using Kokoro.
- Multiple Speaker Voices β Includes support for models like
am_onyxandam_adam. - Voice Blending β Combine two voices via
KokoroSpeakerCombinerfor unique speech tone. - Text-to-Speech Node β
KokoroGeneratorturns your typed text into audio using the selected voice. - Audio Export β Output is saved as a playable
.wavfile viaSaveAudio.
π‘ Use Cases
- YouTube Narration β Generate commentary and explanations for videos.
- Game Development β Create character dialogue with distinct tones.
- Voiceover for Animation β Narrate animated scenes or intros locally.
- Storytelling & Audiobooks β Read out long texts with expressive voice control.
- Virtual Assistants β Add speech to AI bots or desktop assistants.
βοΈ How It Works
- Load Speakers β Use two
KokoroSpeakernodes to select different voices (e.g.am_onyx,am_adam). - Blend Voices (Optional) β Use
KokoroSpeakerCombinerto merge voices with adjustable ratio. - Type Your Text β Input your line into
KokoroGenerator, choose speed/language settings. - Generate Speech β Connect the combined speaker into
KokoroGeneratorto create the voice audio. - Save Audio β Use
SaveAudioto export the voiceover as a.wavfile.
Credits: pixaroma
Nodes
KokoroGeneratorSaveAudioKokoroSpeakerCombinerKokoroSpeaker