Home
Softono
ComfyUI-Whisper

ComfyUI-Whisper

Open source Python
242
Stars
39
Forks
4
Issues
2
Watchers
1 week
Last Commit

About ComfyUI-Whisper

# ComfyUI Whisper Transcribe audio and add subtitles to videos using [Whisper](https://github.com/openai/whisper/) in [ComfyUI](https://github.com/comfyanonymous/ComfyUI). Support multiple languages, prompt guidance and multiple whisper models. **Last tested**: 07 June 2026 (ComfyUI v0.23.0 | Torch 2.12.0 | Triton 3.7.0 | Python 3.12.3 | L40S | CUDA 13.0 | Ubuntu 24.04) ![demo-image](https://github.com/yuvraj108c/ComfyUI-Whisper/blob/assets/recording.gif?raw=true) ## ⭐ Support If you like my projects and wish to see updates and new features, please consider supporting me. It helps a lot! [![ComfyUI-Depth-Anything-Tensorrt](https://img.shields.io/badge/ComfyUI--Depth--Anything--Tensorrt-blue?style=flat-square)](https://github.com/yuvraj108c/ComfyUI-Depth-Anything-Tensorrt) [![ComfyUI-Upscaler-Tensorrt](https://img.shields.io/badge/ComfyUI--Upscaler--Tensorrt-blue?style=flat-square)](https://github.com/yuvraj108c/ComfyUI-Upscaler-Tensorrt) [![ComfyUI-Dwpose-Tensorrt](https://img.shields.io/badge/ComfyUI- ...

Platforms

Web Self-hosted

Languages

Python

Links

ComfyUI Whisper

Transcribe audio and add subtitles to videos using Whisper in ComfyUI. Support multiple languages, prompt guidance and multiple whisper models.

Last tested: 07 June 2026 (ComfyUI v0.23.0 | Torch 2.12.0 | Triton 3.7.0 | Python 3.12.3 | L40S | CUDA 13.0 | Ubuntu 24.04)

demo-image

⭐ Support

If you like my projects and wish to see updates and new features, please consider supporting me. It helps a lot!

ComfyUI-Depth-Anything-Tensorrt ComfyUI-Upscaler-Tensorrt ComfyUI-Dwpose-Tensorrt ComfyUI-Rife-Tensorrt

ComfyUI-Whisper ComfyUI_InvSR ComfyUI-Thera ComfyUI-Video-Depth-Anything ComfyUI-PiperTTS

buy-me-coffees paypal-donation

Installation

Install via ComfyUI Manager

Usage

Load this workflow into ComfyUI

Models are auto-downloaded to /ComfyUI/models/stt/whisper

Supported Models

'tiny.en', 'tiny', 'base.en', 'base', 'small.en', 'small', 'medium.en', 'medium', 'large-v1', 'large-v2', 'large-v3', 'large', 'large-v3-turbo', 'turbo'

Nodes

Apply Whisper

Transcribe audio and get timestamps for each segment and word.

Add Subtitles To Frames

Add subtitles on the video frames. You can specify font family, font color and x/y positions.

Add Subtitles To Background (Experimental)

Add subtitles like wordcloud on blank frames

Save SRT

Export alignments as SRT files in /ComfyUI/output/srt directory

Updates

7 June 2026

Credits

License

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)