Free Subtitle Generator Guide

How to Add Subtitles to a Video
for Free

Generate SRT and VTT subtitles locally on Windows in 96 languages. StarWhisper transcribes the audio track on your PC and gives you a subtitle file you can drop into Premiere, DaVinci, CapCut, or any editor. No upload, no per-minute pricing.

Download for Windows
Microsoft Store
  • Trusted by Windows
  • Quick 30-second setup
More
"Subtitles.srt exported, ready for Premiere..."

Five Steps to a Subtitled Video

Local-only workflow, works with any editor.

1

Extract the audio track from your video

Use FFmpeg, which is free and the de-facto Windows tool for audio extraction. The one-liner below pulls audio out of any common video format and saves it as MP3:

ffmpeg -i input.mp4 -vn -acodec libmp3lame audio.mp3

If you do not want to use FFmpeg, you can also export an audio-only version directly from your video editor (Premiere, DaVinci, CapCut all support this).

2

Drag the audio into StarWhisper

Open StarWhisper and drop the extracted audio file onto the app window. The desktop app auto-detects the format, picks a Whisper model that suits your hardware (CUDA on NVIDIA, Vulkan as a fallback, CPU baseline), and starts transcription. A 10-minute audio finishes in 1 to 2 minutes on a modern GPU.

3

Export as SRT or VTT

Use the Export menu to save as .srt (SubRip, the universal default) or .vtt (WebVTT, used by browsers and YouTube). Both formats include phrase-level timestamps that any video editor or video player can read. Keep both around if you are not sure which one your target tool wants.

4

Import the SRT into your video editor

Drag the .srt onto the timeline in Premiere Pro, DaVinci Resolve, Final Cut Pro, CapCut, Kdenlive, Shotcut, or any editor that accepts subtitles. The editor places the lines on a captions track aligned with the timeline. Most editors let you adjust timing, font, size, color, and position from there.

5

Style, review, and export

Skim the timeline, fix any mis-heard proper nouns (Whisper is good at words, less good at brand names it has not seen). Adjust styling to match your video's look. Export the video either with subtitles burned in (good for TikTok, Reels, Shorts) or with the SRT as a sidecar file (good for YouTube, Vimeo, and accessibility).

Why a Local Subtitle Generator Wins

What you actually get versus online subtitle generators and paid editor features.

No upload, no NDA risk

Online generators upload your video to their servers. For client work under NDA, unreleased content, internal corporate videos, or anything sensitive, that is a non-starter. StarWhisper runs locally and the video never leaves your PC.

SRT and VTT output

Both major subtitle formats are supported. SRT works in every editor and player on the planet. VTT works for HTML5 video and YouTube uploads. Pick the one that fits, or export both.

96 languages

Whisper supports far more languages than Premiere's built-in transcription. If your content is in Spanish, French, German, Japanese, Korean, Hindi, Arabic, Turkish, Polish, Vietnamese, or any of dozens of others, the same workflow applies.

No per-minute pricing

Online subtitle services charge by the minute. A 1-hour video can cost $5 to $10 per pass. StarWhisper has a free plan for short videos and a flat $10/month or $80/year Pro for unlimited use.

Works with any editor

StarWhisper produces a standard subtitle file. From there you can use Premiere, DaVinci Resolve, Final Cut, CapCut, Kdenlive, Shotcut, OBS, or even free subtitle editors like Subtitle Edit and Aegisub.

GPU acceleration

On NVIDIA hardware with a CUDA pack installed, StarWhisper transcribes at many times real-time speed. A 30-minute video is ready for editing in 2 to 5 minutes instead of waiting overnight on a free online tool.

The State of Subtitle Generation in 2026

There are three real options for adding subtitles to a video on Windows. The first is to use a paid subtitle feature inside an editor like Adobe Premiere Pro, DaVinci Resolve Studio, or Final Cut Pro on Mac. Quality is good and timing is precise, but the price tag is steep (Premiere is a Creative Cloud subscription, DaVinci Studio is a one-time $295, Final Cut is Mac-only at $300). The second is to use a free online subtitle generator that uploads your video. Quality varies, language coverage is uneven, and the privacy story is bad for any non-public content. The third is to run transcription locally with a tool like StarWhisper and import the resulting SRT into whatever editor you already use, free or paid.

The third path is the one that has gotten significantly more viable in the last two years because OpenAI Whisper, the underlying model, is genuinely good. Whisper's large variant matches or exceeds the accuracy of paid commercial systems on a wide range of content, and it runs on a normal PC. StarWhisper wraps Whisper in a Windows desktop app with file drop, language detection, GPU support, and subtitle export, so you do not need to invoke Python or build anything to get a usable .srt.

Paid Subtitle Tools vs Free Local Generation

Comparing real options honestly.

Tool Cost Languages Privacy
Premiere Pro auto-captions Creative Cloud subscription ~15 strong Adobe cloud processing
DaVinci Resolve Studio captions $295 one-time ~13 Local in Studio
Rev AI subtitles $0.25/min ~36 Uploaded to Rev
Online free subtitle generators Free with caps Varies Uploaded
YouTube auto-captions Free ~13 strong Uploaded to YouTube
StarWhisper SRT export Free up to 500 w/day, $10/mo Pro 96 Local, no upload

The honest case for paid tools: integrated editing, broadcast-grade per-word alignment, and styling baked in. The case for StarWhisper: privacy, language coverage, and price. For most independent creators, YouTubers, podcasters, and small studios, the trade favors the local approach.

FFmpeg Cheat Sheet for Audio Extraction

FFmpeg is the standard tool for pulling audio out of video. It is free, runs from the command line, and handles every container format you will encounter. Install it from ffmpeg.org and add it to your PATH so you can call ffmpeg from any folder. Common one-liners:

// MP4 video to MP3 audio
ffmpeg -i input.mp4 -vn -acodec libmp3lame audio.mp3

// Any video to WAV (best quality for transcription)
ffmpeg -i input.mov -vn -acodec pcm_s16le -ar 16000 -ac 1 audio.wav

// Trim to a specific section, useful for testing
ffmpeg -ss 00:01:00 -to 00:02:00 -i input.mp4 -vn -acodec libmp3lame clip.mp3

// Batch process every .mp4 in a folder (PowerShell)
Get-ChildItem *.mp4 | ForEach-Object { ffmpeg -i $_.Name -vn -acodec libmp3lame ($_.BaseName + ".mp3") }

For transcription, mono 16 kHz WAV is the gold standard input. MP3 at any bitrate is fine for general use. Whisper does not need high-bitrate stereo audio.

Importing SRT Into Common Editors

Adobe Premiere Pro

Window > Captions, then drag the .srt onto the Captions panel. Premiere creates a captions track on the timeline that you can style and position. Export with "Burn Captions Into Video" to embed.

DaVinci Resolve

In the Edit page, right-click the media pool and Import Subtitle. Drop the resulting subtitle track onto the timeline. Customize style under Inspector > Captions.

CapCut Desktop

Click Text > Captions > Import. Select the .srt file. CapCut adds a caption layer matched to your timeline. Useful for vertical edits and social formats.

Kdenlive and Shotcut

Both free editors accept SRT imports natively. Kdenlive: Project > Subtitles > Import. Shotcut: Drop the SRT into the timeline like a clip.

Subtitle Edit (free standalone)

If you want to refine subtitles before importing into a video editor, Subtitle Edit is the best free tool. Open the .srt, the audio waveform, and adjust lines visually with sync to playback.

YouTube Studio

For YouTube uploads, open the video in Studio, choose Subtitles, then Add Language, then Upload File, and pick the SRT. The Whisper-generated captions usually beat YouTube's auto-captions on accuracy and language coverage, especially for non-English audio. See the YouTube transcription guide for related workflows.

Burning Subtitles vs Sidecar Files

Two ways to ship a subtitled video. Each has its place.

Sidecar: A separate .srt file that travels alongside the video. The viewer's player (or YouTube, Vimeo, etc.) loads it on top of the video at playback time. Advantages: viewers can toggle subtitles on and off, multiple languages can ship as separate files, and the video itself is unmodified. Use this for YouTube, Vimeo, accessibility on websites, and anywhere you want viewers to control captions.

Burned in: The subtitles are rendered as part of the video frames. They cannot be turned off. Advantages: works on platforms that strip sidecar files (TikTok, Instagram Reels, Twitter video, many email clients), and the styling is exactly what you designed. Use this for short-form social, ads, anything where viewer settings cannot be trusted to display captions correctly.

You can burn subtitles directly with FFmpeg if your editor's export is overkill for the task:

ffmpeg -i input.mp4 -vf subtitles=subs.srt output.mp4

For styled subtitles (font, color, position), most editors offer better control than the FFmpeg one-liner.

Multi-Language Workflows

Whisper handles 96 languages, and the export workflow is identical for each. If you produce content in multiple languages, the standard pattern is:

  • Transcribe in the source language. Export as video.en.srt (or whatever language code).
  • Translate the SRT to a target language using any translation tool (DeepL, Google Translate, or LLMs like ChatGPT preserve the timestamps well if you ask them to). Save as video.es.srt, video.de.srt, etc.
  • Upload all language tracks to YouTube under the same video, or include them as separate sidecar files for other platforms.

The same workflow extends to podcasters publishing video versions of their shows, content creators producing for international audiences, and educators making lecture content accessible.

Frequently Asked Questions

What file formats does my video need to be in?
Any common video format works because StarWhisper only needs the audio track. MP4, MOV, MKV, AVI, WebM, FLV, WMV, and most others can be handled by extracting the audio with FFmpeg, then transcribing that audio. If you do not want to run FFmpeg yourself, you can also export an audio-only version from your editor or use any free audio extractor. The transcript and subtitles are produced from the audio either way.
Does it export both SRT and VTT?
Yes. StarWhisper exports SubRip (.srt) and WebVTT (.vtt), the two formats every major video editor accepts. SRT is the universal default and works with Premiere Pro, DaVinci Resolve, Final Cut Pro, CapCut, Kdenlive, Shotcut, and any video player including VLC. VTT is the web-native format used by YouTube and HTML5 video players. Both files contain the same transcript and timing data, just in different syntax.
How accurate are the timestamps?
Timestamps are accurate to within a few hundred milliseconds in most cases, which is well inside the tolerance for human-readable subtitles. Whisper produces phrase-level segments rather than per-word alignment by default, which is how most professional subtitles are structured. If a particular line drifts you can nudge it in your editor in seconds, but the typical workflow does not require it. For broadcast-grade per-word sync, professional captioning tools still have an edge.
Can I edit the subtitles after generation?
Yes, on three fronts. You can open the .srt or .vtt file in any text editor and rewrite lines directly, which is fine for small fixes. You can use a free dedicated subtitle editor like Subtitle Edit or Aegisub for line-by-line work with audio playback. Or you can import into Premiere, DaVinci, CapCut, or your editor of choice and edit inside the timeline alongside the video. All three routes are valid.
Does it work for podcasts and YouTube videos I own?
Yes. Podcasts are already audio so they go directly into StarWhisper without extraction. For your own YouTube videos, download the source file (or the published version) and extract the audio the same way as any other video. The resulting SRT can be uploaded to YouTube Studio to replace or supplement YouTube's auto-captions, which are often less accurate than Whisper, especially for non-English content.
Can I burn the subtitles into the video?
Yes, but that step happens in your video editor or in FFmpeg, not in StarWhisper. StarWhisper produces the subtitle file. To burn it permanently into the video frames, use your editor's export-with-burned-subtitles setting, or run an FFmpeg command like ffmpeg -i input.mp4 -vf subtitles=subs.srt output.mp4. Burned subtitles are useful for social platforms that strip sidecar files like Instagram Reels and TikTok.
What languages does the subtitle generation support?
StarWhisper supports 96 languages via OpenAI Whisper, including strong coverage of English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Swedish, Japanese, Chinese, Korean, Hindi, Russian, Arabic, Turkish, Vietnamese, and Indonesian. If your audio is multilingual or code-switched, Whisper handles transitions without requiring you to set a language manually. The same SRT export works for all of them, so subtitle workflows for non-English content do not require a different tool.
Is my video uploaded to a server?
No. StarWhisper Local Mode processes audio entirely on your PC. The audio file is read from disk, Whisper runs on your CPU or GPU, and the SRT or VTT is written back to disk. Nothing is uploaded. This matters for unreleased footage, client work under NDA, internal corporate videos, and anything you would not want sitting on a third-party transcription service. There is an opt-in Cloud Mode for maximum-accuracy needs, but Local Mode is the default.

Generate Subtitles for Free

SRT and VTT export, 96 languages, fully offline. No upload, no per-minute fees.

Download StarWhisper for Windows