A free Windows-native dictation tool against a cloud transcription studio for audio and video files. Two valid workflows, two different price models. See which one matches the job you are actually trying to get done.
Both tools transcribe speech. They package it for completely different jobs.
StarWhisper runs OpenAI Whisper locally on your Windows PC. Press a hotkey, talk, words appear in whatever text field you are using. Free for 500 words a day.
Happy Scribe is a cloud transcription studio for audio and video files, with timestamps, speaker labels, a collaborative editor, and subtitle export workflows.
Six honest differences between a dictation hotkey and a transcription studio
StarWhisper is push-to-talk into the cursor: you trigger it, speak, words land in the active app. Happy Scribe is upload-and-process: you give it a file and get back a transcript with timestamps and speaker labels.
StarWhisper processes audio on your machine and works offline. Happy Scribe uploads files to its EU cloud, processes them on its servers, and stores transcripts in your account.
StarWhisper Pro is $10 per month flat for unlimited dictation. Happy Scribe charges by hours of audio transcribed, starting around 12 EUR per month for 5 hours of machine transcription.
Happy Scribe has a polished web editor with click-to-jump audio, in-browser speaker correction, and one-click subtitle export. StarWhisper outputs clean text; you bring your own editor.
StarWhisper supports 96+ languages through Whisper, all local. Happy Scribe markets 120+ languages with paid translation features. Both are strong for European languages.
StarWhisper: audio stays on the device, no exceptions. Happy Scribe: GDPR-compliant EU cloud, audio leaves your machine. Different defaults for different content sensitivities.
The honest framing is not "which is better" but "which job are you doing right now." StarWhisper and Happy Scribe both turn speech into text, but they wrap that core function in completely different products. Treating them as alternatives without understanding the workflow each one is built for leads to picking the wrong tool and paying for features you do not use.
StarWhisper is a desktop application for Windows 10 and 11. It sits in the system tray, listens for a global hotkey, and when triggered records your microphone, runs the audio through a local copy of OpenAI Whisper, and types the result into whatever text field is in focus. The model lives on disk, so the app works offline and processes audio without ever sending it to a server. It also handles drag-and-drop of audio and video files for one-off transcription, but the primary product is the hotkey: a way to talk instead of type, anywhere on Windows.
Happy Scribe is a Barcelona-based web application focused on producing finished transcripts from audio and video files. You upload a file, choose machine or human transcription, and Happy Scribe returns a transcript in its editor with timestamps and speaker labels. From there you can click to jump to a moment in the audio, edit the text, correct speakers, translate to other languages, and export the result as SRT or VTT subtitles, DOCX, PDF, or other formats. It is the tool podcast editors, video producers, journalists, and broadcasters reach for when they need to ship a polished transcript or subtitle file.
Neither tool tries to do the other's job. StarWhisper does not have a multi-speaker timeline editor. Happy Scribe does not type into your Slack messages.
Verified against the StarWhisper app and Happy Scribe's public pricing and product pages as of May 2026.
| Feature | StarWhisper | Happy Scribe |
|---|---|---|
| Primary use case | Real-time dictation into any text field | File transcription with editor and subtitles |
| Audio processing | Local, on device | EU cloud |
| Works offline | Yes | No, requires upload |
| Operating systems | Windows 10, Windows 11 | Web (any OS) |
| Hotkey dictation into apps | Yes | No |
| Audio/video file transcription | Yes (drag-and-drop) | Yes (core product) |
| Timestamps | Basic | Word-level |
| Speaker labels | No | Yes |
| In-app transcript editor | No | Yes, collaborative |
| Subtitle export (SRT, VTT) | No | Yes |
| Translation | No | Yes, paid add-on |
| Human transcription option | No | Yes (premium) |
| Free plan | 500 words/day, 3,500/week | 10 free minutes (trial) |
| Pro monthly price | $10 / month (unlimited dictation) | ~12 EUR / month (5 hours machine) |
| Pro annual price | $80 / year | ~120 EUR / year (varies by tier) |
| Languages supported | 96+ via Whisper | 120+ |
| Underlying model | OpenAI Whisper (local) | Proprietary cloud + Whisper |
| GPU acceleration | NVIDIA CUDA, Vulkan | Server-side |
This is where the two pricing models diverge most visibly. StarWhisper sells a flat-fee software license: pay once per month, dictate as much as you want. Happy Scribe sells hours of transcription: the more files you process, the more you pay.
See StarWhisper Pro for the full feature list and free tier details for the limits in plain language.
Happy Scribe prices in euros and has shifted its tiers a few times over the years. As of May 2026 the public lineup looks roughly like:
The prices change. Always check Happy Scribe's pricing page directly. The shape is per-hour metering, not unlimited.
If you process a small number of files per month, Happy Scribe Pro is in range and you get the editor and subtitle workflow on top. If you process more than 5 hours of audio per month, the per-hour overages add up quickly. StarWhisper has no per-hour meter at any tier, so dictating all day is the same cost as dictating for an hour.
The honest read: if your bottleneck is "I have lots of files to transcribe with timestamps and speaker labels for a client deliverable," pay Happy Scribe for the workflow. If your bottleneck is "I type all day and want to talk instead, plus occasionally transcribe a personal recording," pay StarWhisper for the unlimited dictation.
StarWhisper runs Whisper locally. Audio is captured by your microphone, sent through the model on your machine, and converted to text. There is no upload, no cloud account, no log retained by a vendor. Unplug your network cable and dictation still works.
Happy Scribe is a cloud product. Files are uploaded to its EU-based infrastructure, processed there, and stored in your account so you can use the editor. Happy Scribe is GDPR-compliant and uses European data centers, which is a meaningful trust signal for European customers compared to US-hosted alternatives. It is still a cloud product, though, which means your files leave your machine.
For most podcast and content workflows that is fine, and the GDPR posture is genuinely better than many US alternatives. For workflows where files cannot leave the device, local processing is the safer default. See Whisper local vs cloud for a deeper discussion and HIPAA-compliant dictation for regulated-industry context.
It would be silly to pretend Happy Scribe is not a great product. The team has spent years building a focused transcription studio and the polish shows. The web editor is one of the cleanest transcript-editing experiences on the market. The subtitle workflow is built around how video editors actually work, not bolted on as an afterthought. The reseller integrations with European broadcasters and the GDPR-first data posture make it a sensible default for EU-based content teams. The human transcription option is useful when machine output is not enough.
StarWhisper is not trying to replace any of this. There is no plan to ship a collaborative web editor, a subtitle timeline view, or a marketplace of human transcribers. Different product, different problem.
If you came to this page looking for "the best tool to produce a polished transcript with timestamps, speaker labels, and subtitles I can hand off to a client," Happy Scribe is the answer, not StarWhisper. We are a desktop dictation tool, not a transcription studio.
Several content teams we have talked to actually use both. Happy Scribe for client deliverables (interview transcripts, episode transcripts, subtitled video) and StarWhisper for the working day of email, draft notes, voice memos turned into text, code comments, and personal journals. The tools complement each other.
For the dictation side, the most popular workflow is described on the voice-to-text for writers page. For the file-transcription side, see how to transcribe interviews for a guide that walks through both Happy Scribe-style cloud workflows and local Whisper-based approaches.
Free local Whisper versus per-minute professional transcription pricing.
Real-time Windows dictation versus cloud meeting transcription.
Local desktop dictation versus the Notta cloud transcription platform.
Step-by-step guide for getting clean interview transcripts with timestamps.