Move your .m4a recording from iPhone to Windows, drop it into StarWhisper, and get a full transcript locally. 96 languages, no upload, no monthly fee for short memos.
Works for any Voice Memo, old or new, any of the 96 supported languages.
Tap the Voice Memos app icon (the white waveform on a black background). Find the recording you want to transcribe in the list. If you keep memos organized in folders, drill into the correct folder first. Tap the recording once to expand the playback bar so the share button is visible.
Tap the three-dot menu next to the recording, then choose Share. Pick whichever route reaches your Windows PC fastest: AirDrop to a nearby Mac then forward to Windows, save to iCloud Drive and pull from icloud.com, email the file to yourself, send it through WhatsApp or Telegram and download from the desktop client, or plug the iPhone in with a USB cable and copy the .m4a out of File Explorer.
Download StarWhisper from starwhisper.ai or the Microsoft Store. The installer takes under a minute. On first launch you can stay on the free plan, which covers 500 words per day and is enough for several short memos. Pro is $10 per month or $80 per year if you transcribe heavily.
Open File Explorer to wherever you saved the Voice Memo (Downloads, Desktop, iCloud Drive folder). Drag the file onto the StarWhisper window. The app detects the audio, picks a Whisper model that fits your hardware (CUDA on NVIDIA, Vulkan as a fallback, CPU as a baseline), and begins transcribing. A two-minute memo usually finishes in seconds on a GPU and within a minute on CPU.
When transcription completes, the full text appears in StarWhisper. Copy it to the clipboard, paste it straight into Notes, Word, Notion, an email, or a chat window, or save it as a plain text file for archiving. If you want timestamped output you can export as SRT or VTT for use in subtitle workflows.
Free Voice Memo transcription that works for the languages and recordings iOS skips.
Whisper supports far more languages than Apple's in-app transcription. Strong coverage for English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Japanese, Chinese, Korean, Hindi, Russian, Arabic, Turkish, Vietnamese, and many more.
iOS 18 introduced built-in transcription, but many users find it missing or unreliable on recordings made before they updated. StarWhisper does not care when the memo was recorded. Drag any .m4a in, get the transcript.
Any transfer route works: USB cable, email, WhatsApp, Telegram, Drive, Dropbox, OneDrive, or AirDrop via a Mac. You do not need to enable iCloud Drive or pay for extra iCloud storage just to move a recording.
The default Local Mode keeps the audio on your PC. No upload, no cloud account, no monthly limits beyond the 500 words per day free tier. Confidential interviews, medical thoughts, legal notes stay on-device.
If your PC has an NVIDIA GPU, StarWhisper uses CUDA to transcribe at many times real-time speed. A 30-minute memo finishes in roughly 2-3 minutes. Vulkan is available as a fallback for AMD and Intel GPUs.
Drop multiple .m4a files at once and StarWhisper queues them. Useful when you finally decide to clean out a backlog of voice notes that have been piling up in the Voice Memos app for months or years.
iPhone Voice Memos is the path of least resistance for capturing audio. You pull the phone out, hit record, and you have a recording without choosing an app, signing into anything, or configuring formats. The problem starts later when you want the words on a page. The recording is sitting in an app on a phone that is not designed for writing, in a format (.m4a) that most desktop tools cannot read without extra steps, in a language that Apple's transcription may or may not handle well.
Most people who search "iphone voice memo to text" or "transcribe iphone voice memo free" are dealing with one of a few real situations. A journalist captured an interview on the phone and now needs a written transcript to quote from. A student recorded a lecture and wants searchable text to study from. A founder dictated a long product idea on a walk and needs it as a document to share. A doctor or lawyer recorded a quick note between appointments and needs it appended to a case file. A novelist talked through a scene out loud and now wants the rough draft on a screen.
All of these situations need three things: a way to get the .m4a off the phone, a transcription engine that does not charge per minute or upload private audio, and an output that can be pasted somewhere useful. StarWhisper covers the last two, and the first one is mostly a matter of knowing your options.
Apple added in-app transcription to Voice Memos in iOS 18. It works, sometimes, but it has narrow boundaries that send a lot of users looking for an alternative. Here is what each path actually delivers.
| Feature | iOS Voice Memos (built-in) | StarWhisper on Windows |
|---|---|---|
| Languages | English plus a short list, accuracy varies sharply | 96 languages with consistent accuracy |
| OS requirement | iOS 18 or newer | Any Voice Memo on any iOS, transferred to Windows 10 or 11 |
| Cost | Free, but Apple-only | Free up to 500 words/day, $10/mo or $80/yr Pro for more |
| Where the transcript lives | Inside Voice Memos on the iPhone | On the PC where you actually edit and share |
| Output formats | Display only | Plain text, SRT, VTT, copy to clipboard |
| Privacy | Apple-controlled on-device processing | Local Mode runs Whisper on your PC, no upload |
The honest summary: if you have a recent iPhone, you recorded in English, and you only need the transcript on your phone, the built-in tool is convenient. For everything else (older memos, other languages, Windows workflow, batch jobs), a desktop transcriber on your PC is the better fit.
This is the step that people get stuck on. The Voice Memos app does not expose iCloud sync to Windows directly, and dragging files between iPhone and File Explorer is not as obvious as it should be. Here are five proven routes, ordered roughly by speed.
Plug the iPhone in, accept the trust prompt, then open File Explorer. The iPhone shows up under "This PC" as a removable device. Voice Memos live in a specific Apple folder, but the simpler path is to open Voice Memos on the phone first, tap Share, choose Save to Files, then connect the iPhone and copy the .m4a out of the iCloud Drive or On My iPhone folder you saved it to.
Tap Share in Voice Memos, pick Mail, send to your own address. Open the email on Windows, download the attachment. This is the slowest route for long files (Mail attachments cap around 25 MB) but the easiest one to remember.
Share to a chat with yourself, open the desktop client on Windows, save the file. Telegram handles files up to 2 GB free, which covers any plausible Voice Memo length.
Save to Files in iCloud Drive, then on Windows open icloud.com in any browser, sign in, download from the Drive view. No iCloud for Windows app needed.
If you have any Mac, AirDrop the .m4a to it, then transfer to Windows via shared folder, USB stick, or any chat app. Round-trip but very fast for large files.
StarWhisper detects the audio format automatically. iPhone Voice Memos use AAC inside an .m4a container, which Whisper handles directly. The app picks a Whisper model that fits your hardware. On an NVIDIA GPU with a CUDA pack installed, it uses the large model and runs at multiples of real-time speed. On Intel or AMD GPUs it can use Vulkan. On CPU-only systems it falls back to a smaller model that still gets accurate results, just at slower speed.
Transcription happens entirely locally in Local Mode, which is the default. The .m4a file is read, processed, and the transcript is held in app memory and shown on screen. Nothing is uploaded to a server, nothing is logged, and the audio is not retained anywhere except where you already saved it. There is an opt-in Cloud Mode that uses the paid OpenAI Whisper API for the maximum-accuracy large model when your CPU cannot keep up, but it is clearly labeled and never used by surprise.
When the transcript is ready you can copy it to the clipboard, paste straight into another app, save it as .txt, or export with timestamps as SRT or VTT. The clipboard path is the fastest for a quick note. The .txt path is the most portable. The SRT path is useful if you intend to use the audio as a subtitle track later, for example over a slideshow or screen recording.
Apple's in-app transcription tilts heavily toward English. If you record voice memos in Polish, Czech, Hungarian, Romanian, Vietnamese, Thai, Indonesian, Hindi, Tamil, Bengali, Arabic, Turkish, Swahili, or many other languages, the iOS tool either does not support the language at all or produces results that are unusable for any real purpose. Whisper supports 96 languages and tends to do well on all of them.
Bilingual or code-switched recordings are another place where iOS struggles and StarWhisper does fine. If you record a conversation that switches between English and Spanish or English and Tagalog, Whisper transcribes both sides without you needing to set a language manually. For users who think and speak in multiple languages, this is usually the decisive feature.
Anyone working in journalism who records interviews in foreign languages, researchers capturing field notes abroad, or anyone whose phone has a backlog of memos from travel will see the difference on the first recording.
Voice Memos are personal. They are how-tos for yourself, half-formed ideas, complaints recorded after a meeting, family voice notes, voice journals, confidential thoughts you only said out loud because the recorder was rolling. Sending these to a cloud service for transcription means trusting that service with audio that you would not normally publish.
Free online voice memo transcribers usually upload the file to a server, process it, and email or display the result. Many also retain the audio for some period to "improve the service." For anyone who recorded something they would not want a third party to keep, this is a problem.
StarWhisper Local Mode skips all of that. The .m4a is read directly off your disk, Whisper runs on your hardware, and the transcript is written back to your disk. Nothing is uploaded. Nothing is retained beyond what you choose to save. For situations like therapy notes, medical thoughts, legal observations, or anything you recorded by accident along with your intended note, that matters.
Other recordings you might want as text.
Recorded interviews to clean text for quoting and articles.
Zoom, Teams, Google Meet recordings to searchable notes.
Long-form audio to readable transcripts for show notes.
Field interviews, fast turnaround, foreign-language coverage.