Google Meet Transcription Guide

How to Transcribe a Google Meet Call for Free
No Workspace Required

Google's official Meet transcription is locked behind Workspace Business Standard at about 14 dollars per user per month. This guide shows the free workaround: a Meet recording, a free Windows app, and a transcript in minutes.


Five Steps from Google Meet to Transcript

Works with or without a Workspace subscription.

Step 1: Get a Meet recording

If your account is on Workspace Business Standard or higher, recording is built in: click the three-dot menu inside Meet, then Record meeting. If you are on a personal Google account or a Workspace tier without recording, run a free local screen recorder before joining the call. OBS Studio is the most common choice. The built-in Xbox Game Bar (Win+G) and ShareX both work too. Whichever you pick, make sure "Capture system audio" is enabled, or you will record video without sound.

Step 2: Download the MP4

If Google Meet recorded the call for you, it saves the file as an MP4 in the Meet Recordings folder inside My Drive. Open Drive, find the file, and download it. If you used a screen recorder, the file is already on your disk wherever you told the recorder to save it. Either way you should end up with a single .mp4 file. File sizes typically run 50 to 300 MB per hour depending on resolution and screen-share content.

Step 3: Install StarWhisper

Download StarWhisper from the homepage. The installer is around 200 MB on first run because it bundles the Whisper model. There is no account signup, no credit card, and no cloud component required. After install, launch the app once and complete the 30-second setup (pick your default microphone, choose a hotkey, accept defaults for everything else). You are now ready to transcribe.

Step 4: Drag the MP4 into StarWhisper

Open File Explorer to wherever you saved the recording. Drop the .mp4 onto the StarWhisper window. The app extracts the audio track automatically, detects the spoken language, and starts transcribing. A 60-minute call typically takes 5 to 15 minutes on a recent laptop CPU, or 1 to 3 minutes on an NVIDIA GPU using the CUDA acceleration pack. Progress shows in real time. You can minimize the window and keep working.

Step 5: Review and export the transcript

When the run finishes, the transcript appears in the StarWhisper window. Read it on screen, copy the full text to the clipboard, or save it to a .txt file. Paste it into Google Docs, Notion, Confluence, OneNote, or your team's notes system. The transcript is plain text without speaker labels, which keeps the file portable. Total cost: zero. No Workspace upgrade, no Otter subscription, no per-minute transcription fee.

Why This Beats Workspace Business Standard for Transcripts

A free workflow that does what the 14 dollar per user plan does.

Zero ongoing cost

Workspace Business Standard is roughly 14 dollars per user per month, or 168 per year per seat. For a five-person team that is 840 a year just to enable transcription. This workflow stays free for occasional transcribers, or 10 dollars per month per person on Pro if you transcribe long meetings every day.
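The arithmetic above can be checked with a short sketch. The prices are the approximate figures quoted in this guide (14 dollars per user per month for Business Standard, 10 dollars per month or 80 dollars per year for Pro); the function names are illustrative only.

```python
# Approximate prices quoted in this guide, in US dollars.
WORKSPACE_MONTHLY_PER_USER = 14
PRO_MONTHLY_PER_USER = 10
PRO_YEARLY_PER_USER = 80


def workspace_annual_cost(seats: int) -> int:
    """Yearly cost of Workspace Business Standard for a team."""
    return WORKSPACE_MONTHLY_PER_USER * 12 * seats


def pro_annual_cost(seats: int, yearly_billing: bool = False) -> int:
    """Yearly cost of the Pro plan, billed monthly or yearly."""
    per_seat = PRO_YEARLY_PER_USER if yearly_billing else PRO_MONTHLY_PER_USER * 12
    return per_seat * seats


if __name__ == "__main__":
    seats = 5
    print("Workspace Business Standard:", workspace_annual_cost(seats))
    print("Pro, billed monthly:", pro_annual_cost(seats))
    print("Pro, billed yearly:", pro_annual_cost(seats, yearly_billing=True))
```

Team members who only transcribe occasionally and stay on the free tier add nothing to the Pro total, so the real figure depends on how many seats actually need Pro.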

Audio stays on your device

The MP4 sits on your hard drive. StarWhisper runs the transcription locally on your CPU or GPU. Nothing is uploaded to Google, OpenAI, or any third party during the transcription itself. See the privacy and offline architecture page for details.

Works for any meeting platform

Same workflow handles Zoom, Microsoft Teams, Webex, Slack Huddles, and anything else you can record. The transcription engine treats every recording as just an audio file. See the related guides on transcribing Zoom calls for the platform-specific recording steps.

96 languages for international calls

Distributed teams running Meet calls in English, Spanish, German, French, Japanese, and Mandarin all benefit. Whisper auto-detects the spoken language. See the multi-language support page for details.

Real-time dictation in any text field

Same install also lets you dictate into Google Docs, Chat, or any Windows text field by holding a hotkey. See the voice-to-text in Google Docs guide for the press-and-hold workflow.

GPU acceleration available

NVIDIA GPU owners process a one-hour call in 1 to 3 minutes via CUDA 11 or 12. Without a GPU, modern CPUs handle the same workload in 5 to 15 minutes. Either path is faster than re-listening to the meeting.

Why Google Meet Transcription Is So Locked Down

Google Meet has live captions for free, but the official transcript export is gated behind Workspace Business Standard or higher. Business Standard is about 14 dollars per user per month, billed annually. For a solo freelancer or a small team where only a few calls per month actually need a transcript, that is a heavy line item. Many teams keep the free tier or a cheaper Workspace plan and end up either taking handwritten notes or paying for an outside service like Otter or Fireflies on top of Workspace.

The cheaper outside services have their own tradeoffs. They join the meeting as a bot, which announces itself in the participant list and unsettles people on confidential calls. They upload meeting audio to their servers, which is a problem for legal, medical, HR, or M&A discussions. And they add another monthly subscription on top of Workspace, which is what people were trying to avoid by not upgrading to Business Standard in the first place.

This guide describes the workflow most independent transcribers and privacy-minded teams settle on. Capture the Meet call (either with Workspace recording, with a free screen recorder, or with one shared by a colleague), then transcribe the resulting MP4 with StarWhisper on your Windows PC. Free, local, no bot in the meeting, no per-minute fee.

Recording a Google Meet Call Without Workspace

The trick to this whole workflow is having a recording in the first place. Three common ways to get one:

Option A: Workspace Business Standard recording (if your account has it)

Inside the meeting, click the three-dot menu, choose Record meeting, and confirm the prompt that asks you to notify participants. When the call ends, the recording is processed and lands in the Meet Recordings folder inside My Drive, usually within a few minutes. You get an MP4 with mixed audio of everyone plus video of the active speaker and any screen share.

Option B: Local screen recorder (free and platform-agnostic)

Before joining the call, start OBS Studio, the built-in Xbox Game Bar (Win+G in Windows 10/11), ShareX, or any other screen recorder. The critical setting is "Capture desktop audio" or "Record system sound", which records what your computer is playing through its speakers. Without this you get video only. Pick MP4 as the output format if your recorder offers a choice. Start recording right before you join the call and stop it after everyone leaves.

Option C: Recording someone else shared with you

If the host or another participant recorded the call and sent you a Drive link, click the link, download the MP4, then jump to step three.

Always tell the other participants you are recording. Consent laws vary: many jurisdictions allow recording with one party's consent, while others require consent from everyone on the call, so disclosure is both the safe legal default and good professional practice. Some workplaces and contracts explicitly forbid local capture of internal meetings, so check before you rely on this for sensitive calls.

What the Recording Actually Contains

A Google Meet recording (either Workspace or screen-recorder) is a single MP4 with one mixed audio track containing every voice plus a single video track of whoever was on screen at the time. The audio is what matters for transcription. There are no per-speaker channels, so neither StarWhisper nor any other single-track transcriber can automatically label who said what.
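If a screen recorder was misconfigured, the MP4 may contain video but no audio. A quick way to check before transcribing is to inspect the file's streams. The sketch below assumes the separate ffprobe tool (part of FFmpeg) is installed and on your PATH; it is not part of StarWhisper, and the function names and file name are illustrative.

```python
import json
import subprocess
from typing import Any, Dict


def probe_streams(path: str) -> Dict[str, Any]:
    """Ask ffprobe for a JSON report of every stream in the file."""
    result = subprocess.run(
        ["ffprobe", "-v", "error", "-print_format", "json", "-show_streams", path],
        capture_output=True,
        text=True,
        check=True,
    )
    return json.loads(result.stdout)


def count_audio_streams(report: Dict[str, Any]) -> int:
    """Count the streams whose codec_type is 'audio' in an ffprobe report."""
    streams = report.get("streams", [])
    return sum(1 for stream in streams if stream.get("codec_type") == "audio")


if __name__ == "__main__":
    # "meeting.mp4" is a placeholder; point this at your own recording.
    report = probe_streams("meeting.mp4")
    if count_audio_streams(report) == 0:
        print("No audio track found: re-check the recorder's system audio setting.")
    else:
        print("Audio track present; the file is ready to transcribe.")
```

A typical Meet recording should report exactly one audio stream, matching the single mixed track described above.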

StarWhisper will produce a clean continuous transcript with sentence breaks and natural punctuation. For typical action-items-and-decisions meetings this is fine: skim the transcript, mentally attribute the lines to whoever you remember speaking, lift out the four or five decisions and action items, share with the team. If you need formal verbatim transcripts with speaker labels (court proceedings, depositions, academic research interviews), you will need either a paid cloud diarization service or a multi-microphone setup where each speaker has their own track.

Speed and Hardware Expectations

Transcribing a recorded meeting is faster than real time, sometimes much faster. Approximate runtimes for the default medium Whisper model across common hardware:

Hardware                            30-min meeting   60-min meeting   2-hr meeting
Modern laptop CPU (i7 or Ryzen 7)   3 to 6 min       6 to 12 min      12 to 25 min
NVIDIA RTX 3060 (CUDA)              30 to 60 sec     1 to 2 min       2 to 5 min
NVIDIA RTX 4090 (CUDA)              10 to 20 sec     20 to 40 sec     1 to 2 min
Older CPU (5+ years)                10 to 20 min     25 to 45 min     50 to 90 min

For most office laptops bought in the last three years, expect a one-hour Meet recording to finish transcribing in 6 to 12 minutes. If you do this regularly and have an NVIDIA GPU in the machine, the CUDA pack cuts that to 1 to 2 minutes on an RTX 3060 (about six times faster) or under a minute on an RTX 4090.
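For meeting lengths not listed in the table, a rough estimate can be made by scaling the 60-minute column linearly. This is a simplification: the table's own 30-minute and 2-hour figures do not scale perfectly, so treat the result as a ballpark. The hardware keys below are labels invented for this sketch.

```python
from typing import Dict, Tuple

# Processing time in seconds per 60 minutes of audio (low, high),
# copied from the 60-minute column of the table above.
SECONDS_PER_HOUR_OF_AUDIO: Dict[str, Tuple[int, int]] = {
    "laptop_cpu": (360, 720),    # 6 to 12 min
    "rtx_3060": (60, 120),       # 1 to 2 min
    "rtx_4090": (20, 40),        # 20 to 40 sec
    "older_cpu": (1500, 2700),   # 25 to 45 min
}


def estimate_seconds(hardware: str, meeting_minutes: float) -> Tuple[float, float]:
    """Scale the one-hour range linearly to another meeting length."""
    low, high = SECONDS_PER_HOUR_OF_AUDIO[hardware]
    scale = meeting_minutes / 60
    return (low * scale, high * scale)


if __name__ == "__main__":
    low, high = estimate_seconds("laptop_cpu", 45)
    print(f"45-minute call on a laptop CPU: {low / 60:.1f} to {high / 60:.1f} minutes")
```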

International Calls and Multi-Language Meetings

Distributed teams running Meet calls across Berlin, Tokyo, and Sao Paulo are a major use case for this workflow. Whisper supports 96 languages with strong accuracy in English, German, Spanish, French, Italian, Portuguese, Dutch, Polish, Japanese, Chinese, Korean, Hindi, Russian, Arabic, and Turkish, among others. The model auto-detects the spoken language at the start of the file.

For meetings where speakers switch languages mid-call (a common European pattern), Whisper handles short code switches reasonably well, though it commits to a primary language. If you have a half-Spanish half-English meeting, you may get better results by splitting the recording into two clips and transcribing each in its declared language. The multi-language feature page covers per-language accuracy in more detail.
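Splitting a recording into two clips can be done with any video editor. As one possible approach, the sketch below builds an FFmpeg command that copies a time range into a new file without re-encoding. It assumes FFmpeg is installed separately; the file names and timestamps are placeholders, and because the streams are copied rather than re-encoded, the video cut may land on the nearest keyframe rather than the exact second.

```python
import subprocess
from typing import List


def split_command(source: str, target: str, start: str, end: str) -> List[str]:
    """Build an ffmpeg command that copies [start, end] of source into target.

    Timestamps use HH:MM:SS. "-c copy" avoids re-encoding, so the cut is fast.
    """
    return ["ffmpeg", "-i", source, "-ss", start, "-to", end, "-c", "copy", target]


if __name__ == "__main__":
    # Placeholder example: first half in Spanish, second half in English.
    clips = [
        ("meeting_es.mp4", "00:00:00", "00:30:00"),
        ("meeting_en.mp4", "00:30:00", "01:00:00"),
    ]
    for target, start, end in clips:
        subprocess.run(split_command("meeting.mp4", target, start, end), check=True)
```

Each clip can then be dropped into StarWhisper on its own.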

Translation is also possible. StarWhisper can take a non-English recording and transcribe it directly into English text using Whisper's translate mode. This is useful for internal teams in the US or UK trying to follow a partner meeting in another language without paying a translator. Quality is generally good for major languages and degrades for less-common ones.
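For readers curious what translate mode looks like at the model level, here is a minimal sketch using the open-source openai-whisper Python package as a stand-in. StarWhisper's own internals are not documented in this guide, so this is not a description of how the app is implemented; the helper names and the file name are hypothetical.

```python
from typing import Dict, Optional


def build_options(translate_to_english: bool = False,
                  language: Optional[str] = None) -> Dict[str, str]:
    """Build keyword options for Whisper's transcribe() call.

    task="translate" produces English text from non-English speech;
    language (for example "es") skips auto-detection when you already know it.
    """
    options = {"task": "translate" if translate_to_english else "transcribe"}
    if language is not None:
        options["language"] = language
    return options


def run_whisper(path: str, **options: str) -> str:
    """Transcribe a file with the open-source package (pip install openai-whisper)."""
    import whisper  # imported lazily so the helper above works without it

    model = whisper.load_model("medium")
    result = model.transcribe(path, **options)
    return result["text"]


if __name__ == "__main__":
    # "partner_call.mp4" is a placeholder file name.
    print(run_whisper("partner_call.mp4", **build_options(translate_to_english=True)))
```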

Privacy: What Stays Local and What Does Not

This workflow keeps the meeting audio and transcript on your device. Workspace recordings live in your own Google Drive; you control sharing. Screen-recorder recordings save to your hard drive. StarWhisper Local Mode processes the file locally on CPU or GPU. The transcript output is a plain .txt file on your PC. None of this leaves your network unless you choose to share it (paste into a cloud doc, email it, upload it).

Compare this to cloud transcription services. Otter, Fireflies, Notta, and similar tools join the call as a bot and upload audio to their servers. Even Google's own transcription processes audio in Google's cloud. For confidential calls (M&A discussions, performance reviews, customer interviews under NDA, legal strategy, medical case reviews) the local-only workflow is a meaningful improvement in data control.

If you are in a regulated industry, the same architecture supports your compliance posture. The HIPAA compliance FAQ covers what local processing means for protected health information specifically.

For Sales, HR, and Customer-Success Teams Specifically

Recruiters running candidate screens, sales reps on discovery calls, and CS leaders doing renewals all want transcripts but rarely justify a separate transcription line item. The workflow here is the same as any other meeting: record locally, transcribe afterward. For sales teams doing volume work, the voice-to-text for sales reps guide covers integration with CRMs. For HR and recruiting workflows, the voice-to-text for HR managers page covers candidate screening transcripts and the confidentiality requirements that come with them. For a deeper integration with Teams instead of Meet, the voice-to-text in Teams guide is the direct equivalent.

Frequently Asked Questions

Do I need Google Workspace to transcribe my Google Meet calls?
No. Google's official meeting transcripts require Workspace Business Standard, which is currently around 14 dollars per user per month. With this workflow you skip that line item entirely. You either use an existing recording (whether yours, or one a colleague shared with you), or you capture the call with a free local screen recorder like OBS Studio. StarWhisper then transcribes the resulting file offline on your Windows PC. Total cost: zero, unless you outgrow the StarWhisper free tier of 500 words per day.
What about Google Meet's free live captions?
Meet does offer free live captions during a call. They are useful while the meeting is happening, but they are not downloadable as a transcript. As soon as the call ends, those captions are gone. There is no Save button, no export, and no way to pull them out of Meet after the fact. If you want a permanent searchable record of what was said, you need either a Workspace tier with transcription enabled or a recording plus a transcription tool like the one described here.
What file format does Google Meet save recordings in?
Google Meet recordings save as MP4 video files into your Google Drive, in a folder called Meet Recordings inside My Drive. The MP4 contains both the video grid (and any screen share) and the mixed audio of all participants. For transcription you only need the audio, but you do not have to extract it manually. Drop the .mp4 straight into StarWhisper and the app pulls the audio track automatically. The original file in Drive is left untouched.
Can I get speaker labels (who said what) in the transcript?
Not from this workflow. Google Meet recordings are a single mixed audio track with no per-speaker channels. StarWhisper does not currently do automatic speaker diarization either, so the transcript comes back as continuous text. For most action-items-and-decisions purposes this is fine and quick to clean up. If speaker labels are essential, the alternatives are paid cloud services like Otter or Fireflies, which upload your audio to their servers in exchange for diarization.
What about meetings I am not the host of?
If the host shares a recording with you (via a Drive link), download the MP4 and drop it into StarWhisper. Same workflow. If the host did not record, you can request a recording, or you can capture the call locally with a screen recorder yourself. Always tell the other participants you are recording. Consent laws vary: many jurisdictions allow recording with one party's consent, while others require consent from everyone on the call, so disclosure is the safe default. Some workplaces and contracts forbid local capture of meetings, so check before relying on this for client or internal calls.
Does this work for Zoom and Microsoft Teams too?
Yes. The transcription engine does not care which platform the meeting was on. For Zoom, use Local Recording (free for hosts) and drag the audio_only.m4a file into StarWhisper. For Teams, use the built-in Record button and grab the MP4 from OneDrive or SharePoint, or use the same screen-recorder approach. There are dedicated guides for each platform that walk through the platform-specific recording steps.
Does the audio leave my device?
No. StarWhisper runs in Local Mode by default. The MP4 (or whatever recording format you drop in) is processed entirely on your CPU or GPU using a Whisper model stored on your machine. Nothing is uploaded to OpenAI, Google, or any third party during transcription. You can verify this by disconnecting your network and running a transcription; the app keeps working. This matters for confidential calls (client interviews, candidate screens, internal reviews) where uploading the audio to a cloud transcription service is not acceptable.
Is it really free, or are there hidden limits?
The Windows app is free to download and use. The free tier caps you at 500 words per day of transcription output (or 3,500 per week), which is roughly 3 to 5 minutes of conversation per day at speaking rates of 100 to 150 words per minute. For short clips and occasional use this is often enough. If you regularly transcribe long meetings, the Pro plan is 10 dollars per month or 80 dollars per year and removes the word limit. There is no per-minute fee, no upload fee, and no contract. Zero hidden costs.
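To judge whether the free tier covers your usage, the word cap can be converted into minutes of speech. The speaking rate is an assumption, not a figure from StarWhisper: conversational speech is often somewhere around 100 to 150 words per minute, so the sketch takes the rate as a parameter.

```python
FREE_WORDS_PER_DAY = 500  # free-tier cap quoted in this guide


def minutes_covered(word_cap: int, words_per_minute: float) -> float:
    """Minutes of speech that fit under a word cap at a given speaking rate."""
    return word_cap / words_per_minute


def words_in_meeting(meeting_minutes: float, words_per_minute: float) -> float:
    """Approximate transcript length for a meeting of continuous speech."""
    return meeting_minutes * words_per_minute


if __name__ == "__main__":
    for rate in (100, 150):  # assumed speaking rates in words per minute
        covered = minutes_covered(FREE_WORDS_PER_DAY, rate)
        hour = words_in_meeting(60, rate)
        print(f"At {rate} wpm: cap covers {covered:.1f} min; a 60-min call is ~{hour:.0f} words")
```

Comparing the second number against the daily cap shows quickly whether a given meeting length calls for the Pro plan.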

Stop Paying Workspace Business Just for Transcripts

Free Windows download. Drop any Meet recording in, get a full transcript in minutes. No bot in the meeting, no upload.
