Hapi

Transcribe Zoom Meetings on Mac: Every Method Compared (2026)

Cloud Recording + AI Companion, Otter for Zoom, third-party bots, and local Mac capture — what each path actually costs you in privacy, plan tier, and accuracy.

5 min read·Meetings

Zoom is still the default video conferencing tool for many teams, and "how do I transcribe a Zoom meeting" remains one of the most-searched workflow questions on macOS. The honest answer in 2026 is that there are five different paths, each with real trade-offs across plan tier, host control, privacy, and accuracy.

This guide walks through every option, then explains where local Mac capture actually wins.

Five Ways to Transcribe a Zoom Meeting

1. Zoom Cloud Recording with Transcription

Available on Pro and higher. Recording goes to Zoom's cloud, transcription runs as a post-processing job, and the transcript is downloadable as VTT or TXT.

  • ✅ Built-in, well-aligned to Zoom's UI
  • ✅ Speaker labels by Zoom identity
  • ❌ Host-only — attendees cannot trigger or download
  • ❌ Audio + transcript live on Zoom's cloud
  • ❌ Counts against cloud-recording quota; older recordings are auto-deleted unless backed up

2. Zoom AI Companion

Bundled with most paid Zoom plans. Provides live captions, post-meeting summaries, and chat-style queries against the transcript ("what did Sarah commit to?"). Not always equivalent to a literal verbatim transcript — the surface-level summary is the primary deliverable.

  • ✅ Live captions in-meeting
  • ✅ Smart summaries with action items
  • ❌ Host enables it; attendees see captions but cannot retrieve the artifact
  • ❌ Underlying LLM stack may include third-party providers (OpenAI, Anthropic, others) per Zoom's published Trust Center
  • ❌ Quality of summaries varies; literal transcript fidelity is secondary

3. Local Recording

Zoom's "Record on this Computer" option saves an MP4 to your Mac. No transcription is performed — just audio + video. You then need a separate tool to transcribe the recording.

  • ✅ Local file you own
  • ✅ Available on free plans
  • ❌ No transcript, just media
  • ❌ Host can disable local recording for attendees

4. Third-Party Meeting Bots (Otter for Zoom, etc.)

Bots that join your meeting as a participant and stream audio to a cloud service for transcription.

  • ✅ Cross-platform (works regardless of Zoom plan as long as the bot is admitted)
  • ❌ Adds an awkward "[Tool] is recording" participant — distracting and sometimes not allowed
  • ❌ Audio leaves your device by design
  • ❌ Subscription cost on top of Zoom

5. Local Mac Capture

A menu-bar app on your Mac uses ScreenCaptureKit to record both your microphone and system audio, then runs transcription locally on the Neural Engine. No bot, no cloud, no host dependency.

  • ✅ Works for any meeting you can hear, on any plan
  • ✅ Audio stays on the Mac
  • ✅ No participant footprint in the meeting
  • ✅ Free on the local-tool side
  • ❌ One-Mac-per-attendee (does not give you the host's clean per-participant streams)

Side-by-Side: What Each Method Actually Gives You

CapabilityCloud RecordingAI CompanionLocal RecordingThird-Party BotLocal Mac Capture
Free Zoom plan✅ (no transcript)
Works as attendeeSometimes
Audio leaves device
Speaker labelsBy identityBy identityNoneDiarizedDiarized
Real-time captionsVariesPartial drafts
Action items✅ (local LLM)
Counts against Zoom quotaPartial

Why Privacy Is Not Theoretical for Zoom

Zoom has had a higher-than-average number of public privacy incidents — the 2020 routing-through-China issue, the historic E2E encryption misstatement, the AI Companion data-use clarifications in 2023, and various enterprise-tenant configuration leaks. Many of those were addressed; some required Cisco-level enterprise contracts to fully resolve.

For sensitive meetings — clinician calls, legal depositions, journalist interviews, M&A discussions — keeping the audio off Zoom's infrastructure is a defensible architectural choice. It does not mean Zoom is untrustworthy; it means that for material covered by privilege or compliance, the cleanest answer is to never put it on a vendor's pipeline in the first place.

Setting Up Local Mac Capture for Zoom

A practical setup that works on macOS Sonoma and later:

  1. Install Hapi from the website. Grant Microphone and Screen Recording permissions.
  2. Join the Zoom meeting normally. No browser extension, no virtual audio device, no kernel hack.
  3. Hapi auto-detects that Zoom is active and begins capturing mic + system audio in 16 kHz mono.
  4. Talk freely. No bot in the participant list, no "this meeting is being recorded" UI for other attendees from Hapi's side. Note: Zoom shows its own recording indicator if the host enables cloud recording, independent of Hapi.
  5. End the meeting. Hapi runs Parakeet for transcription, ECAPA-based diarization for speaker labels, and a local LLM for the summary and action items.
  6. Review, edit, search, export. Output formats: TXT, Markdown, JSON, SRT, VTT.

Total elapsed time after a 60-minute meeting is typically under 5 minutes on Apple Silicon, with no network usage.

Bottom Line

Zoom's built-in transcription is fine when you control the host account and accept Zoom as a sub-processor. It falls apart the moment you join meetings as a guest, work across companies, or handle material covered by professional privilege. Local Mac capture is the only path that gives every attendee — host or guest — the same artifact, without sending audio anywhere.

Related