Voice Notes & Dictation7 min read·

How to Transcribe Podcasts on Mac (Free Local Methods)

Learn how to transcribe podcast episodes on Mac for free using local AI tools. Get accurate transcripts with speaker labels without uploading audio to the cloud.

transcribe-podcasts-macpodcast-transcriptiontranscriptionmacOSspeech-to-textcontent-creation

How to Transcribe Podcasts on Mac

Podcast transcription serves multiple purposes: accessibility, SEO, content repurposing, and show notes. On Mac, you can transcribe episodes locally without uploading audio to cloud services.

This guide covers the best methods for transcribing podcasts on Mac.

Why Transcribe Podcasts?

Accessibility

Transcripts make your podcast accessible to:

  • Deaf and hard-of-hearing audiences
  • Non-native speakers who prefer reading
  • People in sound-sensitive environments
  • Screen reader users

SEO Benefits

Search engines can't listen to audio, but they can index text:

  • Episode pages rank for spoken keywords
  • Long-form content improves domain authority
  • Transcripts provide internal linking opportunities

Content Repurposing

One transcript enables:

  • Blog post summaries
  • Social media quotes
  • Newsletter content
  • YouTube video captions
  • Audiogram clips

Show Notes

Detailed transcripts make better show notes:

  • Accurate timestamps
  • Complete quotes
  • Name spellings verified
  • Links mentioned

Method 1: Hapi — Best for Individual Episodes

Hapi provides simple drag-and-drop transcription for podcast files.

How to Transcribe a Podcast with Hapi

  1. Open Hapi (download from speakhapi.com)
  2. Drag your audio file (mp3, m4a, wav) into the Hapi window
  3. Wait for transcription — typically 2-5x faster than audio length
  4. Review and edit — fix names, technical terms
  5. Export — TXT, SRT, VTT, or Markdown

Features for Podcast Transcription

Speaker detection: Hapi identifies different voices and labels them (Speaker 1, Speaker 2). After transcription, replace labels with actual names.

Smart formatting: Automatic punctuation and paragraph breaks based on pauses — no need to add manually.

Multiple formats: Export as plain text for blog posts, SRT/VTT for video platforms, or Markdown for static site generators.

Local processing: Your unreleased episodes never leave your Mac.

Record voice notes with a hotkey.

Auto-paste anywhere.

Download Hapi — Free

Method 2: Whisper.cpp — Batch Processing

For transcribing multiple episodes or automating your workflow.

Setup

# Clone and build
git clone https://github.com/ggerganov/whisper.cpp
cd whisper.cpp
make

# Download model (large recommended for podcasts)
./models/download-ggml-model.sh large-v3

Transcribe a Single Episode

# Convert to WAV if needed (Whisper.cpp prefers WAV)
ffmpeg -i episode.mp3 -ar 16000 -ac 1 episode.wav

# Transcribe
./main -m models/ggml-large-v3.bin -f episode.wav -otxt

Batch Transcribe Multiple Episodes

#!/bin/bash
# Save as transcribe-episodes.sh

for file in *.mp3; do
    # Convert to WAV
    ffmpeg -i "$file" -ar 16000 -ac 1 "${file%.mp3}.wav"

    # Transcribe
    ./main -m models/ggml-large-v3.bin -f "${file%.mp3}.wav" -otxt

    # Clean up WAV
    rm "${file%.mp3}.wav"

    echo "Completed: $file"
done

Output Formats

# Plain text
./main -m models/ggml-large-v3.bin -f episode.wav -otxt

# SRT subtitles
./main -m models/ggml-large-v3.bin -f episode.wav -osrt

# VTT subtitles
./main -m models/ggml-large-v3.bin -f episode.wav -ovtt

# JSON with timestamps
./main -m models/ggml-large-v3.bin -f episode.wav -ojson

Limitations

  • No GUI — command line only
  • No speaker detection (single text block)
  • Requires some technical setup
  • Slower than Hapi on Apple Silicon (doesn't use Neural Engine)

Best for: Podcasters who want automated batch processing or CI/CD integration.

Method 3: MacWhisper — GUI Alternative

A paid Mac app wrapping Whisper for file transcription.

Features

  • Drag-and-drop interface
  • Multiple model options
  • Export formats
  • Batch processing

Pricing

  • Free tier: Limited features
  • Pro: $30 one-time
  • Pro+: $60 one-time

When to Choose MacWhisper

  • Want a GUI but don't need speaker detection
  • Primarily transcribe files (not real-time recording)
  • Willing to pay for a focused tool

Method 4: Cloud Services

When local processing doesn't meet your needs.

Descript

  • Full editing suite
  • Overdub for corrections
  • Video editing included
  • $15-30/month

Best for: Podcasters who also edit in the same tool.

Otter.ai

  • Good accuracy
  • Speaker labels
  • Searchable archive
  • $16.99/month

Best for: Teams who need collaboration features.

Rev

  • Human + AI options
  • High accuracy
  • Pay per minute
  • $0.25-1.50/minute

Best for: One-off transcription of important episodes.

Comparison: Podcast Transcription Options

MethodPriceSpeaker LabelsBatchProcessing
HapiFreeYesNoLocal
Whisper.cppFreeNoYesLocal
MacWhisper$30-60NoYesLocal
Descript$15-30/moYesYesCloud
Otter.ai$16.99/moYesYesCloud
Rev$0.25+/minYesYesCloud

Optimizing Transcription Quality

Recording Quality Matters

Transcription accuracy depends heavily on audio quality:

  • Use quality microphones
  • Record in quiet environments
  • Minimize background music
  • Avoid heavy compression in editing

Pre-Processing Tips

If accuracy is low, try:

Noise reduction: Use Audacity or your DAW to reduce background noise before transcription.

Leveling: Normalize audio so all speakers are similar volume.

Remove music: Intro/outro music can confuse transcription. Consider removing or transcribing those segments separately.

Model Selection

Whisper model sizes for podcasts:

ModelSizeSpeedBest For
base150MBFastestClear audio, single speaker
small500MBFastGood audio, 2-3 speakers
medium1.5GBModerateVariable audio quality
large-v33GBSlowestMaximum accuracy

Recommendation: Use large-v3 for podcasts. The speed difference is minimal compared to accuracy gains.

Post-Transcription Editing

Always review transcripts for:

  • Names: People, companies, products
  • Technical terms: Industry jargon, acronyms
  • Numbers: Dates, statistics, prices
  • Homophones: "their/there/they're" type errors

Workflow for Regular Podcast Transcription

Weekly Podcast Workflow

  1. Record episode (as normal)
  2. Export audio (mp3 or m4a)
  3. Transcribe with Hapi (drag-and-drop)
  4. Edit transcript (15-30 minutes for a 1-hour episode)
  5. Export (Markdown for blog, TXT for show notes)
  6. Publish (transcript on episode page)

Automation Ideas

For podcasters publishing frequently:

  • Create a Shortcut that opens Hapi with the latest export
  • Use Whisper.cpp in a script triggered by folder watch
  • Set up a GitHub Action to transcribe when audio is pushed

Common Questions

How accurate is local podcast transcription?

For clear podcast audio with good microphones, expect 95%+ accuracy. Multiple speakers with crosstalk, background music, or poor audio quality will reduce accuracy.

Can I transcribe video podcasts?

Yes. Extract the audio track first:

ffmpeg -i video.mp4 -vn -acodec pcm_s16le -ar 16000 -ac 1 audio.wav

Then transcribe the audio file.

How do I add speaker names instead of "Speaker 1"?

After transcription in Hapi, use find-and-replace:

  • Find: "Speaker 1:"
  • Replace: "Host:"

Or export to text and edit in your preferred editor.

What about transcribing old episodes?

The same methods work for old episodes. If you have archives going back years, use Whisper.cpp batch processing to transcribe entire back catalogs.

Should I transcribe live recordings?

For live podcast recordings (no post-production), transcription may have more errors due to:

  • Background audience noise
  • Cross-talk between speakers
  • Uneven audio levels

Consider cleaning the audio before transcription, or accept that live show transcripts need more editing.

Summary

Podcast transcription on Mac works best with local tools:

  • Individual episodes: Hapi — drag-and-drop, speaker labels, free
  • Batch processing: Whisper.cpp — automated, flexible, free
  • GUI preference: MacWhisper — simple interface, one-time payment

Local transcription means your unreleased episodes stay private, you're not limited by subscription minutes, and processing speed depends only on your Mac — not your internet connection.

Why Hapi?

  • 100% local — nothing sent to the cloud
  • 25+ languages with auto-detection
  • Meeting recording with speaker labels
  • Free — no subscription

Transcribe anything on your Mac.

100% local. No cloud. No subscription.

Download Hapi — Free

Related Posts