Hapi

iPhone Speech to Text: Complete 2026 Guide (Dictation, Voice Memos, Apps)

Every way to do speech to text on iPhone — built-in dictation, Voice Memos transcription, third-party apps, and how to sync transcripts to Mac.

6 min read·Voice notes

iPhone speech to text has matured dramatically. Five years ago, dictation on iOS required an internet connection, struggled with accents, and was best for short messages. Today, iPhones with Apple Silicon (XS and later) run on-device speech models for many languages, the Voice Memos app transcribes recordings automatically in iOS 18, and third-party apps fill the gaps for long-form work.

This guide covers every way to do speech to text on iPhone in 2026.

Method 1: Built-in iOS Dictation

The fastest path. Works in any text field — Messages, Mail, Safari forms, Notes, third-party apps that use the system keyboard.

Setup

  1. Open Settings
  2. Tap General → Keyboard
  3. Toggle Enable Dictation on
  4. (Optional) Configure dictation languages further down the screen

How to use it

In any text field:

  1. Tap the microphone icon at the bottom-right of the keyboard (next to the spacebar)
  2. Speak naturally
  3. Tap the icon again to stop, or just stop speaking — dictation auto-pauses on silence

Voice commands while dictating

iOS dictation supports many of the same commands as macOS:

SayInserts
Period / comma / question mark. , ?
New line / new paragraphline break / paragraph break
All caps on / offUPPERCASE toggle
Smiley / frowny / winky🙂 🙁 😉
Open quote / close quote" "
Cap [word]Capitalizes the next word

What's new in iOS 17 and 18

  • On-device dictation for English, Spanish, French, German, Japanese, Mandarin, Cantonese, Korean, and more on supported devices (iPhone XS and later running iOS 16+).
  • Continuous dictation — no longer auto-stops at the same aggressive silence threshold. You can pause to think.
  • Mixed dictation + typing — you can type a word, dictate the next sentence, type a correction, and the keyboard handles the seam.

Limitations

  • No filler-word removal. "Um" and "uh" stay in the transcript.
  • No backtrack correction. "Not Tuesday — I mean Wednesday" lands as-typed.
  • Punctuation only via voice commands. No automatic prosody-based commas.
  • Per-language model. You can switch languages, but you can't auto-detect mid-sentence code-switching.

For everyday short notes and messages, these limitations don't matter. For long-form professional work, they do.

Method 2: Voice Memos with Auto-Transcription (iOS 18+)

Voice Memos finally got the transcription feature users had been requesting for a decade. In iOS 18 and later:

  1. Open Voice Memos
  2. Tap the + to record, or open an existing recording
  3. Tap the transcript icon (looks like a quote mark)
  4. The transcript appears beneath the waveform; you can search, copy, and share it

The transcription uses Apple's on-device speech models for supported languages and runs in the background after the recording ends. For a 30-minute recording on iPhone 15 or newer, transcription typically completes in 2-5 minutes.

This is the right tool for:

  • Capturing a meeting or interview where you can leave the phone on the table
  • Recording lectures and field notes for later review
  • Voice journaling

It is the wrong tool for:

  • Real-time dictation into another app (use Method 1)
  • Transcribing audio from a different source (use Method 3 or 4)

Method 3: Third-Party iPhone Speech to Text Apps

If built-in dictation is too limited, third-party apps fill specific gaps:

App typeBest for
Whisper-based local appsPrivacy-conscious users, offline operation, broader language support
Cloud transcription apps (Otter, Notta, Rev)Sharing transcripts across teams, deeper post-processing features
Voice journaling appsReflection, daily logs with sentiment over time
Meeting recorder appsCapturing in-person meetings with speaker diarization

The trade-off matrix is the same as on Mac: cloud apps are feature-rich but send your audio away; local apps protect privacy but need decent on-device hardware.

Method 4: iPhone-to-Mac Sync Workflow

For Mac users with an iPhone, the most powerful workflow combines both:

  1. Capture on iPhone during the day — quick voice notes between meetings, walking thoughts, gym ideas
  2. Sync via iCloud or a tool like Hapi that pairs iOS and macOS apps
  3. Review on Mac in the evening, where the larger screen and keyboard make editing fast

Hapi specifically takes this approach: an iOS companion app records to the same on-device pipeline as the Mac app, and transcripts sync through CloudKit so they appear in the Mac inbox automatically.

Privacy Considerations

iPhone speech to text has unusually clean privacy properties relative to other platforms — but only on-device:

PathAudio leaves device
Built-in dictation, on-device language
Built-in dictation, server-required language✅ (to Apple)
Voice Memos auto-transcription
Cloud transcription apps✅ (to vendor)
Local third-party apps (Whisper-based)

If you're recording sensitive content — medical, legal, journalistic, family — verifying which path your tool actually uses is worth a few minutes. The iOS Settings → Privacy & Security → Microphone screen shows recent microphone access; Settings → General → Keyboard → Dictation indicates whether your active language runs on-device.

Tips for Better iPhone Dictation Accuracy

  1. Hold the phone closer than you think. 6-12 inches from your mouth, not arm's length.
  2. Use a headset for long sessions. Built-in iPhone microphones are good but pick up wind, traffic, and HVAC noise.
  3. Speak in complete sentences. Punctuation comes from prosody on supported languages; choppy fragments degrade output.
  4. Wait a beat between paragraphs. A one-second pause produces a paragraph break reliably.
  5. Edit later, not in the moment. Stopping to fix typos breaks dictation flow worse than the typos themselves.

Bottom Line

iPhone speech to text in 2026 is finally a real tool, not just a typing-shortcut convenience. For most users, the built-in dictation handles 80% of needs. Voice Memos auto-transcription handles long-form recordings without any extra app. Third-party local apps fill the gap for users who want better formatting, broader language support, or seamless sync to Mac.

For Mac users, see also our complete speech-to-text on Mac guide and the iPhone dictation deep-dive.

Related