Hapi

Hebrew Voice to Text on Mac: Modern Hebrew, RTL, and Niqqud Policy

How to transcribe Hebrew audio on macOS — handling unvocalized output, modern vs liturgical Hebrew, RTL rendering, and Hebrew-English code-switching, fully on-device.

5 min read·Voice notes

Modern Hebrew is the working language of roughly 9 million speakers, mostly concentrated in Israel and the Jewish diaspora. Three structural features make Hebrew interesting for speech recognition: a writing system that normally omits vowels, a meaningful difference between modern and liturgical varieties, and right-to-left rendering that interacts with embedded English, numbers, and brand names.

This guide walks through how Hebrew speech recognition behaves on Mac in 2026 and how to set up an on-device flow that handles modern Hebrew, RTL output, and code-switching cleanly.

What Makes Hebrew Specific for Speech Models

1. Unvocalized writing (no niqqud)

Modern Hebrew is normally written without niqqud — the vowel marks that disambiguate pronunciation. Native readers reconstruct vowels from context. Speech models follow the same convention: the output is unvocalized, looking like the Hebrew you read in Haaretz, in WhatsApp, in tech blogs.

You only need vocalized output for specific use cases:

  • Religious texts (Tanakh, prayer books, Talmudic study materials)
  • Hebrew-as-a-second-language learning materials
  • Hebrew poetry where stress and pronunciation are the point
  • Text-to-speech inputs that need pronunciation hints

For all of these, a separate text-to-text diacritization model is the right tool — running locally, applied after transcription, no audio re-upload.

2. Modern Hebrew vs. liturgical Hebrew

Hebrew has been continuously written for ~3,000 years with distinct historical layers. Modern Hebrew (Ivrit) is the spoken language of contemporary Israel; Biblical, Mishnaic, and Rabbinic Hebrew are layers used in religious and scholarly contexts. Most training corpora are heavily skewed to modern Israeli media, so:

VarietyRealistic accuracy on Mac models
Modern Israeli Hebrew (news, conversation)Strong
Modern Hebrew with Mizrahi accent featuresGood
Liturgical chanting / cantillationWeaker
Biblical / Mishnaic with traditional pronunciationWeaker

For day-to-day Mac users, this is invisible — the model handles modern Hebrew well. For yeshiva-style transcription work, it is a real limitation.

3. Right-to-left rendering

Hebrew is written right-to-left, but numbers, dates, English words, brand names, code, and URLs run left-to-right inside Hebrew text. macOS handles this via Unicode bidirectional algorithm rules. The dictated output drops into native macOS apps and renders correctly without any manual configuration:

  • ✅ Pages, Notes, Mail, Safari, modern web editors
  • ⚠️ Older Cocoa apps and certain terminal contexts can still split bidirectional text awkwardly — that is a rendering issue in those apps, not a transcription issue

How Local Hebrew Dictation Works on Apple Silicon

Two on-device approaches dominate on Mac in 2026:

  1. Apple's built-in dictation. Hebrew is supported on Apple Silicon Macs, with on-device processing for offline-supported configurations.
  2. Third-party local apps. Hapi runs Parakeet (multilingual) and WhisperKit-derived models on the Neural Engine, with menu-bar capture and paste-at-cursor.
DimensionApple DictationHapi (local)
ActivationFn-key shortcut, requires text fieldGlobal hotkey, system-wide
Auto-paste anywhereNoYes
Filler-word cleanup ("יעני", "אמ")NoYes (heuristic)
HE/EN code-switchingManual language toggleAutomatic per segment
Punctuation styleStandardHebrew-aware
CostFree, built-inFree, separate install

A Realistic Hebrew Dictation Workflow

For day-to-day Mac use:

  1. Press the hotkey wherever your cursor is. The model runs locally and the result auto-pastes at the cursor.
  2. Speak naturally in modern Hebrew. No need to over-articulate or "switch to standard register" — the model handles your normal speech better than forced citation forms.
  3. Use natural pauses for punctuation. A half-second pause is enough for a comma; longer pauses produce paragraph breaks.
  4. Review the output for proper-noun edge cases. Personal names from less-common diaspora communities sometimes need a manual edit.

Common Failure Modes and Recovery

  • Final-form letters at the end of words. Hebrew uses different letter shapes at the end of words (ך, ם, ן, ף, ץ). Modern models get this right >99% of the time but very fast speech can occasionally trigger errors near word boundaries.
  • English brand names transliterated into Hebrew letters. "אפל" instead of "Apple." Multilingual models avoid this; older monolingual Hebrew models do not.
  • Hebrew numbers vs. digits. "חמש עשרה" sometimes lands as words rather than the digit "15". Most tools have a configurable normalization rule.
  • Proper nouns with non-standard spelling. Family names with German, Yiddish, or Arabic origins occasionally need manual correction.

Bottom Line

On-device Hebrew transcription on a modern Mac is good enough for daily professional use across modern Hebrew, bilingual Hebrew-English workflows, and most everyday content. For sensitive Hebrew-language content — therapy notes, family conversations, journalism — local-first is also the architecturally honest choice, since the audio stays on the Mac instead of routing through a foreign-cloud sub-processor chain.

Related