Dictation on the Mac finally got good. Apple ships strong on-device speech recognition, cloud engines are fast and shockingly accurate, and the gap between “talking” and “typed text in the right app” has shrunk to almost nothing. The hard part isn’t finding an app — it’s knowing what to weigh. Here’s how to choose.
01What actually matters
Most “best voice-to-text” lists rank apps by feature count. In practice, five things decide whether you’ll still be dictating a month later:
- Privacy. Does your audio stay on the Mac, or get streamed to a server? For notes, medical, legal, or anything under NDA, this is the whole ballgame.
- Accuracy. Word-error rate on your voice, accent, and vocabulary — not a vendor benchmark. Punctuation and proper nouns matter more than raw WER.
- Latency. How quickly text appears. Sub-second feels like magic; two seconds feels like fighting the tool.
- Languages. Coverage for the languages you actually speak, and whether you can mix them.
- Workflow fit. Does it insert text into any app, behind a global hotkey, without copy-paste gymnastics?
02The two families of dictation
Every option on macOS falls into one of two camps, and the trade-off between them is the single most important decision you’ll make.
On-device (local) recognition
The model runs on your Mac. Audio never leaves the machine, it works on a plane, and there are no per-minute fees. Apple’s built-in Speech framework — including the upgraded on-device models in macOS 26 — is the obvious example, and it has gotten genuinely good. The trade-off is that very large cloud models can still edge it out on messy audio or niche vocabulary.
Cloud recognition
Audio is streamed to a provider like Deepgram, OpenAI, or ElevenLabs and transcribed by a large model. You typically get top-tier accuracy and broad language support, at the cost of sending your voice off-device and paying per minute (often via your own API key). For a deeper look, see on-device vs cloud dictation.
03The main options, honestly
Apple Dictation (built in)
Free, on-device, and already on your Mac. Great for quick, private dictation in any text field. Where it falls short: limited control over models per language, no real menu-bar workflow tuned for power use, and accuracy that trails the best cloud engines on hard audio.
Cloud-only dictation apps
Apps built around a single cloud API can be extremely accurate and feature-rich. The catch is structural: your audio leaves the device by design, you’re often on a subscription whether you dictate once or a thousand times, and you’re locked to one engine’s strengths and weaknesses.
VTT
VTT is the approach we built because we wanted both columns of the table at once: on-device and private by default, with the option to reach for Deepgram, OpenAI, or ElevenLabs using your own key when you want maximum accuracy. It’s a native Swift/AppKit menu-bar app with a global hotkey and auto-insert, and you can route each language to the engine that handles it best. No account, and it’s free to start. (Yes, we’re biased — but the design exists precisely to dodge the on-device-vs-cloud compromise.)
Try private dictation on your Mac
VTT transcribes on-device by default, with optional cloud engines via your own API key. Free, no account, macOS 14+.
Download VTT04A quick way to choose
- Privacy is non-negotiable? Start with on-device. Add a cloud engine only for specific hard cases.
- You dictate in several languages? Look for per-language engine selection rather than one global model.
- You dictate all day? Avoid flat subscriptions if you can use your own pay-as-you-go key — and prefer on-device for the bulk to keep costs near zero.
- You just want it to work everywhere? Prioritize a global hotkey and reliable auto-insert over flashy extras.
05The bottom line
In 2026 the best Mac dictation setup isn’t a single app — it’s a sensible default (on-device, private) with an escape hatch (a cloud engine you control). Pick a tool that lets you have both, put it behind one hotkey, and you’ll stop typing things you could have just said.