Cory LaViska

TongueType for macOS - Local dictation for macOS without the subscription

TongueType is a macOS voice dictation app powered by Whisper AI running locally on Apple Silicon. No cloud, no accounts, no subscriptions. Hold a key, speak, release. Your words appear. Supports 12 languages and audio file transcription. TongueType gives you a configurable press-to-talk hotkey, audio and video file transcription, and configurable post-processing rules. It's customizable and fun (try Rainbow Mode!) and it's built to be the fastest dictation workflow possible.

Add a comment

Replies

Best
Cory LaViska
I'm Cory. I've been doing web development for a long time, but TongueType is my first macOS app. In the age of AI, many developers find themselves typing words more than code. I type fast, but my fingers have become a bottleneck for many tasks. I wanted a solid dictation app that runs locally, responds instantly, and has the accuracy of a world-class AI model. TongueType sits in your macOS menu bar and starts listening as soon as you press a hotkey. It transcribes your words and pastes them into whatever app you're currently in. Press. Talk. Words appear. macOS has had built-in dictation for a long time, but in my experience it's just not very good. I was surprised to see so many third-party dictation apps tied to a subscription and a bunch of cloud features I didn't want. While dictation itself isn't unique, TongueType focuses on being seamless: it feels tightly integrated with the OS because it gets out of the way and just works. Under the hood, TongueType uses OpenAI's Whisper small model compiled to Core ML so it runs directly on Apple Silicon. Some additional info that sets TongueType apart from other apps: 💸 No subscription. Free to try. Pro is a one-time $19.99 for up to 5 Macs. Buy it once, keep it forever. 🔒 100% private. Local-only. Zero telemetry. No account. Nothing logged, nothing uploaded. 🎛️ Yours to shape. Custom post-processing rules, spoken symbols, cancel phrases ("scratch that"), 12 languages with auto-detect. 🌈 A little personality. 20 accent colors including Rainbow Mode. None of it was necessary. All of it was fun. (Turns out that's what makes an app feel like it's yours.) I use TongueType constantly for prompting LLMs, writing emails, sending DMs, commenting code, typing commit messages...basically anywhere the thinking is already done and all that's left is getting the words out (which turns out to be a surprising amount of my day). It's been surprising to discover that many people don't seem to like dictation apps. I'm not sure if that's because they haven't worked very well in the past or if it's just a hard habit to get into. (Admittedly, my kids helped me form the habit. They'd see my typing and rightfully ask "why aren't you using TongueType?!") I'm genuinely curious to learn: what's the one thing that's kept you from sticking with a dictation app?
Keith Taylor

local-only and no subscription, rare combo! how do you handle the punctuation problem. say "comma" and it adds punctuation? how would "period" be decided as word or punctuation? congrats on your launch!

Cory LaViska

@hiyamojo I've gotten used to not using it, to be honest. But you can go to Settings > Postprocessing and experiment with replacements there! A friend of mine added, e.g. "smiley face emoji"' => 😊 and similar.

Jim Jeffers

For me, the thing that breaks the dictation habit is the cleanup tax after the words appear.

Dictation is great for raw thought, but different surfaces need different levels of cleanup: an LLM prompt can stay loose, a commit message needs precision, and an email needs just enough polish without losing the spoken cadence. The post-processing rules feel like the right place to solve that.

One thing I’d want is per-context presets plus a quick raw transcript / cleaned text comparison. That would make it easier to trust the tool because you can see whether it’s preserving the thought or silently over-smoothing it.

Cory LaViska

@jim_jeffers Thanks for the feedback! What would per-context presets look like and how would they operate?

Jim Jeffers

I’d think of them as small rewrite policies tied to the destination app, with a default fallback.

For example: Messages/Slack = keep it casual, trim false starts, don’t over-punctuate. Email = clean grammar and paragraph breaks, but keep my wording. Git commits = short imperative, no hedging. LLM prompt = preserve messy detail, only fix transcription errors.

Operationally, the lightweight version could be app-based rules plus a quick chooser/hotkey when the app guess is wrong. The raw/cleaned diff matters because if the preset silently makes me sound too polished, I’ll stop trusting it.

Cory LaViska

@jim_jeffers I'll think about this. I wonder if it can be done with a modifier key or spoken word, but that's a neat idea!

Jim Jeffers

@claviska If you want a tiny gut-check, the risky case is: raw dictation → cleaned output where the thought is technically clearer but no longer sounds spoken by me.

Even one anonymized raw/cleaned pair from your own usage would reveal whether the preset should be “remove friction” or “rewrite into prose.”

Patrick Peterson

This needs post AI rewrite in order for me to use it. Like what Typeless.com is doing.

Cory LaViska

@patrickpetcejj I can look into this, but I worry that a local pass on an LLM would slow things down. Are you mostly looking to remove things like "um" and cleaning up false starts?

Patrick Peterson

@claviska if I speak something, I don't have to worry exactly what I'm saying. I can just ramble, and it will get rid of the ums and everything.

Check out how Typeless does it. If you could make it work how they do it, you would do so well with your product.

Roop Reddy

Congrats. Where do I see pro and free features list?

Cory LaViska

Thanks! @roopreddy free includes every feature, but capped at 30 minutes of dictation per month and 10 seconds for file transcriptions. There's a Free vs. Pro page with more details.