Aaron O'Leary

Meet the winners of the 2025 Orbit Awards for AI Dictation Apps

by

What AI Dictation Tools Actually Are

AI voice input tools let you press a key, speak naturally, and instantly paste clean, structured text into any app. Most of them use a push-to-talk model (usually the Fn key) so you can trigger voice input anywhere without switching windows, opening a modal, or copying and pasting.

They’ve quietly become a new input layer for the computer: faster than typing, more natural than shortcuts, and flexible enough to drive workflows across writing, coding, email, and AI assistants.

A (Short) History of This New Wave

Voice tools have existed for decades. Dragon, Siri, Google Voice Typing, but they were slow, brittle, and rarely good enough to rely on. The current generation is a leap forward because of three things:

  1. Speed: modern Whisper-based models + custom inference engines mean near-instant text.

  2. Accuracy: context-aware LLMs clean and format speech automatically.

  3. Usability: configurable shortcuts, cross-app support, tone shaping, auto-edits, and integrated workflows.

The shift is big enough that “dictation” no longer describes what these tools do. They’re becoming a primary input method, not just an accessibility feature.

TL;DR — You Should be Using AI Voice Input

These tools are now:

  • faster than most people’s typing speed

  • more accurate in real prose than typing fast

  • dramatically better for thinking out loud

  • deeply integrated with personal and work AI tools

  • available everywhere (desktop, mobile, and now keyboards)

For most people — developers, writers, PMs, operators — voice is already better than typing for a large portion of daily work.

Snapshot: Pricing & Platform:

Funding (publicly disclosed where available):

  • Wispr: $81M+ total, including Series A and additional round in 2025

  • Aqua Voice: YC-backed; at least $500K+ publicly noted

  • Willow Voice: YC-backed; thus also at least $500k raised

  • Others: no public funding data; best understood as indie / bootstrapped / OSS.

TL;DR: Your Award Winners

See the full list of AI dictation apps here.

WisprFlow — The People’s Champ 🏆

TL;DR

  • Fastest cross-platform voice input in the category

  • Strongest review volume, depth, and year-long momentum

  • Built to make voice a primary input method, not a side feature

  • Now includes an iOS keyboard, giving full voice input anywhere on your phone

Founder Story

@Wispr Flow didn’t start as a dictation tool. it started as a neural interface company. The founder has been chasing “build Jarvis” since watching Iron Man at age 10. That obsession turned into early brain–computer-interface experiments, then a reality check: the world wasn’t ready for neural hardware, but it was ready for frictionless voice.

So WisprFlow pivoted into software, keeping the same ambition:
Voice should replace the keyboard. Not complement it. Replace it. That worldview shows up everywhere in the product: cross-platform support, real-time auto-edits, tone learning, and a push-to-talk workflow that feels more like thinking than typing.


Pricing & basics

  • Free tier with weekly word limits

  • Pro plan: $12/month (annual billing)

  • Runs on Mac, Windows, iOS, and via iOS keyboard for system-wide mobile input

  • Cloud inference tuned for ultra-low latency and high consistency

Review Signal
WisprFlow has the largest and deepest review footprint of the entire category. Several patterns stand out:

  1. Review spikes directly correlate to major releases

  • Windows launch → reviews from engineers and PMs adopting it at work

  • iOS app + iOS keyboard launch → big expansion into mobile workflows

  • Model upgrades + new features → consistent mid-cycle bumps tied to shipping, not hype

Dominant phrases show a very clear identity

Across reviews, the most common phrases are:

  • “natural”

  • “effortless”

  • “flow”

  • “no edits”

  • “finally fast enough”

  • “writes like I do”

Users describe WisprFlow in emotional terms — more like a tool they think with than a tool they type into.

Usage patterns are broad and high-leverage

WisprFlow shows up in workflows like:

  • PRDs, product specs, technical docs

  • Long Slack messages and email bursts

  • Continuous prompting inside Cursor / Claude / ChatGPT

  • Quick note capture between appsVoice-based coding sessions (e.g. Replit)

This is the widest usage range of any tool in the category.

Founder reviews: the strongest count in the field

More founders and builders have reviewed WisprFlow than any other voice input product — often calling it part of their daily workflows.

Weakness Signals

The small set of consistent critiques:

  • Users want more offline support

  • Advanced users want programmable voice commands

  • Subscription cost gives pause to casual or low-volume users

Why you’d pick WisprFlow over everything else

  • You want one voice layer across Mac, Windows, and your phone

  • You care most about speed, flow, and writing-style adaptation

  • You use AI tools daily and want voice as a front-end to them

  • You want the iOS keyboard for system-wide mobile input

  • You want a tool backed by serious funding and the clearest long-term execution on the “voice OS” vision

Aqua Voice — Readability Award 📖

TL;DR

  • Cleanest transcripts in the category

  • Designed by someone who depended on dictation for school and work

  • Best for long-form dictation that should look like writing, not raw speech]

Founder Story

@Aqua Voice 's founder is dyslexic and has been using dictation tools since sixth grade. He spent over a decade relying on software that never quite worked — always clunky, always messy. Aqua began as a voice-native editor, not a typical dictation box, which is why its output feels edited as you speak.

His belief: dictation shouldn’t look like a transcript of you talking; it should look like something you meant to write.

Pricing & Basics

  • Paid Mac app, with a trial

  • Mac only

  • Cloud-backed inference

Review Signals

The Aqua Voice review cluster is small but extremely sharp:

  • Near-perfect ratings

  • Review spikes lining up with key launches and upgrades in 2024–2025

  • Language users don’t usually use about dictation: “clean,” “readable,” “pleasant,” “ready to send”

Why you'd pick Aqua Voice over everything else

  • You dictate paragraphs, pages, or documents, not quick one-liners

  • You want the least editing afterward

  • You care more about how the text reads than about automation, commands, or system integration

  • You’re writing content others will see (articles, newsletters, briefs, scripts)

Superwhisper — Privacy Award 🔐

TL;DR

  • Fully local Whisper-based dictation on Mac

  • Strong pull from clinicians and other privacy-sensitive professionals

  • “Use it dozens of times a day” behavior, not just as-needed

Founder Story

@superwhisper started with one emotion: rage at macOS dictation. The founder built a client-side Whisper wrapper purely to fix his own workflow, then realized it had become the thing he used all day.

Worldview: AI dictation should be local, fast, boringly reliable. Not a funnel into someone else’s cloud.

Pricing & Basics

  • Free version plus Pro upgrade

  • Mac-only

  • On-device Whisper for all dictation

Review Signal

Patterns in reviews show:

  • Spikes around major releases (new models, performance upgrades)

  • Heavy use in reports, medical notes, documentation, and email

  • Users emphasize “no cloud,” “fast enough to use constantly” “a high degree of privacy for sensitive roles”

  • This is one of the few tools where usage looks like infrastructure rather than a toy.

Why you'd pick Superwhisper over everything else

  • You cannot put your audio in the cloud (healthcare, legal, sensitive corporate work)

  • You want a simple, local dictation engine with good accuracy

  • You send a lot of structured professional text (reports, notes, internal emails)

  • You don’t care about style learning or fancy rewriting, just solid text

Willow Voice — Everyday Communication Award 🗣️

TL;DR

  • Best fit for people who live in email, Slack, and DMs

  • Auto-formats and tones text so it’s basically ready to send

  • iOS keyboard for dictation anywhere on your phone

Founder Story

@Willow Voice's team came from building tech for assisted-living facilities, where older adults leaned heavily on dictation. They watched people try to use Apple’s built-in tools and hit accuracy, setup, and reliability walls.

Their belief: voice tools should be as simple and obvious as the keyboard, and ideally more forgiving. That’s why Willow’s default mode is: press a key, talk, get a complete email or reply back — with tone, punctuation, and structure already in place.

Pricing & Basics

  • Free tier + subscription

  • Mac app + iOS app

  • iOS keyboard for dictation into any app

  • Cloud inference, optimised for low latency

Review Signal

Willow’s review bumps line up with:

  • Mac adoption from people wanting something nicer than system dictation

  • A clear November spike when the iOS keyboard launched

People talk less about “accuracy” and more about:

  • “clean emails”

  • “no setup”

  • “it just works”

Why you'd pick Willow Voice over everything else

  • You spend most of your day in email, Slack, iMessage, support tools

  • You want dictation that formats and tones your message already

  • You need mobile dictation that’s actually integrated (keyboard, not just an app)

  • You value accessibility and smooth UX over tweaking knobs and workflows

Ito — Open Source Award 🧑‍💻

TL;DR

  • The only open-source voice tool in this orbit

  • Focuses on intent, not literal word-for-word transcription

  • Best for developers and power users who want to own and shape their assistant

Founder Story

@Ito founders looked at the space and saw two things they hated:

  1. Closed assistants (Siri, Alexa, etc.) you can’t inspect

  2. Dictation tools that only transcribe but don’t help

So they built Ito, a voice assistant that is:

  • Open source

  • Auditable

  • Forkable

  • Built around “VibeTyping”: you say what you mean, it writes what you should send

Their worldview: a serious assistant must be hackable and honest.

Pricing & Basics

  • Free, with a pro plan

  • Open source (self-hostable, forkable)

  • Mac, Windows, iOS

  • Local + optional cloud

Review Signal

Ito’s reviews are dense and technical:

  • Clustered around launch and follow-ups

  • Frequently mention code, PRDs, and structured writing

  • Highlight that it “writes what I meant, not what I said”

It’s not a mass-market dictation toy; it’s a builder’s tool.

Why you'd pick Ito over everything else

  • You’re a developer or technical user who wants to tune your own voice workflows

  • You want to keep control over data, models, and behavior

  • You want a tool that can write code, docs, and emails from fairly rough prompts

  • You believe your assistant should be inspectable, not a black box

Alter — System-Level Intelligence Award 🤖

TL;DR

  • Not just dictation: a Mac-wide AI layer living in the notch

  • Sees windows, apps, files, and can act on them

  • Best for people drowning in tabs and tools

Founder Story

@Alter 's founder started from a different pain: cognitive overload. Too many apps, too many tabs, too many micro-tasks. The idea wasn’t “better dictation.” It was: what if the Mac had a native brain?

Hence Alter: a notch-resident assistant that:

  • Watches your active app context

  • Can act across tools

  • Picks models on your behalf

  • Pulls in files, screenshots, browser context directly

Their belief: AI should be part of the operating system, not another window.

Pricing & Basics

  • Free tier + paid

  • Mac-only

  • Uses a mix of cloud + local / BYO models

Review Signal

Reviews cluster around:

  • “feels like part of macOS”

  • “reduces context-switching”

  • “meeting notes and actions in one place”

  • Rather than praising transcription, people praise orchestration.

Why you'd pick Alter over everything else

  • You’re a PM, founder, operator, or IC juggling many tools

  • You want an assistant that can see what you see and act on it

  • You care more about automation and context than raw dictation

  • You expect AI to be integrated with your OS, not bolted on

MacWhisper — Transcription Purist Award 🎙️

TL'DR

  • Straightforward local Whisper transcription

  • One-time purchase, minimal UI

  • Best for people with lots of recorded audio

Founder Story

@MacWhisper is classic indie DNA: one developer, a clear need (local Whisper on Mac with a usable UI), and a bias for shipping something simple that works.

Worldview: not everything needs to become an assistant. Sometimes you just need audio → text, reliably.

Pricing & Basics

  • One-off purchase

  • Mac-only

  • Fully local audio transcription

Review Signal

Reviews consistently mention:

  • “reliable,”

  • “offline,”

  • “simple to use,”

  • “exactly what I needed for recordings.”

No one is asking it to be more than it is — which is the point.

Why you’d pick MacWhisper over everything else

  • You record interviews, podcasts, talks, or calls

  • You just want local Whisper with a UI

  • You don’t need live dictation, formatting, or workflow logic

How These Tools Diverge

If you strip away branding and look at behavior and reviews, you end up with a very simple map:

  • WisprFlow — I want voice to replace typing, everywhere

  • Aqua Voice — I want my spoken thoughts to look like polished writing

  • Superwhisper — I need offline, local dictation I can trust professionally

  • Willow Voice — I want to respond to people faster, with clean emails and messages

  • ITO — I want to build and own my own voice assistant

  • Alter — I want AI integrated into my whole Mac, not just my text box

  • MacWhisper — I just need solid local transcription for recordings

Where This Category Is Going Next

Several trends were obvious across launches, reviews, and founder comments:

Hardware is coming back

WisprFlow literally started as a hardware company. As models get faster and smaller, the next wave will likely include:

  • dedicated voice input hardware

  • earbuds with low-latency inference

  • hybrid on-device chips

  • desktop microphones built for AI input

Speed and Accuracy still have room to grow

Real-time correction, faster inference, and smarter reformatting are already being shipped monthly.

Moving from ‘paste text’ to ‘command your computer’

Voice input today mostly outputs text. The next step is action:

  • navigate apps

  • trigger workflows

  • write code with context

  • call APIs

  • perform tasks across tools

@Wispr Flow already hints at this: you can vibe-code in Replit using native voice input, not as a gimmick, but as a real workflow.

The native OS players are waking up

Apple, Google, OpenAI, all shipping stronger voice systems. Voice input apps will face more competition from:

  • ChatGPT’s real-time voice mode

  • native dictation improvements

  • system-level hooks

But: Those improvements may also pull voice input deeper into apps, creating opportunities for third-party tools to power native voice experiences.

Voice Agents are becoming Infrastructure

There’s a growing ecosystem underneath all of this:

This is the voice agent infra layer, a category we’ll cover in later Orbit Awards.

20K views

Add a comment

Replies

Best
Curie S.

Very informative review, thanks @aaronoleary !
I've been testing @superwhisper and @Wispr Flow and agree with your points. WisprFlow has a smooth sync between iOS and Mac, but haven't enjoyed the SuperWhisper mobile experience.