Deepgram Brain

Drag and drop speech transcript editor that learns from you

Deepgram Brain is an AI transcript editor that makes it easy to drag and drop recorded files to edit. Transcription happens automatically and access to the editor is totally free. Use it to transcribe meetings, lectures, podcasts, do closed captions or just get free Speech-To-Text. API and Python SDK available for developers.

Would you recommend this product?
5 Reviews5.0/5
It’s great to be on Product Hunt! The Deepgram team has been hard at work bringing accurate AI-based, fast transcription to the world. And we're releasing it for free! The transcription editor in Deepgram Brain let's you: — drag & drop to upload audio and videos — we process the audio in seconds — then you can search by sound inside the file — get an automatically generated AI transcript — easily edit with synced audio to make it perfect — works best in Chrome! We hope this takes the sting out of transcribing (oh man, it's one of those things that sucks). Brain learns from the files you upload and edit, and gets better with time. There's an API and Python SDK that lets you programmatically programmatically understand your recordings. Deepgram Brain is tuned to analyze phone calls and will work on any audio you upload. We would love feedback on the user interface! It's difficult to get 'interacting with audio' to feel right. It's a next gen problem that we're tackling head on. There's keyboard shortcuts and two types of cursors: a Text Cursor and an Audio Cursor (the Deepgram Dot, kinda like karaoke). Transcribing is also a great way to learn new things. Upload a lecture and transcribe it. Then post add it to Youtube subtitles for the lectures. Others will thanks you and you'll have have better recall later :). Glad to answer questions about speech and brain!
Looking forward to testing your platform for audio and video transcriptions and accessing captions files - in the meantime I have added you to my curations.
@krishnade Glad to have you trying it out! It's available now BTWs :)
@stephensonsco Thanks - yes I was on your email notification list!
I got a 404 on the main PH link (, and then if I try signing up through I get a never ending circle... Actually, worked out it was because I put "Don't call me, I live in Australia and you'd probably just dial at weird hours" in the phone number field... validation bug? But yeah, I'd say phone number field would deter people from signing up. LOVE the idea of this though. 3 months ago there was almost none of these, and now there's been like 3-4 posted, but most of the cost per word on them has been a bit high for my liking (particularly given they were using free APIs). * Uploading files UI: I almost didn't see the bar at the bottom * I assume servers can transcribe faster than I can listen, so I like that this does it all on the back-end without me having to listen to an hour of audio (some of them make me do this) * That said, in the files page, If something is processing, it'd be nice to see a bar to indicate how much of the processing was done, and an expected time of completion - even if it's just a guestimate. * AI thing: Can you give me more info on your process? A couple of the other transcription services combine multiple Transcription AI's (Watson, Google Voice, etc.) to create a single "best guess" transcription. What's your process? * Transcribing thing: Can you pick up multiple speakers? I saw the IBM Watson demo will do that (and that's really useful for me) Okay, so while typing this, the 1hr / 15Mb MP3 conversation I uploaded has transcribed........ and I honestly don't know what to say. About 2/3 of the conversation is missing from the transcription. It's not studio quality audio, but I record on a Blue Yeti Pro and the other speaker was fairly understandable... even if it couldn't understand either of us in parts, I'd expect wrong words not big gaps.
@rossdcurrie I'm sorry about the 404 errors! I think people were getting them for a few hours :( We were switching over subdomains for the API and ended up clobbering the UI, oops. I hope everything is fixed up for ya now! * Phone number ask: we took it out. * Uploading files: We'll incorporate the feedback! * Server transcription time: Yep, the servers transcribe in less than a minute usually * Processing indicator: This is on the roadmap :). * AI for Deepgram: We are a full blown AI company so we train our models from scratch and everything is processed by Deepgram. Our system gets better when you've transcribed something since we'll use it to train our model on your voice and words, and the model will get better for you. This takes a bit of time (days), but over time snowballs into much better transcripts. * Multiple Speakers: This is on our roadmap! Definitely try out the IBM demo on several files. You'll probably see low quality on both transcription and speaker segmentation (that's what we see in out testing, but you don't have to take our word for it :D ). Sorry about the problem with the transcript. I'll get back with a fix asap.
Do you plan on exposing alignment data via the API?
@jeremyfranz Yep we do! We're working on revamping the algorithm before we add it to the standard API. What are you using alignment for?
@stephensonsco We combine alignment with acoustic analytics and NLP data from transcripts for our market research product. Having everything stitched together allows us to do some fun stuff on the reporting / playlist side of things.
@jeremyfranz Sounds awesome. We'd love to talk more.
I explored a few of these services to get transcripts for, and Deepgram is by far the most pleasant REST API to work with - glad to see they are expanding their offerings!
@garysieling Thanks Gary! Glad to see you still love DG! Our service is a lot faster and more accurate now (and will keep getting better!).