API for customizable speech recognition

Would you recommend this product?
1 Review5.0/5
Hey all, I’m the founder of AssemblyAI ( We're in the current batch of Y Combinator (S17) and are building an API for customizable speech recognition. Developers and companies use our API for things like transcribing phone calls and building voice powered smart devices. Unlike current speech recognition APIs, developers can customize our API to more accurately recognize an unlimited amount of industry specific words or phrases unique to what they're building without any training required. For example, you can recognize thousands of product or person names with our API. Or you can more accurately recognize commands/phrases common or custom to your use case. We've developed our own deep neural network speech recognition architecture, and aren't using any open source speech frameworks like Kaldi or Sphinx (just Tensorflow). Because of this, we're able to run things more affordably and pass those savings on to developers. I used to work on projects that had speech recognition requirements before starting AssemblyAI, and saw how limiting, expensive, and hard to work with traditional speech recognition services and APIs were. We want to help developers and companies easily build products with speech recognition. Would love feedback from the community on what we're building, and if you have any questions about deep learning or deep learning in production ask away!
@youvegotfox congrats on the launch. Are you guys working on a JS SDK for browser use? Would help you I think.
@moinism hey thanks! we are definitely working on a JS SDK that uses WebRTC ... hoping to have it out soon!
@youvegotfox Great going Dylan. Look forward to libraries in JS and PHP!
@youvegotfox hi! This seems really neat. Is this a potential competitor for specialized recognition services, such as medical niches? Could you for example, have one user be more specialized in radiology lingo while another user is more specialized for cardiology? Does it learn over time with edits from the user and such? I work on an app that uses a 3rd party speech recognition SDK in the medical field, and I'm always looking for other options.
@xcadaverx Exactly! You can have one user for radiology lingo and another for cardiology. And when you query the API with audio data, the transcripts will be customized for each user. Right now there is no user feedback loop, but we do QA that improves the system over time with more use. Would love to chat with you about your app! If you want to email I will look for your email and follow up with you directly about trying out our API!
Seems very interesting product. I'd love to test it. Already shared it with some friends :-)
@skaragiannis thanks! if you email I'll look for your mail and can get you setup with the API
@youvegotfox We built and might look into using your service. How are you different than VoiceBase and are you better at recognizing custom commands? What is or will be your pricing model? Interested to find out more
@automateiq Hey Chris! If you want to shoot an email to and mention this post, I can look for it and reply to you with some more info. We're very accurate at recognizing custom commands. You can actually restrict the API to just supporting the commands you need to be able to support.