Speech analysis is something I've been looking at recently for a fintech company but from testing the standard API transcription services (those from IBM, AT&T, etc.) the quality hasn't been great.
Is exposing raw transcript something you're looking at, even if it's a probabilistic transcript ? - search is one use case but we've also got other use cases for which we'd want raw data (predicting conversion, risk, fraud, etc.)
@imranghory We definitely can expose the transcript (there's a DeepGram API call for that), but the error rate in the transcript is highly dependent on the input audio quality (better quality audio has better transcripts, phone calls in noisy cars don't). However, our analysis techniques don't rely solely on the transcript being perfect, which is a feature that really sets us apart—especially for medium to poor quality audio.
On prediction: We've built AI prediction layers to do what you are mentioning (predicting outcomes) but we don't rely on the text in the transcript being perfect, we build it on top of our fuzzy key phrase search. Contact us if you need that sort of thing!
Report
@stephensonsco our current dataset is recorded phone calls in mp3 so the greatest, most models that are trained on phone data (i.e. dealing with narrowband) fail on our data due to the lossy compression on mp3. We're looking at switching to uncompressed call recording though.
What's the pricing structure for search and the API ?
@imranghory@stephensonsco We'd be happy to talk about our pricing and set up a demo for your call audio! Shoot us a quick message at deepgram.com/contact and we'll be in touch.
Report
Consider these new Use Cases:
1) Associated multiple language sound bytes!..
For people to use in foreign countries as a handheld translator!
2) Voice mail generator.
Thanks,
Jaswinder Brar.
@jay_bee12345 We're also very interested in multiple languages (right now it's only English)
As for 2), you should check out mashily.com (our mashup generator) if you want to use the search to make a great voicemail sound clip. Reply with the URL if you make a mashup!
@nj_raju I know right?! This is something we want to add soon. Getting it working on different platforms while not frustrating users is a challenge, though.
@iamhabitat Hey Ben! Thanks for the feedback. Are you searching on our homepage? That demo is only for searching within that single creative commons video about the Hoover Dam—we don't yet provide search that is akin to Google for Audio/Video (but we certainly work on it). You can search through other files files (YouTube videos, your own recorded memos, things like that) by creating an account and uploading them to the DeepGram console. Let us know if you have more feedback (also, we have a slack channel—https://www.hamsterpad.com/chat/...!)
Report
I love the concept - eager to use it! I tried to index a YouTube video and got a Failed status. Any ideas why this could happen?
Someone needs to go redesign the site. The search doesn't seem to work. Just keeps playing the same video. I think it skips to parts of the video where the search keyword can be found? No idea.
I searched "apple" and nothing happened.
@topcities Sorry that it's annoying. There is only a single video to search through as a demo and 'apple' isn't mentioned in the video. You can upload videos from YouTube or your personal audio/video stash by creating an account and dropping it into the console. Let us know how that goes for you!
Report
@stephensonsco The technology is cool. With better messaging on the site I think you'd increase your signup and engagement rates a lot.
@noajshu@stephensonsco Sure.
1. Don't make users sign up before trying it. Let users play with it a bit then ask them to register.
2. The video demo has many usability issues. The search box insinuates a user can search for a video, not text within the video. The pink buttons feel like they let users paginate to the next video search result. Might be better if you just made a video explaining the value proposition right above the fold.
3. I didn't realize there was more stuff below the video, because I got stuck at the video section and left shortly after because I didn't really get it right away.
4. The slogan didn't hook me because I didn't get it right away. I was just scanning so probably that's the reason. Perhaps make it even simpler for the average joe to get it. Something like "Search through speech within videos" might be easier to understand. You can always explain in more detail later after the user is hooked.
Replies
Deepgram
Deepgram
Deepgram
Deepgram
Deepgram
Deepgram
Deepgram
CAKEWALK
Deepgram
Deepgram
Zengo wallet
Deepgram
Arbor
Deepgram
Arbor
Deepgram
Deepgram
Arbor
Deepgram
Deepgram