LeMUR is a framework for applying Large Language Models to spoken data. In a few lines of code, you can do things like generate summaries or ask questions about your meetings, phone calls, videos, or podcasts.
Try Universal-1, AssemblyAI's most capable speech recognition model, trained on 12.5M hours of multilingual audio data. Universal-1 achieves best-in-class speech-to-text accuracy, reduces word error rate and hallucinations, and improves timestamp accuracy.