Kamalei Zhang

MonoSub - Local AI subtitle generator from audio or video

by•
100% Local AI subtitle generation. Buy once, use forever. Stop paying monthly subscriptions.

Add a comment

Replies

Best
Kamalei Zhang
Maker
šŸ“Œ

Hey Hunter šŸ‘‹
I'm excited to launch Monosub — a desktop app built with Tauri + React + FunASR that lets you generate accurate subtitles and video summaries entirely offline, right on your computer.
What inspired me to build this?
This project started as a personal tool for my own video workflow. I was spending hours manually transcribing and summarizing my video content before publishing, and I wanted a faster, more private way to get the text I needed for scripts, captions, and video summaries.
What problem were we trying to solve?
We wanted to eliminate the pain of:
Waiting for cloud-based transcription services
Paying for expensive API calls
Worrying about the privacy of your video content
Juggling multiple tools just to get a simple subtitle file
How did our approach evolve?
Initially, I just wanted a quick way to transcribe my own videos. But as I tested with early users, I realized they needed more: batch processing, multiple output formats, and a beautiful, intuitive UI. So we expanded to support:

Key Features:

• šŸš€ Local AI Processing: Leverages FunASR for accurate, high-quality offline speech recognition, ensuring data privacy and fast processing speeds.

• šŸŽžļø Broad Media Format Support: Natively supports processing of a wide range of video and audio file formats, eliminating the need for pre-conversion and enhancing usability.

• šŸ“ Multi-format Output: Generates subtitles in widely-used formats including SRT, VTT, and TXT to meet diverse playback and editing needs.

• šŸ“¦ Batch Processing: Seamlessly manages and processes multiple video files simultaneously, significantly boosting productivity for bulk captioning tasks.

• šŸ‘„ Multi-speaker Recognition: Precisely identifies and distinguishes speech from multiple speakers in conversations for clear, contextual subtitles.

• 🌐 Multi-language Support: Accommodates speech recognition and subtitle generation for multiple languages, catering to global content creators.

• šŸŽØ Modern UI: Features an elegant, intuitive interface built on a contemporary design system, offering a smooth and visually pleasing user experience.


This is a labor of love, and I'd love to hear what you think. What's the biggest pain point in your video workflow? Let's chat in the comments!

download link https://github.com/0xkamalei/monosub/releases

Agbaje Olajide

@kamalei_zhangĀ 
Just tried MonoSub — this is fantastic! šŸš€

I really like how it runs completely offline and supports batch processing — saves so much time compared to cloud-based transcription tools. The multi-speaker recognition and multi-format output are super handy for real workflows.

As a user, one thing I noticed is that the batch workflow could maybe show a progress indicator per file — that would make it feel even smoother when processing multiple videos.

The UI feels modern and intuitive, and I can see this being a huge time-saver for anyone creating subtitles regularly.