Ziven Jay

Caplo - Real-time AI captions & translation for any iOS app

Caplo brings real-time AI captions & translation to any iOS app. It captures system audio to provide live subtitles in a floating Picture-in-Picture (PiP) window, perfect for foreign streams, meetings, or anime.

• Floating PiP: Overlays any app you use.
• 12+ Languages: English, Japanese, Chinese, Spanish, etc.
• Universal: Works with YouTube, Zoom, Netflix & more.
• AI-Powered: Fast and accurate transcription.

Break the language barrier on your iPhone!

Ziven Jay
Maker
Hey Product Hunt community! 👋 I’m the maker of Caplo. This project started from my own frustration: I love watching foreign content and attending global tech meetups, but I often found myself lost when there were no subtitles. So I built Caplo. The goal was simple: If you can hear it on your phone, you should be able to understand it. Caplo can "read" audio from ANY app without needing internal integrations. Combined with a floating Picture-in-Picture window, it feels like a native part of your OS. I’ll be here all day to answer your questions. Let me know what you think! 🚀
swati paliwal

@socekin Kudos on the launch. Just a QUICK Q: how accurate is Caplo's translation for live audio in noisy settings, say a crowded conference?

Ziven Jay

@swati_paliwal Caplo relies on the iPhone's built-in noise processing, so in theory the result is similar to what the other party hears during a phone call. Caplo comes with free credit, so you can try it out.

S.Lee

If I rewatch a video I've already transcribed before, does it use up my minutes again? Or can I access the previous transcription from the session history?

Ziven Jay

@sj_lee15 Thank you for your suggestion. Caplo currently provides real-time transcription subtitles only and does not read or display subtitles from past sessions. I will consider this proposal.

Kaito

Real-time translation for live calls is genuinely hard to get right. How does it handle strong accents or fast speakers?

Ziven Jay

@kaito_builds We use state-of-the-art large AI models to handle this.

Ruxandra Mazilu

Super impressive, congrats on the product and the launch!

What's next on your roadmap? Do you want to focus on covering more languages, add other complementary features, or something else?

Ziven Jay

@ruxandra_mazilu I'll consider adapting to the main agentic tools, to make it easier to hand the recording results to an agent for processing.

Carolina Rad

Hello!

I tried to check it out in my husband's family group on WhatsApp (they are Syrian, I'm Mexican). I played some voice notes and some recordings, but when I came back to Caplo the record showed zero words! I also tried with YouTube and got the same zero result.
I tried several ways but the app never came back with any results. Am I doing something wrong? Maybe it would be useful to have a small tutorial at the beginning or some visuals that show what to expect.

Ziven Jay

@carolinahunts Caplo supports two modes: microphone recording and system audio recording. For your scenario, please use system audio recording. After enabling it, you can play media in any app. Returning to Caplo will show the results.

Carolina Rad

@socekin Hello! I tried this, but when I checked, the recording said "Zero words".

Could you help me understand what went wrong? If possible, an example showing how it should look would be amazing.

Ziven Jay

@carolinahunts If it's convenient, could you record a video of the error and send it to my email at contact@sparklight.ai? I will also reply with a tutorial video showing how to use it. Thanks!

ray

I watch a lot of tech meetup recordings that don't have subtitles, so this is really useful. Can you export the transcript after a session? It would save me a lot of note-taking.

Ziven Jay

@ray_artlas All sessions are automatically saved in iCloud, and you can access them at any time. In the future, it will be open to authorize your own Agent to read the

ray

@socekin Oh nice, iCloud sync is handy. The agent integration sounds interesting too, looking forward to that.

wisdom ojieh [copywizard]

This is a clean execution of a very real pain.

The "if you can hear it, you should understand it" idea clicks instantly, and the audience is broader than it might first appear. Anime fans, remote workers, people sitting in multilingual meetings where half the context disappears, anyone deep in global content. The PiP layer is the detail that elevates it. It stops feeling like a tool you switch to and starts feeling like something native to how people already use their phones.

One small messaging tweak worth testing: a fast before and after. Watching without Caplo versus with Caplo makes the value obvious in seconds, which matters a lot for Product Hunt traffic that's skimming rather than reading.

Curious whether a primary use case has started to emerge yet: entertainment versus work versus education. Doubling down on one early usually sharpens both the messaging and the growth path more than trying to serve all three at once.

I spend a lot of time helping SaaS teams tighten positioning and launch messaging, so these are the details I naturally notice first. This already has strong bones. It would be fun to dig into the direction further if you're exploring that.