PiP Cue - Float, caption & translate any video in 100+ languages

Chrome's built-in Picture-in-Picture is a bare video with no controls. PiP Cue floats any video from any site with live captions, full playback controls, A-B loop, and speed control inside the window. New in this version: searchable transcripts you can translate into 100+ languages, so you can follow lectures, podcasts, and tutorials in your own language, jump to any line, replay the exact part you missed, and save any moment with its timestamp. Free, no account needed to start.

Add a comment

Replies

Best
Hey everyone, Naveen here. I built PiP Cue because I kept losing my place in long podcasts and lectures, and Chrome's built-in PiP is just a bare video with no controls. The biggest thing people asked for after the early version was language support, so that's the headline of this release: PiP Cue now pulls a searchable transcript for any video and translates it into 100+ languages. You can float the video, read captions in your own language, jump to any line in the transcript, and A-B loop the part you're practicing. It's become something I use as much for learning as for watching. Also new: export/import your saved videos, set reminders to finish something later, and send any saved video straight to NotebookLM. Still 100% free, no account needed to try it. If you're learning a language or studying from video, tell me what's missing. And tell me what breaks.

 congrats on the launch Naveen. How do you handle captioning when the source is noisy?

 Thanks Zolani! PiP Cue doesn't transcribe audio itself, it surfaces

the caption track the platform already provides (YouTube timedtext or

any HTML5 textTrack) and renders it live in the floating window.
so caption quality tracks the source: real uploaded captions render

clean, auto-captions on a noisy video are as good (or rough) as

YouTube's ASR made them. what PiP Cue adds is making them floatable,

searchable, and translatable into 100+ languages which for language

learners often makes even a rough transcript more useful.

what's the source you're working with? lectures, podcasts?