claude-real-video - Let Claude - or any LLM - actually watch a video

by
Paste a YouTube link into ChatGPT and it reads the transcript, not the picture. crv runs locally: scene-change detection picks the frames that matter, dedup drops the repeats, Whisper transcribes the audio - then any LLM can read the folder. MIT, no ML downloads.

Add a comment

Replies

Best
Maker
📌
Hey PH! I'm Leo, a non-engineer from Taiwan. I kept pasting YouTube links into AI chats and realizing they only read the transcript. The picture - the cuts, the pacing, the visual hooks - was invisible. So I built crv with Claude Code. It grabs a frame at every scene change (not a fixed 1 fps), throws away near-duplicates, and transcribes the audio. Everything runs on your machine. On a 58-second test clip that's 26 frames instead of 58, with nothing missed. It hit the Hacker News front page last week and just passed 700 stars, which still feels unreal. There's also a bundled Claude Code skill: paste a video link into the chat and Claude watches it by itself. It's MIT and `pip install claude-real-video`. Ask me anything - especially where it breaks. :)