Chris Messina

Clipto - Fully local, natural language search over terabytes of media

Like Google Photos, but fully local. Turn the terabytes of video, audio, meetings, and files you work with into searchable memories, without uploading anything to the cloud. Clipto automatically tags people, dialogue, and scenes, so you can instantly find any moment buried in your media just by describing what you're looking for. It's fast too: on a MacBook Pro M5, Clipto indexed 2TB of videos in just 24 hours.

Tina Yao

Can I drag and drop clips directly from the Clipto search window straight into my Premiere Pro or DaVinci Resolve timeline, or do I need to reveal in Finder first?

Henry Kang

@libin_yao Yes. In fact, we’ve already built a Premiere Pro plugin specifically for this workflow.

You can search your media directly inside Premiere using Clipto, find the exact moment you’re looking for, and add the selected clip to your timeline without jumping back and forth between Finder and your editor.

For many editors, the goal isn’t just finding the clip, it’s finding it without breaking creative flow. That’s one of the main reasons we built the integration in the first place.

If you had to choose, which one do you use more heavily: Premiere or DaVinci?

Sounak Bhattacharya

"Automatically tags people" — is that face recognition, voice matching, or something else? And when it misidentifies someone, is there a way to correct the label without re-indexing the entire library?

Henry Kang

@sounak_bhattacharya Yes! We actually use both visual face recognition and voice identification to build a more complete understanding of who appears across your media.

And yes, corrections are fully supported. If Clipto misidentifies someone, you can simply relabel that person (or merge/split identities), and the change is reflected throughout your library. There’s no need to re-process or re-index everything from scratch.

In fact, user corrections become part of the local memory layer, which helps make future search and retrieval much more accurate for your own media collection.
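For readers wondering how a relabel can propagate without re-indexing, here is a minimal sketch of one common approach. It is not Clipto's actual implementation: the `IdentityStore` class, its fields, and the detection IDs are all hypothetical. The assumption is that the index stores only a stable cluster ID per face or voice detection, while the human-readable name lives in a small separate mapping, so a correction touches the mapping and never the media.

```python
from dataclasses import dataclass, field


@dataclass
class IdentityStore:
    """Hypothetical label layer kept separate from the (expensive) media index."""

    # cluster_id -> person name; the only thing a relabel has to change
    labels: dict[int, str] = field(default_factory=dict)
    # detection_id -> cluster_id; assumed to be written once at indexing time
    detections: dict[str, int] = field(default_factory=dict)

    def relabel(self, cluster_id: int, name: str) -> None:
        # One dictionary write; every detection in the cluster picks up the name.
        self.labels[cluster_id] = name

    def merge(self, keep: int, absorb: int) -> None:
        # Point the absorbed cluster's detections at the kept cluster.
        for det_id, cid in list(self.detections.items()):
            if cid == absorb:
                self.detections[det_id] = keep
        self.labels.pop(absorb, None)

    def split(self, detection_ids: list[str], new_cluster: int, name: str) -> None:
        # Move selected detections into a new cluster with its own name.
        for det_id in detection_ids:
            self.detections[det_id] = new_cluster
        self.labels[new_cluster] = name

    def find(self, name: str) -> list[str]:
        # Resolve the name to cluster IDs, then to detections.
        ids = {cid for cid, label in self.labels.items() if label == name}
        return sorted(d for d, cid in self.detections.items() if cid in ids)


store = IdentityStore(
    labels={1: "Alice", 2: "Unknown"},
    detections={"clip1@00:10": 1, "clip2@01:05": 2, "clip3@00:30": 2},
)
store.relabel(2, "Bob")        # fix a misidentified person
print(store.find("Bob"))
store.merge(keep=1, absorb=2)  # "Bob" was actually Alice all along
print(store.find("Alice"))
```

In this sketch a relabel is a single write and a merge or split only rewrites small ID mappings, which is why no video or audio would need to be processed again.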
