Cinestar - Privately Index and Search Through Media
byβ’
Search through images and videos backed by local LLMs. Private, fast, offline-first β powered by bge-large embeddings, moondream vision, and Whisper transcription.
For anyone with 10,000+ photos who can never find anything
Replies
Best
Maker
π
As someone who's always drowning in screenshots, videos, and random media files, I built Clipwise to solve my own problem: finding that one specific image or video moment when I actually need it.
Initial version is targeted towards:
Professional photographers/video editors who are slightly tech savy and comfortable using some ai inference tools
Tech Savy people, who have accumulated lot of photos/videos/audios over time and need a better search tool
Demo video:
Cinestar features:
π 100% local - Your media never leaves your machine π§ AI-powered - Search by describing what you see ("red car in parking lot") β‘ Instant results - No cloud delays, everything runs on your Mac π― Precise timestamps - Jump to exact moments in videos
Some of it is a work in progress, let me know the issues
Report
Maker
# Clipwise v0.1.3
TL;DR: Search your videos, images, and audio using natural language - now 40% faster!
## π― What's New
### β‘ Performance Boost
- 40% faster search during video processing (10s β 6s)
- Smart Ollama instance routing: dedicated resources for search, load-balanced for indexing
- No more waiting - search stays responsive even during heavy video processing
- Integrated video player with smart segment navigation
- Search by what you see AND what you hear: "person explaining code" or "sunset beach scene"
### π Fixes
- Native dependencies now work in packaged builds (Sharp, FFmpeg)
- Logo displays correctly in splash screen
- Smoother installation experience across all platforms
## ποΈ Architecture
- 2 Ollama instances with nginx load balancing
- Batch-concurrent processing pipeline
- SQLite + vector search for blazing fast queries
## π¬ UI Fixes
UI BUG showing alert dialogues. Sorry FOLKS!!
---
Try it: Download, run `docker-compose up -d`, and start searching your media library with natural language!
Report
Maker
# Clipwise v0.1.46
TL;DR: Stable Production Version
Report
Maker
# Cinestar (name change)
Release Date: October 11, 2025
## π― Overview
## β¨ Major Changes
### 1. Simplifying Installation
Removed dependency from docker, whisper is downloaded on request
Transcription is no longer availalble via api, but a cli command
### 2. Speed up local folder indexing
Local Folder connect indexing and searchability improved
Fixes in search and captioning. Allowing for tagging as well
Report
Maker
I understand, moving/downloading another new application, could be a hassle. Overall I would still want the option to use this privately hosted. The problem is with local, one needs to install any inference server, for example Ollama , is one of the easiest.
This seems to be a possible barrier to entry, not to mention, I can monetize it better. So would it be easier to deliver this via the web and a phone app?
That way say free tier users get only 5 videos or 30 minutes, have some data already setup for demo. On the app I can provide a toggle for `Incognito` mode, with Incognito on, it only relies on local inference servers, if not, BYOK or Use ours.
I really don't want to become a video/media provider, when I can rely on external integrations, so google, dropbox is something that is planned. So I don't have much motivation to store or ask users to upload their videos separately . Infact on a phone app things become much easier.
Option to use online models for people who can't run ollama and setup models
Automatic download of models without manual intervention
Internal Changes to handle data better.
Report
Maker
This update focuses on smarter video understanding, faster performance, and a more flexible privacy model that gives you the choice between local and external AI providers.
π Smarter Video Search
Video indexing is now multi-modal β combining audio transcription and visual scene understanding for precise, searchable context.
Audio Phase (Whisper): 20-min videos transcribed in ~60s
Visual Phase: Keyframe extraction + captioning for scene meaning
Smart Segmentation: Every 30s segment gets its own searchable context
Incremental Indexing: Start searching before processing finishes
Itβs now easier than ever to find that exact moment in a long video β just type what was said or shown.
πΌοΈ Better Image Search
Images now go through multi-pass captioning β overview β details β context β giving the search engine richer understanding of each picture.
And weβve opened up privacy settings: You can now choose between 100% local processing via Ollama, or opt-in to external providers like OpenAI or Google Gemini for higher-quality captions and embeddings.
Drillbit stays local by default β but now you decide the balance between speed, quality, and privacy.
π΅ Audio Support
Audio files get full transcription and indexing too. Search podcasts, interviews, or voice memos by meaning β not filenames. Waveform visualization and playback controls make navigation effortless.
π Unified Search Experience
Search everything from one place β images, videos, and audio β with instant results.
Hybrid Search: Combines AI semantic matching + keyword text search
Context-Aware Results: Jump straight to relevant timestamps in videos or audio
Deduplication: Smarter grouping of segments and parent files
Real-Time Updates: Search results appear as you type
The experience feels like a private, offline version of βGoogle Photos + ChatGPTβ β but built for your local drives.
π¨ Refreshed Interface (Driller v2)
A complete redesign focused on clarity and speed:
Sleek, glassy theme with soft animations
Scope filters (All, Folders, S3, Drive)
Activity panel to track ongoing indexing
Infinite scroll & smooth virtualization for large collections
Full-page media viewers replace old modals for a cleaner browsing flow β no z-index chaos, just focused exploration.
β‘ Performance Boosts Youβll Notice
Sub-1s total search time
~100s for full AI indexing of a 20-min video
2β3s image caption generation
0.5s embedding generation
Everything feels faster and more responsive, even while indexing is running in the background.
π Privacy β Now with Options
Previously, Drillbit was strictly offline. Now, you can selectively enable external AI providers for tasks that benefit from cloud-grade models β while keeping everything else local.
Your data is never uploaded by default. External providers are opt-in, transparent, and reversible at any time.
π§ Coming Next
Cloud & NAS integrations (S3, Google Drive)
Smart collections & auto-clustering
Advanced filters (date, people, location)
Batch tagging & export options
PWA support
Weβre focused on making Cinestar scale to 100K+ media files while keeping it private, fast, and personal.
Version: 0.1.64 Release: October 2024 License: MIT
Find anything β locally or with help from AI β while staying in control of your data.
Made with β€οΈ for people who value privacy and want powerful local search
Replies
As someone who's always drowning in screenshots, videos, and random media files, I built Clipwise to solve my own problem: finding that one specific image or video moment when I actually need it.
Initial version is targeted towards:
Professional photographers/video editors who are slightly tech savy and comfortable using some ai inference tools
Tech Savy people, who have accumulated lot of photos/videos/audios over time and need a better search tool
Demo video:
Cinestar features:
π 100% local - Your media never leaves your machine
π§ AI-powered - Search by describing what you see ("red car in parking lot")
β‘ Instant results - No cloud delays, everything runs on your Mac
π― Precise timestamps - Jump to exact moments in videos
Some of it is a work in progress, let me know the issues
# Clipwise v0.1.3
TL;DR: Search your videos, images, and audio using natural language - now 40% faster!
## π― What's New
### β‘ Performance Boost
- 40% faster search during video processing (10s β 6s)
- Smart Ollama instance routing: dedicated resources for search, load-balanced for indexing
- No more waiting - search stays responsive even during heavy video processing
### π¬ Video Intelligence
- Videos searchable in 60 seconds after upload
- Multi-modal understanding: combines audio transcription + visual keyframes + scene context
- Integrated video player with smart segment navigation
- Search by what you see AND what you hear: "person explaining code" or "sunset beach scene"
### π Fixes
- Native dependencies now work in packaged builds (Sharp, FFmpeg)
- Logo displays correctly in splash screen
- Smoother installation experience across all platforms
## ποΈ Architecture
- 2 Ollama instances with nginx load balancing
- Batch-concurrent processing pipeline
- SQLite + vector search for blazing fast queries
## π¬ UI Fixes
UI BUG showing alert dialogues. Sorry FOLKS!!
---
Try it: Download, run `docker-compose up -d`, and start searching your media library with natural language!
# Clipwise v0.1.46
TL;DR: Stable Production Version
# Cinestar (name change)
Release Date: October 11, 2025
## π― Overview
## β¨ Major Changes
### 1. Simplifying Installation
Removed dependency from docker, whisper is downloaded on request
Transcription is no longer availalble via api, but a cli command
### 2. Speed up local folder indexing
Local Folder connect indexing and searchability improved
Fixes in search and captioning. Allowing for tagging as well
I understand, moving/downloading another new application, could be a hassle. Overall I would still want the option to use this privately hosted. The problem is with local, one needs to install any inference server, for example Ollama , is one of the easiest.
This seems to be a possible barrier to entry, not to mention, I can monetize it better. So would it be easier to deliver this via the web and a phone app?
That way say free tier users get only 5 videos or 30 minutes, have some data already setup for demo. On the app I can provide a toggle for `Incognito` mode, with Incognito on, it only relies on local inference servers, if not, BYOK or Use ours.
I really don't want to become a video/media provider, when I can rely on external integrations, so google, dropbox is something that is planned. So I don't have much motivation to store or ask users to upload their videos separately . Infact on a phone app things become much easier.
Any opinions?
Website url changed to: https://cinestar.sourceforge.io/
New Updates Coming for:
Deduplication
Organisation
Option to use online models for people who can't run ollama and setup models
Automatic download of models without manual intervention
Internal Changes to handle data better.
This update focuses on smarter video understanding, faster performance, and a more flexible privacy model that gives you the choice between local and external AI providers.
π Smarter Video Search
Video indexing is now multi-modal β combining audio transcription and visual scene understanding for precise, searchable context.
Audio Phase (Whisper): 20-min videos transcribed in ~60s
Visual Phase: Keyframe extraction + captioning for scene meaning
Smart Segmentation: Every 30s segment gets its own searchable context
Incremental Indexing: Start searching before processing finishes
Itβs now easier than ever to find that exact moment in a long video β just type what was said or shown.
πΌοΈ Better Image Search
Images now go through multi-pass captioning β overview β details β context β giving the search engine richer understanding of each picture.
And weβve opened up privacy settings:
You can now choose between 100% local processing via Ollama, or opt-in to external providers like OpenAI or Google Gemini for higher-quality captions and embeddings.
Drillbit stays local by default β but now you decide the balance between speed, quality, and privacy.
π΅ Audio Support
Audio files get full transcription and indexing too. Search podcasts, interviews, or voice memos by meaning β not filenames.
Waveform visualization and playback controls make navigation effortless.
π Unified Search Experience
Search everything from one place β images, videos, and audio β with instant results.
Hybrid Search: Combines AI semantic matching + keyword text search
Context-Aware Results: Jump straight to relevant timestamps in videos or audio
Deduplication: Smarter grouping of segments and parent files
Real-Time Updates: Search results appear as you type
The experience feels like a private, offline version of βGoogle Photos + ChatGPTβ β but built for your local drives.
π¨ Refreshed Interface (Driller v2)
A complete redesign focused on clarity and speed:
Sleek, glassy theme with soft animations
Scope filters (All, Folders, S3, Drive)
Activity panel to track ongoing indexing
Infinite scroll & smooth virtualization for large collections
Full-page media viewers replace old modals for a cleaner browsing flow β no z-index chaos, just focused exploration.
β‘ Performance Boosts Youβll Notice
Sub-1s total search time
~100s for full AI indexing of a 20-min video
2β3s image caption generation
0.5s embedding generation
Everything feels faster and more responsive, even while indexing is running in the background.
π Privacy β Now with Options
Previously, Drillbit was strictly offline.
Now, you can selectively enable external AI providers for tasks that benefit from cloud-grade models β while keeping everything else local.
Your data is never uploaded by default. External providers are opt-in, transparent, and reversible at any time.
π§ Coming Next
Cloud & NAS integrations (S3, Google Drive)
Smart collections & auto-clustering
Advanced filters (date, people, location)
Batch tagging & export options
PWA support
Weβre focused on making Cinestar scale to 100K+ media files while keeping it private, fast, and personal.
Version: 0.1.64
Release: October 2024
License: MIT
Find anything β locally or with help from AI β while staying in control of your data.
Made with β€οΈ for people who value privacy and want powerful local search