TextBehind is a free browser-based tool that uses AI to automatically separate foregrounds from backgrounds. It lets creators place text behind subjects to create professional depth effects for YouTube thumbnails, magazine covers, and social media posts—all in seconds
I built this Video Caption Generator because I wanted studio-quality subtitles without the studio budget. The catch? I’m bootstrapping with $0, so no fancy GPUs here. It uses heavy-duty Hugging Face models to analyze your video frame-by-frame.
It’s definitely not instant—you might actually have time to touch grass while it renders. But unlike fast tools that hallucinate, this one actually watches your video to generate context-aware captions. It’s slow, free, and painstakingly accurate.