Browser use agents tend to prefer the models' native multimodality over concrete source, and, even if they do, they still tend to take too much context to even barely function. Then I came up with an idea. What if I can just... send the rendered DOM to the agent, but compressed?
And, that's what TideSurf is. It reduces token consumption by thirty-two times on GitHub (vs. raw DOM), at least according to my experiments, while only taking ~30ms to parse.
Also comes w/ 18 interactive tools.