Kernel Sphere is a DOM-native automation platform for developers, built on Google Gemini and Playwright. You describe a task in plain English and give it a starting URL. The agent opens a real browser, reads the page, decides what to do at each step, and returns a structured result when it's done. It's open source. If you build on top of it, run into something it can't handle, or want to improve the extraction logic, contributions are welcome.
github.com/kernelsphere-ai/kernelsphere
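
The read, decide, act loop described above can be illustrated with a minimal sketch. This is not Kernel Sphere's actual API: every name here (`FakeBrowser`, `scripted_policy`, `run_agent`, `Action`, `Result`) is hypothetical, the browser is an in-memory stand-in for a real Playwright session, and the policy is a hard-coded stand-in for the model call, so the example runs with the standard library alone.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Action:
    kind: str          # "type", "click", or "done"
    target: str = ""   # selector the action applies to
    value: str = ""    # text to type, or the final answer for "done"


@dataclass
class Result:
    success: bool
    answer: str
    steps: List[str] = field(default_factory=list)


class FakeBrowser:
    """Hypothetical stand-in for a real browser page; holds a tiny search form."""

    def __init__(self, url: str) -> None:
        self.url = url
        self.query = ""
        self.submitted = False

    def read_page(self) -> str:
        # A real agent would serialize the DOM here; this returns a text summary.
        if self.submitted:
            return f"results for '{self.query}'"
        return f"search page: input#q={self.query!r}, button#go"

    def apply(self, action: Action) -> None:
        if action.kind == "type":
            self.query = action.value
        elif action.kind == "click" and action.target == "button#go":
            self.submitted = True


def scripted_policy(task: str, page_text: str) -> Action:
    """Hypothetical stand-in for the model: picks the next action from the page text."""
    if page_text.startswith("results"):
        return Action("done", value=page_text)
    if "input#q=''" in page_text:
        # Assumption for this toy task: the search term is the last word.
        return Action("type", target="input#q", value=task.split()[-1])
    return Action("click", target="button#go")


def run_agent(task: str, start_url: str,
              decide: Callable[[str, str], Action], max_steps: int = 10) -> Result:
    browser = FakeBrowser(start_url)
    steps: List[str] = []
    for _ in range(max_steps):
        page_text = browser.read_page()    # read
        action = decide(task, page_text)   # decide
        steps.append(action.kind)
        if action.kind == "done":
            return Result(True, action.value, steps)
        browser.apply(action)              # act
    # Step budget exhausted without the policy declaring the task done.
    return Result(False, "", steps)


result = run_agent("search for playwright", "https://example.com", scripted_policy)
print(result)
```

The step budget (`max_steps`) is a design choice in this sketch: it keeps a policy that never reaches "done" from looping forever, and the failure case still returns a structured result rather than raising.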

Kernel Sphere: A web agent that reads, reasons, and acts in a real browser.
Kernelsphere left a comment
Building Kernel Sphere taught me a lot about where AI agents still struggle. Login flows, CAPTCHAs, multi-step forms, and sites that rely heavily on JavaScript all have their own failure modes. A lot of the work went into handling those edge cases rather than the happy path. We tested it on WebVoyager (643 tasks across 15 real websites) and hit 48% accuracy. There's meaningful room to improve,...

