Open Computer Use

Open-source Computer Use MCP for AI agents

90 followers

Open-source Computer Use MCP for AI agents

90 followers

Visit website

Automation tools

Open Computer Use turns local desktop automation into a standard MCP service. It lets Codex, Claude Code, Gemini CLI, opencode, and custom MCP clients inspect apps, click, type, scroll, drag, and take screenshots across macOS, Linux, and Windows. It is open source, npm-installable, and designed to bring the non-intrusive Codex Computer Use experience to any agent stack.

Free

Launch tags:Open Source•Developer Tools•Artificial Intelligence

Launch Team

Viktor.comThe AI employee that does the work, in Slack & Teams

Promoted

Open Browser Use

Maker

📌

Hi Product Hunt, I built Open Computer Use because the new Computer Use experience should be available to any agent, not just one host. It wraps local desktop automation as MCP, so Codex, Claude Code, Gemini CLI, opencode, and custom clients can inspect apps, click, type, scroll, drag, and capture screenshots. The repo started from studying Codex Computer Use, then turned into a cross-platform runtime with macOS, Linux, and Windows support. It is installable with npm and open for people who want to study, extend, or plug Computer Use into their own agent stack. Feedback is especially welcome on reliability, Linux and Windows coverage, and host integrations.

Report

2mo ago

apideck

the new Computer Use experience should be available to any agent, not just one host.

I agree 100%.

Report

2mo ago

Love the idea of standardizing computer use via MCP. It opens up so many possibilities for custom agents. Do you think it could be used to automate data entry from legacy apps directly into something like a structured database or even a spreadsheet?

Report

2mo ago

Open Browser Use

Maker

@phatysddev Yes, exactly - that is one of the use cases I am most excited about.

Because Open Computer Use exposes desktop control through MCP, an agent can inspect a legacy app, click through forms, read or copy values, and write them into a structured database or spreadsheet. Beyond MCP, it also supports a CLI, JS/Python/Go SDKs, and Skills, so you can plug it into different agent stacks or build a more custom workflow around it.

For production-ish data entry flows, I would still recommend adding validation steps, screenshot/state checks, retries, and human review for sensitive fields, but the core automation path is supported.

Report

2mo ago

VibeAround

The interoperability story makes sense. The thing I would want in practice is a first-class notion of state assertions around actions, not just actions themselves: window X focused, field Y contains Z, screenshot region roughly matches expected state, and per-app permission envelopes.

That feels like the line between a very cool transport layer and something teams can trust for repetitive real work, especially once agents start chaining multiple desktop steps together.

Report

22d ago

The thing I would look for in this category is not just whether the agent can click a UI, but whether the run leaves enough evidence for a developer to trust it.

A strong first-run demo for a desktop automation MCP is small and inspectable: fresh app state, one approved task, screenshots or traces before and after the action, and a clear boundary around credentials and destructive clicks.

If those artifacts are easy to review, the tool becomes much more useful for real agent workflows because success is no longer just "the model said it worked."

Report

9d ago

Curious how you handle window focus switching when multiple apps are open — that's usually the brittle part in desktop automation. Does the MCP layer abstract that away or does the agent need to manage it?

Report

1mo ago

Local browser automation keeps data private — rare in this space. How does it handle CAPTCHAs or bot detection on aggressive sites?

Report

1mo ago

Most agent frameworks just roll their own desktop automation and call it done, you wrapping it as a proper MCP service so any agent can plug in is actually a smarter approach. The cross-platform support is a nice touch too. My question is how it handles rapid sequential clicks or form fills, does the MCP layer add enough latency to break those kinds of workflows?

Report

1mo ago

1 2

Reviews

No reviews yetBe the first to leave a review for Open Computer Use