Launching today

Deep Work Plan

Launching today

Models matter. Context matters more. Give your agent a plan.

50 followers

Models matter. Context matters more. Give your agent a plan.

50 followers

Visit website

AI Coding Agents

•

LLM Developer Tools

Deep Work Plan turns any repo into a harness with the context of your best engineer — so any AI agent codes like your smartest model and can't drift from the plan. Not a chat window it forgets, a spec written into the repo: atomic tasks, acceptance criteria, validation gates, resumable state. Long runs survive context resets; any agent picks up where the last left off. Point an agent at it, walk away, come back to work you can verify. Any agent, any repo, no lock-in. Open Source, MIT.

Free

Launch tags:Open Source•Developer Tools•Artificial Intelligence

Launch Team / Built With

Fin Startups get Fin free for a year + 93% off Intercom

Promoted

Deep Work Plan

Maker

📌

Hi Product Hunt 👋 Models matter. Context matters more. That one line is the whole reason this exists. I build with AI agents every day, and I kept hitting the same wall: an agent starts a long task brilliantly, then somewhere around hour three it quietly drifts. The diff still compiles — it's just not what I asked for. There was never a clean way to resume, because the whole plan lived in a chat window that had grown too long to trust. I stopped treating that as a prompting problem and started treating it as a structural one. The fix wasn't a smarter model. It was giving the agent a plan it couldn't drift from — written into the repository itself. That's Deep Work Plan. The idea is two moves: 1) Make the plan the source of truth, not the chat. Before any code, you write a spec: a goal, atomic tasks, and for each task explicit acceptance criteria + a validation gate. "Done" is decided by the gate, not by how the model feels. And it lives on disk, so it survives a context reset or a handoff to a different agent tomorrow. 2) Let the repository be the harness. The context (files), the tools (your scripts and tests), the guardrails (the plan and its gates), the state (on disk) — all of it lives in the repo as plain files any agent can read. So it's tool-agnostic: Claude Code, Codex, Cursor, or next year's agent can all run the same plan. No vendor to bet on. What I'm proudest of is that it's not a slide. It's dogfooded across three repos — including the site that documents it. It's MIT, and you can install it into your own repo in one step at deepworkplan.com/init. If your agents start strong and wander by hour three, I'd genuinely love your take. How are you keeping long-horizon agent work on track today?

Report

3d ago

Mailwarm

How do you keep the plan from getting stale as humans change the codebase between runs?

Report

2h ago

Deep Work Plan

Maker

@naimz Great question, honestly the failure mode I worried about most while designing this, because it's the one most "just write a spec" approaches quietly ignore. The plan isn't a snapshot of the code, so it doesn't rot like one. DWP handles drift on three fronts.

The first is that I write tasks as behavior, not edits. An acceptance criterion in a DWP task reads like "`POST /login` rate-limits to 10 attempts per IP per minute and returns 429 with a `Retry-After` header," not "add a Redis client in `auth.ts` and wrap the handler." So if a teammate swaps the store for an in-memory cache between runs, lifts the check into a CDN rule, or just renames the file, nothing in my plan is invalidated, the criterion is still expressible against the current code.

The second is that every task carries its own validation gate, and the gate re-runs against the repo as it is right now, not the repo as it was when I wrote the plan. So if someone broke an assumption between runs, the next run fails loudly at that gate instead of drifting silently, and that failure is my cue to `refine` before continuing, not to paper over it.

The third is that I made keeping the spec in sync with the code part of the work, not a separate chore. Any DWP task that changes behavior also updates the `docs/`, `AGENTS.md`, and `.agents/` kit that describe it, re-syncing the repo's agent-facing surface is part of the task's validation gate. On top of that, every plan ends with a security-analysis pass and a skill-discovery step that proposes new reusable skills out of what was just built. It's basically the Boy Scout rule applied to the harness, every run is meant to leave the codebase a little more agent-ready than it found it, not more stale.

If you want the longer take on why I built it this way, the methodology write-up walks through it: https://deepworkplan.com/methodology/

Report

27m ago

Writing the plan into the repo rather than the context window is the right architecture. Durable state that survives model swaps and context resets is what makes long multi-step tasks actually viable. The validation gate pattern catches drift before it compounds. How are the gates implemented? Are they executable assertions the agent runs itself, or do they require human sign-off?

Report

40m ago

If you want the longer take on why I built it this way, the methodology write-up walks through it: https://deepworkplan.com/methodology/