Agentspan

Open-source runtime for durable AI agents

96 followers

Open-source runtime for durable AI agents

96 followers

Visit website

AI Infrastructure Tools

•

AI Workflow Automation

Agentspan is an open-source server and SDK for running AI agents as durable workflows. You can define agents programmatically, execute them server-side, and inspect each run and execution state in the UI. Agentspan adds crash recovery, human-in-the-loop approvals, guardrails, tool history, and observability around the agent frameworks and LLMs you already use. MIT licensed.

Free

Launch tags:API•Open Source•Developer Tools

Launch Team

Framer 3.0With Agents, Branching Community and an all-new design

Promoted

Maker

📌

Hey Product Hunt, We built Agentspan because production agent execution gets messy fast, and we're working to fix that. Common issues include state loss, human approvals needing resume logic, tool calls needing auditing, and retries causing repeated side effects. Agentspan gives agents a durable execution layer. You define agents client-side, but execution state, tool history, approvals, and observability live on the server. The goal is to make agents easier to operate and debug without forcing teams to abandon the frameworks or models they already use. The project is open source and MIT-licensed. Check out the repo at https://github.com/agentspan and the quickstart at https://agentspan.ai/docs/quicks....

Report

2mo ago

Elentaria

@nickorkes Congrats! Looks amazing, it's super cool for people who want to just focus on the code and don't spend too much time on the infra.

QQ, maybe trivial since I didn't check the codebase in detail, but by server, you mean it's still local, right? Not based on any specific cloud provider. It could be amazing to see adapter/connectors/versions on major cloud providers too, and have it super easy to deploy with few line of code (then no need to learn anything major from any provider side).

Report

2mo ago

Maker

@khashayar_mansourizadeh1 Thanks! Agentspan can definitely be installed locally, but it doesn't have to be. See https://agentspan.ai/docs/deployment/. Great feedback on cloud-specific connectors though. That would make it very easy to get up and running.

Report

2mo ago

Elentaria

@nickorkes Yes indeed and thanks for explanation, wish you all the best!

Report

1mo ago

Durable AI agents that survive failures and interruptions is one of the harder infrastructure problems right now. Open-sourcing the runtime is a real commitment to the ecosystem. We've been building in the customer success for developer tool companies space at RetainSure, and Agentspan touches on something we think about a lot: how agent persistence changes what's possible in long-running business workflows. What's your approach to handling state when agents run for hours or days?

Report

2mo ago

Maker

@shivam_jaiswal21 the way we approach state is thinking of it in terms of long-term durable workflows. Each agent run persists server-side as a workflow with a long lived execution ID, backed by a DB. If something interrupts the agent's execution, it can then resume from wherever it left off.

Report

2mo ago

Crash recovery for agents is the thing nobody talks about until it breaks in production. We've had workflows silently fail partway through with no state to resume from. Human in the loop approvals are the other piece teams always bolt on last minute. Does Agentspan support branching approvals, where different steps route to different reviewers?

Report

2mo ago

Maker

@dhiraj_patel5 yes, crash recovery is super important and a primary factor in us building this. Agentspan supports approvals as a first-class tool, though the branching logic would live in your agent/workflow code today.

Report

2mo ago

The durability layer is the piece most agent frameworks skip. We're building AI workflows at RetainSure and the biggest headache isn't the LLM calls, it's what happens when a step fails partway through and the state is gone. Keeping execution state server side while defining agents client side is a clean separation. Does Agentspan support partial retries, or does a failure restart the whole run?

Report

2mo ago

Maker

@dhiraj_patel5 yes, that's part of the design. We worked hard on crash resume being a core part of the project for the reasons you mentioned. Now, how the reconciliation works may need to be part of the workflow code you write as it might very agent to agent. But the fact that history and run state persists server-side makes that possible.

Report

2mo ago

Does a failed tool call mid-run restart from the last checkpoint or from the top? Most frameworks (afaik) treat the whole run as atomic, which defeats the point for long-running agents.

Report

1mo ago

Maker

@kumarran it resumes at the last checkpoint. That's actually a main motivation of why we developed agentspan in the first place. So that the execution itself is durable.

Report

1mo ago

@nickorkes Perfect. Wish you the best!

Report

1mo ago

The durable runtime angle is the part I’d look at first. For agent teams, the hard bit is usually not starting a run, it’s resuming state, handling approvals, and seeing exactly what changed after a long task.

Report

2mo ago

Maker

@new_user___2672025cf1bc18102609b53 exactly. Those are core production failure modes this project works hard to address.

Report

2mo ago

That makes sense. The part I’d stress in the docs is replay around approvals and retries, because repeated side effects are where durable agent runs get scary in productiThat makes sense. The part I’d stress in the docs is replay around approvals and retries, because repeated side effects are where durable agent runs get scary in production. A small example showing failed tool call -> resume -> audit trail would make the value click fast.on. A small example showing failed tool call -> resume -> audit trail would make the value click fast.

Report

2mo ago

Pros

Cons

Reviews