forking a run and editing what a specific tool call returned, without re-firing the real API, is exactly the debugging workflow I've been missing. most agent tools let you replay a whole run or nothing, this lets you pin one step and see how everything downstream reacts. the per-step override map the maker described in the comments (span id to output) means you can script sweeps of "what if X" scenarios instead of clicking through each one by hand.