Honestly, it just works. You point it at a real codebase, give it a messy multi-step task, and it actually finishes: reads the right files, edits them, runs the tests, reports back. The skills system is the part I didn't know I needed. Instead of hoping the model behaves well, you literally wire in the discipline (TDD, brainstorming, code review) and it follows it. Plan mode + sub-agents mean I can scope big work upfront and let it run in the background while I do something else. And the memory thing? Game changer. I tell it once, "don't use uv in this repo," and it remembers next time without me repeating myself.