Giovambattista Fazioli

Claude Opus 4.8 just dropped — who’s tried it yet?

Anthropic released Claude Opus 4.8 yesterday, and from the announcement it looks like a solid upgrade over 4.7. Here’s what stood out to me:

  • Better judgment on agentic tasks — early testers say it catches its own mistakes, pushes back on bad plans, and flags uncertainties instead of bulldozing through

  • ~4x less likely to let code flaws pass unremarked compared to Opus 4.7

  • Dynamic Workflows in Claude Code — can now orchestrate hundreds of parallel subagents for codebase-scale migrations

  • Effort control — you can now choose how hard the model thinks on each response (from fast/light to max effort)

  • Fast mode is 3x cheaper than it was for previous models

  • Same pricing as Opus 4.7: $5/M input, $25/M output
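For a rough sense of what that pricing means per request, here is a back-of-the-envelope sketch. It uses only the two per-million-token prices quoted above; the function name and the example token counts are made up for illustration, and it ignores anything the post does not mention (caching, batch discounts, fast-mode pricing).

```python
# Prices quoted in the post, in USD per million tokens.
INPUT_USD_PER_MTOK = 5.0
OUTPUT_USD_PER_MTOK = 25.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical request: 200k tokens in, 20k tokens out.
print(f"${estimate_cost_usd(200_000, 20_000):.2f}")  # prints $1.50
```

Output tokens cost five times as much as input tokens at these rates, so long generations dominate the bill well before long prompts do.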

The TechCrunch piece notes this came just 41 days after Opus 4.7, which had a mixed reception. Cursor, Devin, and Bridgewater are among the early testers reporting real improvements.

For those who’ve already tried it:

  1. Have you noticed a real difference vs. 4.7 in everyday use?

  2. Is the “honesty” improvement (flagging issues instead of silently pushing through) noticeable in practice?

  3. Anyone tested Dynamic Workflows yet? Curious how it handles large-scale refactors.

Would love to hear first impressions from the community.

Replies

Dilek Kesim

I’m especially curious about whether Opus 4.8 improves consistency with structured JSON outputs. I’m currently building JS2Mail, a tool that turns JSON into responsive email templates, so reliable and predictable outputs are incredibly important. Even small gains in consistency can have a big impact on real-world automation workflows.

Giovambattista Fazioli

@dilek_kesim this is the one i care about too tbh. structured json was the thing that used to bite me - youd get 95% valid then one response randomly drops a field or wraps it in ```json fences.

from what ive seen so far 4.8 is noticeably more consistent on schema adherence, esp on longer outputs where it used to fall apart. still not 100% so id keep a validator + retry in the loop, dont trust it blind. JS2Mail sounds cool, that exact use case (json -> templates) is where strict outputs matter most. would be curious how it holds up on your real payloads.
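A minimal sketch of what "validator + retry in the loop" could look like, standard library only. Everything named here is an assumption for illustration: `call_model` stands in for whatever function actually sends the prompt and returns the raw text, and `REQUIRED_FIELDS` is a made-up schema, not JS2Mail's real one. It strips a wrapping code fence if present, parses, checks required fields, and feeds the problems back into the next attempt.

```python
import json
from typing import Callable

FENCE = "`" * 3  # the three-backtick marker models sometimes wrap JSON in

# Hypothetical schema for illustration: required field name -> expected type.
REQUIRED_FIELDS = {"subject": str, "blocks": list}


def strip_fences(text: str) -> str:
    """Remove a wrapping code fence (with optional language tag) if present."""
    text = text.strip()
    if len(text) >= 2 * len(FENCE) and text.startswith(FENCE) and text.endswith(FENCE):
        body = text[len(FENCE):-len(FENCE)]
        first_line, sep, rest = body.partition("\n")
        tag = first_line.strip()
        # Drop the opening line when it is empty or a language tag like "json".
        if sep and (tag == "" or tag.isalpha()):
            body = rest
        return body.strip()
    return text


def validate(payload: object) -> list[str]:
    """Return a list of problems; an empty list means the payload passed."""
    if not isinstance(payload, dict):
        return ["top-level value is not an object"]
    problems = []
    for name, expected in REQUIRED_FIELDS.items():
        if name not in payload:
            problems.append(f"missing field: {name}")
        elif not isinstance(payload[name], expected):
            problems.append(f"wrong type for {name}")
    return problems


def get_valid_json(call_model: Callable[[str], str], prompt: str,
                   max_attempts: int = 3) -> dict:
    """Call the model until its output parses and validates, or give up."""
    feedback = ""
    for _ in range(max_attempts):
        raw = call_model(prompt + feedback)
        try:
            payload = json.loads(strip_fences(raw))
        except json.JSONDecodeError as exc:
            feedback = f"\n\nPrevious reply was not valid JSON ({exc.msg}). Reply with JSON only."
            continue
        problems = validate(payload)
        if not problems:
            return payload
        feedback = "\n\nPrevious reply had problems: " + "; ".join(problems)
    raise ValueError(f"no valid JSON after {max_attempts} attempts")
```

Raising after the last attempt, instead of returning a partial payload, keeps a dropped field from silently reaching the template step. For real payloads a proper schema validator would replace the hand-rolled `validate`.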

Volodymyr Demchenko

Opus 4.8 is so good with custom harnesses; it reads and follows them much better than 4.7 ever did. Same with custom skills, and it's much more eager to fix the friction it encounters, rather than taking 4.7's "not my problem" approach.

Robert Young

Claude's been pretty bad lately. It's actually forgetting things in VS Code during compaction. I've noticed the difference big time.

Mahmoud Zalt

Built my entire SaaS (Sistava) using Claude Code. Upgraded to 4.8 and noticed that the Dynamic Workflows/Ultra-Code mode is great for large-scale tasks. For smaller tasks, though, it doesn't really seem to give better results.