Opus 4.7 is somewhere between seriously clueless and stupidly dangerous. The worst frontier model I have used so far in the past 2 years. We were hoping to get at least our 4.6 back but 4.7 with so many critical logical failures mean you have to babysit it all the time. I'm losing hope in Anthropic.
Opus 4.7 on Max effort decided to create a new email template by itself (which is pretty stupid btw) and mass mailed it to the whole database (some emails were repeatedly sent 20x). Before you ask me - yes, [CLAUDE.md](http://CLAUDE.md) has the exact rule for that, it's supposed to email the tester before any new email templates are to be used in production. I have created this safety rule a few months ago. I feel like the Opus 4.7 is a huge letdown the way it's been downgraded. If Anthropic is "pushing the boundaries", it's probably only in the meaning of how far they can push the...