Found 48 Vulnerabilities in Open Source Projects During Live Testing with Claude Opus 4.6
https://preview.redd.it/g98j5txd7sxg1.png?width=936&format=png&auto=webp&s=df75bc132f57cc14ba04cdd06257ba997b9bbb0b Ran a loop where each round runs Claude in a sandboxed Docker container with a fresh context window. The key difference is that the goal is **objective and verifiable.** When I ran it on a repo, I noticed that during rounds 1-2, it found several independent low-risk vulnerabilities, but then, from round 3 onward, it started chaining them into critical exploits. This emergent behavior makes it very interesting.