Anthropic research scientist Nicholas Carlini demonstrated that Claude Code can discover critical security vulnerabilities in the Linux kernel, including a heap buffer overflow in the NFS driver that had remained undetected since 2003. By using a simple bash script to iterate through source files with minimal prompting, the AI identified five confirmed vulnerabilities across various components like io_uring and futex. This discovery marks a significant shift in cybersecurity, as Linux kernel maintainers report a surge in high-quality vulnerability reports from AI agents.
Key points:
* Claude Code discovered a 23-year-old NFS driver bug using basic automation.
* Significant capability jump observed between older models and Opus 4.6.
* Kernel maintainers are seeing a massive increase in daily, accurate security reports.
* LLM agents may represent a new category of tool that combines the strengths of fuzzing and static analysis.
* Concerns exist regarding the dual-use nature of these tools for adversaries.