SecureVibeBench evaluates AI agents' ability to write secure C/C++ code, highlighting challenges in preventing vulnerabilities in AI-generated software.
Explore SWE-chat, a large dataset capturing real AI coding agent interactions with developers, revealing usage patterns, efficiency, and security issues.