News

Comparing AI agents to cybersecurity professionals in real-world pen testing

  • littlexsparkee--Arxiv.org
  • published date: 2026-01-06 21:23:07 UTC

We present the first comprehensive evaluation of AI agents against human cybersecurity professionals in a live enterprise environment. We evaluate ten cybersecurity professionals alongside six existing AI agents and ARTEMIS, our new agent scaffold, on a large…
