On the Potential of LLMs for Offensive Security: Benchmarks vs. Operational Reality

  • Ruben Missotten
  • Vera Rimmer
  • Wim Mees
  • Lieven Desmet
  • KU Leuven
  • Royal Military Academy

Research output: Chapter in book/report/conference proceedings › Conference contribution › Peer-reviewed

Abstract

Large Language Models (LLMs), through their strong capabilities in code generation, reasoning, and tool use, have demonstrated promising results in security tasks involving vulnerability discovery and exploitation. However, evaluating their offensive potential in automating penetration testing - a more complex and multi-stage process - remains a critical research challenge. While existing evaluation frameworks effectively demonstrate LLM capabilities in isolated or simplified scenarios, they often do not extend toward the complexity of interconnected attack chains characteristic of real-world adversarial operations. In this analytical study, we examine the challenge of assessing the feasibility of LLM-powered automation across the full adversarial pipeline within realistic environments. We contribute an analysis of current benchmarks and associated environments, and highlight opportunities for methodological enhancements that would strengthen alignment between academic evaluations and operational realities.

Original language: English
Title of host publication: Proceedings - 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 420-427
Number of pages: 8
ISBN (Electronic): 9798331545369
DOIs
Publication status: Published - 2025
Event: 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025 - Honolulu, United States
Duration: 8 Dec 2025 to 12 Dec 2025

Publication series

Name: Proceedings - 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025

Conference

Conference: 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025
Country/Territory: United States
City: Honolulu
Period: 8/12/25 to 12/12/25
