On the Potential of LLMs for Offensive Security: Benchmarks vs. Operational Reality

  • Ruben Missotten
  • Vera Rimmer
  • Wim Mees
  • Lieven Desmet
  • KU Leuven
  • Royal Military Academy

Publication: Contribution to book/report/conference proceedings › Conference contribution › Peer-reviewed

Abstract

Large Language Models (LLMs), through their strong capabilities in code generation, reasoning, and tool use, have demonstrated promising results in security tasks involving vulnerability discovery and exploitation. However, evaluating their offensive potential in automating penetration testing - a more complex and multi-stage process - remains a critical research challenge. While existing evaluation frameworks effectively demonstrate LLM capabilities in isolated or simplified scenarios, they often do not extend toward the complexity of interconnected attack chains characteristic of real-world adversarial operations. In this analytical study, we examine the challenge of assessing the feasibility of LLM-powered automation across the full adversarial pipeline within realistic environments. We contribute an analysis of current benchmarks and associated environments, and highlight opportunities for methodological enhancements that would strengthen alignment between academic evaluations and operational realities.

Original language: English
Title: Proceedings - 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 420-427
Number of pages: 8
ISBN (electronic): 9798331545369
DOIs
Publication status: Published - 2025
Event: 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025 - Honolulu, United States
Duration: 8 Dec 2025 to 12 Dec 2025

Publication series

Name: Proceedings - 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025

Conference

Conference: 2025 Annual Computer Security Applications Conference Workshops, ACSACW 2025
Country/Territory: United States
City: Honolulu
Period: 8 Dec 2025 to 12 Dec 2025
