lucagioacchini / auto-pen-benchView external linksLinks
This repo contains the codes of the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing". It contains also the instructions to install, develop and test new vulnerable containers to include in the benchmark.
☆64Oct 28, 2025Updated 3 months ago
Alternatives and similar repositories for auto-pen-bench
Users that are interested in auto-pen-bench are comparing it to the libraries listed below
Sorting:
- The goal of this repo is to become a benchmark for pentesting☆19Oct 25, 2024Updated last year
- [IEEE T-IFS] AutoPT: How Far Are We from the Fully Automated Web Penetration Testing?☆30Aug 18, 2025Updated 5 months ago
- The Pentest Agent System is an autonomous penetration testing framework built on the MITRE ATT&CK framework.☆30Apr 16, 2025Updated 9 months ago
- Advancing TTP Analysis: Harnessing the Power of Large Language Models with Retrieval Augmented Generation☆11May 14, 2024Updated last year
- Notes for HungYi Lee - The Next Step for Machine Learning☆11Apr 21, 2022Updated 3 years ago
- ☆118Sep 22, 2025Updated 4 months ago
- Deep RL agents for NASimEmu. See also https://github.com/jaromiru/NASimEmu.☆15Jul 16, 2024Updated last year
- The official repository of the paper "The Digital Cybersecurity Expert: How Far Have We Come?" presented in IEEE S&P 2025☆24May 21, 2025Updated 8 months ago
- PentestAgent is a novel LLM-driven penetration testing framework to automate intelligence gathering, vulnerability analysis, and exploita…☆112Dec 20, 2025Updated last month
- YuraScanner☆73Feb 13, 2025Updated last year
- PenGym: Pentesting Training Framework for Reinforcement Learning Agents☆54Dec 19, 2024Updated last year
- ☆193Dec 13, 2025Updated 2 months ago
- An overview of LLMs for cybersecurity.☆1,212Updated this week
- https://arxiv.org/abs/2412.02776☆67Dec 5, 2024Updated last year
- datacon比赛2024年漏洞分析赛道解题框架与运行镜像压缩包☆183Jun 10, 2025Updated 8 months ago
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function…☆23Sep 14, 2025Updated 5 months ago
- ☆27Feb 19, 2024Updated last year
- This is the official repository for the ICLR 2025 accepted paper Badrobot: Manipulating Embodied LLMs in the Physical World.☆41Jun 26, 2025Updated 7 months ago
- Cybersecurity Intelligent Pentesting Helper for Ethical Researcher (CIPHER). Fine tuned LLM for penetration testing guidance based on wri…☆35Dec 24, 2024Updated last year
- Mysql4 on Docker☆11May 20, 2016Updated 9 years ago
- LLM Agent and Evaluation Framework for Autonomous Penetration Testing☆292Jun 24, 2025Updated 7 months ago
- ☆17Jan 22, 2026Updated 3 weeks ago
- Imports events from remotely-located iCalendar files into The Events Calendar plugin for WordPress.☆10Jun 26, 2025Updated 7 months ago
- ☆47May 27, 2023Updated 2 years ago
- Witcher is the first framework for using AFL to fuzz web applications.☆104Nov 28, 2023Updated 2 years ago
- ☆12Nov 16, 2020Updated 5 years ago
- The Tifinagh Hand-written Letters Dataset☆12Feb 17, 2024Updated last year
- ☆10Sep 6, 2024Updated last year
- OpenPGP in Python using Sequoia PGP☆17Oct 9, 2025Updated 4 months ago
- 常用CTF爆破字典整理☆14Apr 8, 2023Updated 2 years ago
- A penetration testing tool to help in Infrastructure pentesting process.☆11Sep 19, 2023Updated 2 years ago
- Curated list of Moroccans publishing in the most prestigious AI conferences☆10Oct 14, 2024Updated last year
- ☆10Feb 6, 2018Updated 8 years ago
- Gym-based environment for training offensive RL agents. Agents can generalize to unseen scenarios and simulation-trained agents can be de…☆42Sep 20, 2024Updated last year
- The repository of Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning.☆26Sep 8, 2025Updated 5 months ago
- Automatically generate tests for your website by using LLM models☆17Aug 7, 2023Updated 2 years ago
- Code and artifacts related to the Asia CCS 2022 paper☆38Nov 8, 2021Updated 4 years ago
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities☆146Jan 14, 2026Updated last month
- A library of prompts, intended for LibreChat. Note: Archived☆11Aug 14, 2023Updated 2 years ago