This repo contains the code for the experiments of the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing".
☆14 · Oct 28, 2025 · Updated 5 months ago
Alternatives and similar repositories for genai-pentest-paper
Users that are interested in genai-pentest-paper are comparing it to the libraries listed below.
- ⚡ FutureGPT - Application development framework that connects GPT-4 with external data, the internet, other applications and language mod… ☆12 · May 14, 2023 · Updated 2 years ago
- Code for the paper "AICrypto: A Comprehensive Benchmark for Evaluating Cryptography Capabilities of Large Language Models" ☆29 · Sep 27, 2025 · Updated 6 months ago
- The goal of this repo is to become a benchmark for pentesting ☆22 · Oct 25, 2024 · Updated last year
- PyC (Pytorch Concepts) is a PyTorch-based library for training concept-based interpretable deep learning models. ☆31 · Mar 21, 2026 · Updated last week
- A Security Analysis of Honeywords ☆16 · Nov 28, 2017 · Updated 8 years ago
- MCP wrapper for Hashcat – automate hash cracking with natural language ☆25 · Jun 5, 2025 · Updated 9 months ago
- ☆32 · Mar 12, 2025 · Updated last year
- This repo contains the code of the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking G… ☆71 · Oct 28, 2025 · Updated 5 months ago
- Automated discovery and exploitation of security vulnerabilities using natural language and LLMs. ☆20 · Feb 27, 2026 · Updated last month
- This project is a deliberately vulnerable environment to learn about LLM-specific risks based on the OWASP Top 10 for LLM Applications. ☆52 · Jan 19, 2026 · Updated 2 months ago
- ☆26 · Sep 25, 2024 · Updated last year
- Our memories are right here. ☆21 · Jan 19, 2022 · Updated 4 years ago
- This is a working copy of the OWASP Project Handbook and is the draft where changes are made before publishing a final version on the OWA… ☆19 · Feb 22, 2017 · Updated 9 years ago
- A bunch of LLaMa model investigations, including recreating generative agents (from the paper Generative Agents: Interactive Simulacra of… ☆23 · May 31, 2023 · Updated 2 years ago
- In-the-wild deepfake detection dataset ☆13 · Mar 5, 2025 · Updated last year
- A framework for password-strength evaluation ☆14 · Sep 26, 2020 · Updated 5 years ago
- [NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents ☆69 · Nov 14, 2025 · Updated 4 months ago
- zentao Getshell ☆10 · Oct 27, 2020 · Updated 5 years ago
- Proof of Concept: to help automate the collection of evidence for SOC 2 audits, etc. ☆11 · May 13, 2024 · Updated last year
- Modeling Password Guessability Using Markov Models ☆58 · Jul 11, 2019 · Updated 6 years ago
- OWASP Web Security Testing Guide RAG system with ChromaDB, MCP for Claude Code ☆20 · Dec 11, 2025 · Updated 3 months ago
- The project serves as a strategic advisory tool, capitalizing on the ZySec series of AI models to amplify the capabilities of security pr… ☆68 · May 19, 2024 · Updated last year
- ☆15 · Jan 16, 2024 · Updated 2 years ago
- Reducing Bias in Modeling Real-world Password Strength via Deep Learning and Dynamic Dictionaries ☆21 · May 7, 2024 · Updated last year
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function… ☆23 · Sep 14, 2025 · Updated 6 months ago
- A collection of resources that showcase the intersection of simulation and LLM-agents. ☆46 · Feb 1, 2026 · Updated last month
- This Repository holds the pcap and Snort rules used for generating the dataset used in my paper: "Deterministic Dendritic Cell Algorithm … ☆20 · Jun 30, 2019 · Updated 6 years ago
- AI "Mafia" is coming! As night falls, 9 ChatGPT AI players each harbor their own sinister motives. Let's see who will have the last laugh.… ☆37 · Jun 19, 2023 · Updated 2 years ago
- Symbolic execution engine for Whitespace. ☆13 · May 30, 2021 · Updated 4 years ago
- Treat XPath expressions as Python objects ☆11 · Mar 31, 2021 · Updated 4 years ago
- ☆14 · Aug 28, 2023 · Updated 2 years ago
- A Python-based tool designed to capture IP addresses and NTLM authentication hashes from remote Windows clients using Telegram (lack of b… ☆27 · Mar 31, 2025 · Updated 11 months ago
- ECStore Pro - Laravel WeChat online store microservice framework ☆15 · Oct 11, 2017 · Updated 8 years ago
- ☆15 · Mar 3, 2026 · Updated 3 weeks ago
- All about LLM-agent security, attacks, and vulnerabilities, and how to carry them out for cybersecurity. ☆48 · Dec 28, 2025 · Updated 3 months ago
- A security-first MCP server empowering AI agents to orchestrate Ghidra, Radare2, and YARA for automated reverse engineering. ☆51 · Mar 13, 2026 · Updated 2 weeks ago
- ☆27 · Apr 3, 2025 · Updated 11 months ago
- [42-b3yond-6ug] This repository hosts BugBuster, our team’s submission to the AI Cyber Challenge Final Competition. ☆30 · Aug 19, 2025 · Updated 7 months ago
- CVE-2020-28243 Local Privilege Escalation Exploit in SaltStack Minion ☆18 · Mar 3, 2021 · Updated 5 years ago