CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities
☆167Jan 14, 2026Updated last month
Alternatives and similar repositories for cve-bench
Users that are interested in cve-bench are comparing it to the libraries listed below
Sorting:
- PentestAgent is a novel LLM-driven penetration testing framework to automate intelligence gathering, vulnerability analysis, and exploita…☆117Dec 20, 2025Updated 2 months ago
- ☆12Nov 30, 2018Updated 7 years ago
- Modelizer - is a framework for learning models from BlackBox systems using Input-Output examples☆22Jul 17, 2025Updated 7 months ago
- Code snippets to reproduce MCP tool poisoning attacks.☆193Apr 10, 2025Updated 10 months ago
- A subset of CTF challenges I have made over the years.☆18Aug 4, 2022Updated 3 years ago
- An autonomous LLM-agent for large-scale, repository-level code auditing☆345Dec 4, 2025Updated 3 months ago
- The repository of Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning.☆29Sep 8, 2025Updated 6 months ago
- A continuously updated collection of CodeLLM papers maintained by PurCL group @ Purdue☆606Jan 14, 2026Updated last month
- ☆65Dec 8, 2025Updated 3 months ago
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- This repo contains the codes for the experiments of the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing".☆13Oct 28, 2025Updated 4 months ago
- Caputre the flag with Large Language Models☆28Aug 5, 2025Updated 7 months ago
- Fuzzing Automatic Differentiation in Deep-Learning Libraries (ICSE'23)☆27Mar 2, 2024Updated 2 years ago
- The notes about programming language theory☆27May 7, 2023Updated 2 years ago
- ☆32May 1, 2025Updated 10 months ago
- ☆52Jul 31, 2025Updated 7 months ago
- Holistic Concolic Execution for Dynamic Web Applications via Symbolic Interpreter Analysis (IEEE S&P 2024)☆13Oct 3, 2024Updated last year
- Autonomous Assumed Breach Penetration-Testing Active Directory Networks☆41Updated this week
- A polyglot static analysis engine for detecting vulnerabilities in scripting languages native extensions based on joern.☆21Sep 1, 2025Updated 6 months ago
- ☆123Sep 22, 2025Updated 5 months ago
- Artifact for ICSE 2023☆50Sep 24, 2022Updated 3 years ago
- Testability Pattern Catalogs for SAST☆32Feb 18, 2025Updated last year
- ☆36Nov 13, 2025Updated 3 months ago
- [42-b3yond-6ug] This repository hosts BugBuster, our team’s submission to the AI Cyber Challenge Final Competition.☆31Aug 19, 2025Updated 6 months ago
- CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift (FSE 2025)☆13May 19, 2025Updated 9 months ago
- Exploit POC for CVE-2024-22026 affecting Ivanti EPMM "MobileIron Core"☆15May 15, 2024Updated last year
- Effective ReDoS Detection by Principled Vulnerability Modeling and Exploit Generation☆14Jul 24, 2025Updated 7 months ago
- 按照会话解包, 然后提取明文txt信息, 让ChatGPT来判断一下是否存在攻击行为☆14Mar 7, 2023Updated 3 years ago
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]☆57Jan 27, 2026Updated last month
- https://arxiv.org/abs/2412.02776☆68Dec 5, 2024Updated last year
- The Z3-Noodler String Solver☆25Updated this week
- CleanVul: Automatic Function-Level Vulnerability Detection in Code Commits Using LLM Heuristics☆20Jan 23, 2026Updated last month
- Protect your PHP project from deserialization attacks! As seen on NDSS 2024☆15Aug 8, 2025Updated 7 months ago
- Proof-of-Concept to evade auditd by tampering via ptrace☆19Aug 3, 2023Updated 2 years ago
- Proof-of-Concept to evade auditd by writing /proc/PID/mem☆24Aug 21, 2023Updated 2 years ago
- The repo of "BugLens"☆35Nov 12, 2025Updated 3 months ago
- This is the replication package of V-SZZ, which has been accepted by ICSE2022☆16Jan 19, 2026Updated last month
- Automated web vulnerability scanning with LLM agents☆451Jun 18, 2025Updated 8 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…☆64May 4, 2025Updated 10 months ago