uiuc-kang-lab / cve-bench
CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities
☆33 Updated last week
Alternatives and similar repositories for cve-bench:
Users who are interested in cve-bench are comparing it to the repositories listed below.
- ☆48 Updated last month
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench ☆65 Updated 2 weeks ago
- YuraScanner ☆40 Updated 2 months ago
- An Execution Isolation Architecture for LLM-Based Agentic Systems ☆70 Updated 2 months ago
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs) ☆66 Updated 3 months ago
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing. ☆74 Updated last week
- ☆29 Updated 7 months ago
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions. ☆23 Updated 11 months ago
- A framework for identifying vulnerabilities in VS Code extensions ☆17 Updated 9 months ago
- A repository of papers on LLMs for vulnerability detection ☆41 Updated 3 weeks ago
- ☆26 Updated last year
- Code snippets to reproduce MCP tool poisoning attacks. ☆93 Updated 2 weeks ago
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai… ☆44 Updated last month
- This is the most comprehensive prompt hacking course available, which records our progress on a prompt engineering and prompt hacking cour… ☆51 Updated last week
- Testability Pattern Catalogs for SAST ☆30 Updated 2 months ago
- ☆59 Updated 9 months ago
- VulZoo: A Comprehensive Vulnerability Intelligence Dataset (ASE 2024 Demo) ☆41 Updated last month
- EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection ☆14 Updated 3 months ago
- ☆33 Updated 6 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea… ☆55 Updated 5 months ago
- A neurosymbolic framework for vulnerability detection in code ☆49 Updated this week
- Artifact for ICSE 2023 ☆49 Updated 2 years ago
- Agent Security Bench (ASB) ☆76 Updated 3 weeks ago
- This repo contains the code of the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking G… ☆26 Updated 6 months ago
- The automated prompt injection framework for LLM-integrated applications. ☆198 Updated 7 months ago
- ☆64 Updated 3 months ago
- AIBugHunter: A Practical Tool for Predicting, Classifying and Repairing Software Vulnerabilities ☆40 Updated last year
- ☆38 Updated 6 months ago
- A library to produce cybersecurity exploitation routes (exploit flows). Inspired by TensorFlow. ☆35 Updated last year
- Future-proof vulnerability detection benchmark, based on CVEs in open-source repos ☆52 Updated last week