uiuc-kang-lab / cve-benchLinks

CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities

☆69

Alternatives and similar repositories for cve-bench

Users that are interested in cve-bench are comparing it to the libraries listed below

Sorting:

PurCL / RepoAudit
An autonomous LLM-agent for large-scale, repository-level code auditing
☆186Updated 2 weeks ago
NYU-LLM-CTF / nyuctf_agents
The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench
☆89Updated last week
NYU-LLM-CTF / NYU_CTF_Bench
☆63Updated 2 months ago
sunblaze-ucb / cybergym
CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on…
☆49Updated last week
NUS-Curiosity / VulZoo
VulZoo: A Comprehensive Vulnerability Intelligence Dataset | ASE 2024 Demo
☆57Updated 4 months ago
ShenaoW / awesome-llm-supply-chain-security
A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)
☆81Updated 6 months ago
KHenryAegis / VulnBot
The repository of VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework.
☆86Updated 3 months ago
pixelindigo / yurascanner
YuraScanner
☆47Updated 5 months ago
iris-sast / iris
A neurosymbolic framework for vulnerability detection in code
☆188Updated this week
llm-platform-security / SecGPT
An Execution Isolation Architecture for LLM-Based Agentic Systems
☆86Updated 6 months ago
NASP-THU / ProphetFuzz
[CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing.
☆85Updated 3 months ago
ai4cloudops / SecLLMHolmes
SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…
☆57Updated 3 months ago
mnns / LLMFuzzer
🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed …
☆303Updated last year
testable-eu / sast-testability-patterns
Testability Pattern Catalogs for SAST
☆31Updated 5 months ago
LLMSecurity / HouYi
The automated prompt injection framework for LLM-integrated applications.
☆220Updated 10 months ago
pasquini-dario / LLMmap
☆49Updated last week
tuhh-softsec / LLMSecEval
☆47Updated 10 months ago
timothee-chauvin / eyeballvul
future-proof vulnerability detection benchmark, based on CVEs in open-source repos
☆59Updated this week
xbow-engineering / validation-benchmarks
XBOW Validation Benchmarks
☆200Updated last month
PurCL / LLMSCAN
Parsing-based Analyzer
☆44Updated last month
invariantlabs-ai / mcp-injection-experiments
Code snippets to reproduce MCP tool poisoning attacks.
☆164Updated 3 months ago
awsm-research / AIBugHunter
AIBugHunter: A Practical Tool for Predicting, Classifying and Repairing Software Vulnerabilities
☆43Updated last year
GitHubSecurityLab / codeql-zero-to-hero
CodeQL zero to hero blog post series challenges
☆131Updated last month
Song-Li / ODGen
ODGen is a JavaScript Static Analysis tool to detect multiple types of vulnerabilities in Node.js packages.
☆154Updated last year
OwenSanzas / LLM-For-Software-Security
Hey folks, this is a repository for papers on LLM for Vuln. Detection area
☆55Updated 4 months ago
huhusmang / Awesome-LLMs-for-Vulnerability-Detection
Awesome Large Language Models for Vulnerability Detection
☆207Updated this week
SoheilKhodayari / JAW
JAW: A Graph-based Security Analysis Framework for Client-side JavaScript
☆111Updated 7 months ago
s3c2 / UntrustIDE
A framework for identifying vulnerabilities in VS Code extensions
☆18Updated last year
KDEGroup / LLMVulnerabilityDetection
Resources for our ICSE'24 poster: Prompt-Enhanced Software Vulnerability Detection Using ChatGPT.
☆24Updated last year
Icyrockton / MegaVul
MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset
☆111Updated 6 months ago