tuhh-softsec / LLMSecEval
☆55 · Updated last year
Alternatives and similar repositories for LLMSecEval
Users who are interested in LLMSecEval are comparing it to the repositories listed below.
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis… ☆82 · Updated 2 years ago
- ☆126 · Updated last year
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea… ☆63 · Updated 8 months ago
- VulRepair: A T5-Based Automated Software Vulnerability Repair ☆83 · Updated 8 months ago
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai… ☆56 · Updated 9 months ago
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs) ☆91 · Updated 11 months ago
- An implementation of the ACL 2024 Findings paper "Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tu… ☆74 · Updated 2 months ago
- DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection (RAID 2023) https://surrealyz.github.io/… ☆173 · Updated last year
- Repository for PrimeVul Vulnerability Detection Dataset ☆208 · Updated last year
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities ☆136 · Updated this week
- 🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access ☆50 · Updated 7 months ago
- AIBugHunter: A Practical Tool for Predicting, Classifying and Repairing Software Vulnerabilities ☆44 · Updated last year
- A Novel Benchmark evaluating the Deep Capability of Vulnerability Detection with Large Language Models ☆32 · Updated 8 months ago
- 🪐 A Database of Existing Security Vulnerabilities Patches to Enable Evaluation of Techniques (single-commit; multi-language) ☆42 · Updated 9 months ago
- The automated prompt injection framework for LLM-integrated applications. ☆247 · Updated last year
- CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software ☆311 · Updated last year
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025] ☆49 · Updated last month
- An autonomous LLM-agent for large-scale, repository-level code auditing ☆314 · Updated last month
- ☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV)… ☆102 · Updated 2 years ago
- ☆50 · Updated last year
- Vul4J: A Dataset of Reproducible Java Vulnerabilities ☆115 · Updated 4 months ago
- Resources for our ICSE'24 poster: Prompt-Enhanced Software Vulnerability Detection Using ChatGPT. ☆25 · Updated last year
- ☆12 · Updated 3 weeks ago
- The official repository of "GraphSPD: Graph-Based Security Patch Detection with Enriched Code Semantics". The paper will appear in the IE… ☆48 · Updated 2 years ago
- Agent Security Bench (ASB) ☆168 · Updated 2 months ago
- ☆42 · Updated last year
- Open science repo of "Neural Transfer Learning for Repairing Security Vulnerabilities in C Code" https://arxiv.org/pdf/2104.08308 ☆63 · Updated last year
- CS-Eval is a comprehensive evaluation suite for fundamental cybersecurity models or large language models' cybersecurity ability. ☆58 · Updated last year
- For our ISSTA23 paper "How Effective are Neural Networks for Fixing Security Vulnerabilities?" by Yi Wu, Nan Jiang, Hung Viet Pham, Thiba… ☆41 · Updated 2 years ago
- [NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents ☆63 · Updated 2 months ago