NYU-LLM-CTF / nyuctf_agents
The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench
☆48Updated last week
Alternatives and similar repositories for nyuctf_agents:
Users that are interested in nyuctf_agents are comparing it to the libraries listed below
- ☆34Updated last week
- ☆64Updated 3 weeks ago
- ☆75Updated 2 months ago
- A comprehensive local Linux Privilege-Escalation Benchmark☆28Updated 2 months ago
- ☆26Updated last year
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆46Updated this week
- An Execution Isolation Architecture for LLM-Based Agentic Systems☆62Updated 2 weeks ago
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing.☆63Updated last month
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)☆32Updated 3 weeks ago
- VulZoo: A Comprehensive Vulnerability Intelligence Dataset (ASE 2024 Demo)☆30Updated 3 months ago
- Challenge Problem #1 - Linux Kernel (NOTE: This code does not reflect the active state of what will be used at competition time, please r…☆52Updated 10 months ago
- A library to produce cybersecurity exploitation routes (exploit flows). Inspired by TensorFlow.☆33Updated last year
- https://arxiv.org/abs/2412.02776☆47Updated 2 months ago
- A collection of prompt injection mitigation techniques.☆20Updated last year
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆108Updated 11 months ago
- 🪐 A Database of Existing Security Vulnerabilities Patches to Enable Evaluation of Techniques (single-commit; multi-language)☆37Updated 2 years ago
- ☆28Updated 5 months ago
- The source code (including datasets) of V1SCAN (USENIX Security 2023; will be uploaded).☆41Updated last year
- ☆51Updated 7 months ago
- AutoCorpus is a tool backed by a large language model (LLM) for automatically generating corpus files for fuzzing.☆54Updated 9 months ago
- ☆34Updated 2 months ago
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.☆22Updated 9 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…☆46Updated 3 months ago
- A command line tool for extracting machine learning ready data from software binaries powered by Radare2☆61Updated 2 months ago
- ICSE'23 - CoFuzz: Coordinated hybrid fuzzing framework with advanced coordination mode☆45Updated last year
- ☆24Updated 4 months ago
- This is a dataset intended to train a LLM model for a completely CVE focused input and output.☆49Updated 2 months ago
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.☆58Updated 10 months ago
- Security Vulnerability Repair via Concolic Execution and Code Mutations☆18Updated 5 months ago
- ☆34Updated 4 months ago