NickNameInvalid / LLM_CTF
☆66Updated 4 months ago
Alternatives and similar repositories for LLM_CTF
Users that are interested in LLM_CTF are comparing it to the libraries listed below
- https://arxiv.org/abs/2412.02776☆67Updated last year
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆63Updated last week
- Tree of Attacks (TAP) Jailbreaking Implementation☆117Updated 2 years ago
- General research for Dreadnode☆27Updated last year
- using ML models for red teaming☆45Updated 2 years ago
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench☆122Updated 3 months ago
- ☆117Updated 4 months ago
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on…☆108Updated 3 weeks ago
- ☆131Updated 5 months ago
- ☆187Updated last month
- A library to produce cybersecurity exploitation routes (exploit flows). Inspired by TensorFlow.☆37Updated 2 years ago
- Arxiv + Notion Sync☆20Updated 8 months ago
- Automatically fuzz Rust projects from scratch☆59Updated 7 months ago
- Example agents for the Dreadnode platform☆22Updated last month
- Data Scientists Go To Jupyter☆68Updated 11 months ago
- A command line tool for extracting machine learning ready data from software binaries powered by Radare2☆73Updated 9 months ago
- A very simple open source implementation of Google's Project Naptime☆184Updated 10 months ago
- ☆29Updated 2 years ago
- A comprehensive local Linux Privilege-Escalation Benchmark☆46Updated 3 months ago
- ☆25Updated 2 years ago
- A utility to inspect, validate, sign and verify machine learning model files.☆65Updated last year
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆112Updated last year
- ☆190Updated last month
- A collection of prompt injection mitigation techniques.☆27Updated 2 years ago
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities☆146Updated 3 weeks ago
- Cyber-Zero: Training Cybersecurity Agents Without Runtime☆69Updated last week
- AutoCorpus is a tool backed by a large language model (LLM) for automatically generating corpus files for fuzzing.☆74Updated last year
- Payloads for Attacking Large Language Models☆119Updated 3 weeks ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10Updated 3 years ago
- ChainReactor is a research project that leverages AI planning to discover exploitation chains for privilege escalation on Unix systems. T…☆58Updated last year