NickNameInvalid / LLM_CTFLinks
☆65Updated 5 months ago
Alternatives and similar repositories for LLM_CTF
Users that are interested in LLM_CTF are comparing it to the libraries listed below
Sorting:
- https://arxiv.org/abs/2412.02776☆57Updated 6 months ago
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench☆81Updated 2 months ago
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆56Updated this week
- using ML models for red teaming☆43Updated last year
- General research for Dreadnode☆23Updated last year
- ☆55Updated last month
- Tree of Attacks (TAP) Jailbreaking Implementation☆109Updated last year
- ☆112Updated 2 weeks ago
- A library to produce cybersecurity exploitation routes (exploit flows). Inspired by TensorFlow.☆35Updated last year
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.☆23Updated last year
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on…☆21Updated 2 weeks ago
- A utility to inspect, validate, sign and verify machine learning model files.☆57Updated 4 months ago
- Data Scientists Go To Jupyter☆64Updated 3 months ago
- XBOW Validation Benchmarks☆104Updated last week
- ☆16Updated last year
- ☆41Updated 8 months ago
- ☆37Updated this week
- ChainReactor is a research project that leverages AI planning to discover exploitation chains for privilege escalation on Unix systems. T…☆48Updated 7 months ago
- A collection of prompt injection mitigation techniques.☆23Updated last year
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆73Updated last month
- ☆13Updated last year
- An interactive CLI application for interacting with authenticated Jupyter instances.☆53Updated last month
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities☆58Updated last week
- Common Corpus is used to build coverage-minimized corpus data sets for fuzzing.☆27Updated last year
- A comprehensive local Linux Privilege-Escalation Benchmark☆36Updated last month
- Automatically fuzz Rust projects from scratch☆56Updated last year
- ☆22Updated last year
- This repository contains the pre-joining training materials given to aspiring researchers on the Vulnerability Researcher Development Pro…☆72Updated 3 weeks ago
- A command line tool for extracting machine learning ready data from software binaries powered by Radare2☆70Updated last month
- ☆14Updated 6 months ago