NickNameInvalid / LLM_CTF
☆65 · Updated 2 months ago
Alternatives and similar repositories for LLM_CTF
Users that are interested in LLM_CTF are comparing it to the libraries listed below
- https://arxiv.org/abs/2412.02776 ☆66 · Updated 11 months ago
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on… ☆90 · Updated last month
- Tree of Attacks (TAP) Jailbreaking Implementation ☆115 · Updated last year
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos ☆60 · Updated this week
- ☆121 · Updated 2 months ago
- using ML models for red teaming ☆44 · Updated 2 years ago
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench ☆102 · Updated 3 weeks ago
- ☆96 · Updated last month
- Data Scientists Go To Jupyter ☆67 · Updated 8 months ago
- A utility to inspect, validate, sign and verify machine learning model files. ☆60 · Updated 9 months ago
- Example agents for the Dreadnode platform ☆19 · Updated 3 weeks ago
- Automatically fuzz Rust projects from scratch ☆58 · Updated 4 months ago
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities ☆113 · Updated last week
- ☆81 · Updated 3 months ago
- ☆168 · Updated 5 months ago
- Payloads for Attacking Large Language Models ☆104 · Updated 5 months ago
- VulZoo: A Comprehensive Vulnerability Intelligence Dataset | ASE 2024 Demo ☆66 · Updated 7 months ago
- General research for Dreadnode ☆25 · Updated last year
- ☆28 · Updated 2 years ago
- SAST + LLM Interprocedural Context Extractor ☆146 · Updated 3 weeks ago
- ChainReactor is a research project that leverages AI planning to discover exploitation chains for privilege escalation on Unix systems. T… ☆52 · Updated last year
- A command line tool for extracting machine learning ready data from software binaries powered by Radare2 ☆72 · Updated 6 months ago
- We present MAPTA, a multi-agent system for autonomous web application security assessment that combines large language model orchestratio… ☆75 · Updated 2 months ago
- AutoCorpus is a tool backed by a large language model (LLM) for automatically generating corpus files for fuzzing. ☆72 · Updated last year
- Research browsers ☆43 · Updated 2 weeks ago
- ☆25 · Updated last year
- ☆63 · Updated last week
- A very simple open source implementation of Google's Project Naptime ☆173 · Updated 7 months ago
- Arxiv + Notion Sync ☆20 · Updated 6 months ago
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing. ☆91 · Updated 7 months ago