☆127 · Sep 22, 2025 · Updated 6 months ago
Alternatives and similar repositories for NYU_CTF_Bench
Users that are interested in NYU_CTF_Bench are comparing it to the libraries listed below.
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench ☆139 · Oct 25, 2025 · Updated 5 months ago
- ☆66 · Sep 13, 2025 · Updated 6 months ago
- https://arxiv.org/abs/2412.02776 ☆70 · Dec 5, 2024 · Updated last year
- This repo contains the codes of the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking G… ☆71 · Oct 28, 2025 · Updated 4 months ago
- A comprehensive local Linux Privilege-Escalation Benchmark ☆46 · Nov 7, 2025 · Updated 4 months ago
- I am still working on it ☆12 · Apr 30, 2020 · Updated 5 years ago
- ☆15 · Sep 17, 2024 · Updated last year
- The repository of Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning. ☆30 · Sep 8, 2025 · Updated 6 months ago
- Exploit code by DirtyChain ☆14 · Apr 11, 2025 · Updated 11 months ago
- Machine learning on knowledge graphs for context-aware security monitoring (data and model) ☆18 · Mar 11, 2022 · Updated 4 years ago
- This repository is used to provide a reference for CTF dynamic target machine ☆14 · Mar 11, 2023 · Updated 3 years ago
- The goal of this repo is to become a benchmark for pentesting ☆22 · Oct 25, 2024 · Updated last year
- [IEEE T-IFS] AutoPT: How Far Are We from the Fully Automated Web Penetration Testing? ☆32 · Aug 18, 2025 · Updated 7 months ago
- ☆211 · Dec 13, 2025 · Updated 3 months ago
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities ☆179 · Jan 14, 2026 · Updated 2 months ago
- Official Implementation of implicit reference attack ☆11 · Oct 16, 2024 · Updated last year
- CyberBench: A Multi-Task Cyber LLM Benchmark ☆31 · Apr 29, 2025 · Updated 10 months ago
- [USENIX Security 2024] Official Repository of 'KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-… ☆16 · Aug 6, 2025 · Updated 7 months ago
- LLM Agent and Evaluation Framework for Autonomous Penetration Testing ☆300 · Jun 24, 2025 · Updated 9 months ago
- A fast and powerful gadget finder and ROP chain generator. A research prototype for the ropbot paper accepted at NDSS'26. ☆48 · Jan 22, 2026 · Updated 2 months ago