sunblaze-ucb / cybergymView external linksLinks
CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.
☆116Updated this week
Alternatives and similar repositories for cybergym
Users that are interested in cybergym are comparing it to the libraries listed below
Sorting:
- ☆11May 14, 2024Updated last year
- Source code for LLMxCPG paper☆110Dec 22, 2025Updated last month
- ☆27Sep 11, 2025Updated 5 months ago
- Security Vulnerability Repair via Concolic Execution and Code Mutations☆19Sep 12, 2024Updated last year
- XNU Image Fuzzer - iOS App for Fuzzing Images with Objective-C Code covering 12 CGCreateBitmap & CGColorSpace Functions working with Raw …☆39Feb 4, 2026Updated last week
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 5 months ago
- docker env for ios research on a mac host☆27Jun 12, 2025Updated 8 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…☆64May 4, 2025Updated 9 months ago
- An example vulnerable app that integrates an LLM☆26Apr 5, 2024Updated last year
- Parsing-based Analyzer☆70Jun 8, 2025Updated 8 months ago
- ☆22May 28, 2025Updated 8 months ago
- ☆91Oct 23, 2025Updated 3 months ago
- [ICSE'24 Industry Challenge Track] "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".☆92Nov 24, 2024Updated last year
- ☆126Jul 14, 2024Updated last year
- Run OpenDevin inside Docker☆24Jul 22, 2025Updated 6 months ago
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models"☆24Aug 20, 2025Updated 5 months ago
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing.☆100Updated this week
- Official repo for the NCR Crypto Meetup☆17Jun 1, 2022Updated 3 years ago
- Resources for our ICSE'24 poster: Prompt-Enhanced Software Vulnerability Detection Using ChatGPT.☆25May 8, 2024Updated last year
- ☆27Apr 28, 2023Updated 2 years ago
- WTF Snapshot fuzzing of macOS targets☆99May 31, 2024Updated last year
- SDK for building SecDim Play challenges, an open training game for AppSec, DevSecOps, CloudSec, etc.☆30Aug 7, 2025Updated 6 months ago
- 🥇 Amazon Nova AI Challenge Winner - ASTRA emerged victorious as the top attacking team in Amazon's global AI safety competition, defeati…☆70Aug 14, 2025Updated 6 months ago
- ☆27Oct 6, 2024Updated last year
- List of Papers on Attack and Defense (AD) in AI Models☆26Mar 18, 2022Updated 3 years ago
- High-Efficiency eXpanded Coverage for Improved Testing of Executables☆25Jul 7, 2022Updated 3 years ago
- ☆29Apr 7, 2023Updated 2 years ago
- ☆25Feb 6, 2024Updated 2 years ago
- This tool allows local LLM usage that can automate tasks without human interventention. The agent can call itself recursively and work on…☆20May 5, 2025Updated 9 months ago
- Security Harness Engineering for Robust Program Analysis☆111Jan 23, 2026Updated 3 weeks ago
- MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset☆136Jan 12, 2025Updated last year
- A toolkit to assess data privacy in LLMs (under development)☆67Jan 2, 2025Updated last year
- HardsHeap: A Universal and Extensible Framework for Evaluating Secure Allocators☆37Jan 14, 2022Updated 4 years ago
- Public Source code Release of Theori's AIxCC AFC Submission☆231Aug 5, 2025Updated 6 months ago
- A manually vetted dataset for security vulnerability detection in Java projects☆90Aug 12, 2025Updated 6 months ago
- B-Spline Density Estimation Library - nonparametric density estimation using B-Spline density estimator from univariate sample.☆15Aug 22, 2021Updated 4 years ago
- Auditing agents for fine-tuning safety☆18Oct 21, 2025Updated 3 months ago
- iBoot-1145.3 Image3/heap stack RE (+unholy tools)☆84Feb 10, 2024Updated 2 years ago
- Plugin for loading MachO kernelcache and dSYM files to Binary Ninja☆40Mar 23, 2025Updated 10 months ago