CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.
☆187Feb 23, 2026Updated last month
Alternatives and similar repositories for cybergym
Users that are interested in cybergym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Sep 11, 2025Updated 6 months ago
- ☆94Mar 6, 2026Updated 3 weeks ago
- ☆11May 14, 2024Updated last year
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆43Jul 21, 2025Updated 8 months ago
- Anonymous repo for USCHunt, a tool for detecting and classifying upgradeable proxy smart contracts, built atop Slither☆22Apr 2, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆25Sep 3, 2025Updated 6 months ago
- Parsing-based Analyzer☆75Jun 8, 2025Updated 9 months ago
- A Unified Platform for Evaluating SAST Tools for Android☆20Mar 30, 2025Updated last year
- Security Vulnerability Repair via Concolic Execution and Code Mutations☆19Sep 12, 2024Updated last year
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities☆183Jan 14, 2026Updated 2 months ago
- Security Harness Engineering for Robust Program Analysis☆121Jan 23, 2026Updated 2 months ago
- ☆24Jan 15, 2026Updated 2 months ago
- [ICSE'24 Industry Challenge Track] "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".☆95Nov 24, 2024Updated last year
- A manually vetted dataset for security vulnerability detection in Java projects☆94Aug 12, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆128Jul 14, 2024Updated last year
- List of Papers on Attack and Defense (AD) in AI Models☆27Mar 18, 2022Updated 4 years ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 7 months ago
- How effective are LLMs in identifying and exploiting security vulnerabilities?☆69Feb 28, 2025Updated last year
- 🥇 Amazon Nova AI Challenge Winner - ASTRA emerged victorious as the top attacking team in Amazon's global AI safety competition, defeati…☆70Aug 14, 2025Updated 7 months ago
- tool of llm-based indirect-call analyzer☆30Feb 18, 2025Updated last year
- Cyber-Zero: Training Cybersecurity Agents Without Runtime☆79Feb 13, 2026Updated last month
- Resources for our ICSE'24 poster: Prompt-Enhanced Software Vulnerability Detection Using ChatGPT.☆25May 8, 2024Updated last year
- A Reproducible Benchmark of Recent Java Bugs☆47Aug 19, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An example vulnerable app that integrates an LLM☆26Apr 5, 2024Updated last year
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing.☆102Feb 10, 2026Updated last month
- XNU Image Fuzzer - iOS App for Fuzzing Images with Objective-C Code covering 15 CGCreateBitmap & CGColorSpace Functions working with Raw …☆40Mar 24, 2026Updated last week
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models"☆26Aug 20, 2025Updated 7 months ago
- MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset☆142Jan 12, 2025Updated last year
- ☆49Jan 14, 2025Updated last year
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis…☆86Nov 4, 2023Updated 2 years ago
- Multi-vault, user-configured cloud hosted password manager☆15Jun 22, 2025Updated 9 months ago
- ☆41Jan 13, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆29Aug 31, 2025Updated 7 months ago
- Vul4J: A Dataset of Reproducible Java Vulnerabilities☆125Updated this week
- docker env for ios research on a mac host☆28Jun 12, 2025Updated 9 months ago
- List of awesome starred repositories☆15Updated this week
- ☆27May 27, 2023Updated 2 years ago
- The official repository of "GraphSPD: Graph-Based Security Patch Detection with Enriched Code Semantics". The paper will appear in the IE…☆49Aug 9, 2023Updated 2 years ago
- Damn Vulnerable Browser Extension (DVBE), previously named as Badly Coded Browser Extension (BCBE), is an open-source vulnerable Chrome E…☆33Mar 4, 2025Updated last year