CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.
☆220Apr 13, 2026Updated last week
Alternatives and similar repositories for cybergym
Users that are interested in cybergym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Sep 11, 2025Updated 7 months ago
- ☆11May 14, 2024Updated last year
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆42Jul 21, 2025Updated 8 months ago
- Anonymous repo for USCHunt, a tool for detecting and classifying upgradeable proxy smart contracts, built atop Slither☆22Apr 2, 2023Updated 3 years ago
- Training Language Model Agents to Find Vulnerabilities with CTF-Dojo☆42Jan 10, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Sep 3, 2025Updated 7 months ago
- Parsing-based Analyzer☆75Jun 8, 2025Updated 10 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…☆64May 4, 2025Updated 11 months ago
- A Unified Platform for Evaluating SAST Tools for Android☆20Mar 30, 2025Updated last year
- Security Vulnerability Repair via Concolic Execution and Code Mutations☆19Sep 12, 2024Updated last year
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities☆192Jan 14, 2026Updated 3 months ago
- Simultaneous evaluation on both functionality and security of LLM-generated code.☆34Mar 6, 2026Updated last month
- Security Harness Engineering for Robust Program Analysis☆126Jan 23, 2026Updated 2 months ago
- ☆29Jan 15, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICSE'24 Industry Challenge Track] "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".☆100Nov 24, 2024Updated last year
- A manually vetted dataset for security vulnerability detection in Java projects☆94Aug 12, 2025Updated 8 months ago
- ☆128Jul 14, 2024Updated last year
- List of Papers on Attack and Defense (AD) in AI Models☆27Mar 18, 2022Updated 4 years ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 7 months ago
- 🥇 Amazon Nova AI Challenge Winner - ASTRA emerged victorious as the top attacking team in Amazon's global AI safety competition, defeati…☆69Aug 14, 2025Updated 8 months ago
- A SAST skill that gives AI coding agents structured vulnerability detection across 34 vulnerability classes.☆215Apr 7, 2026Updated last week
- tool of llm-based indirect-call analyzer☆31Feb 18, 2025Updated last year
- Cyber-Zero: Training Cybersecurity Agents Without Runtime☆83Feb 13, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Resources for our ICSE'24 poster: Prompt-Enhanced Software Vulnerability Detection Using ChatGPT.☆25May 8, 2024Updated last year
- ☆25May 28, 2025Updated 10 months ago
- A Reproducible Benchmark of Recent Java Bugs☆47Aug 19, 2025Updated 8 months ago
- An example vulnerable app that integrates an LLM☆26Apr 5, 2024Updated 2 years ago
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing.☆102Feb 10, 2026Updated 2 months ago
- XNU Image Fuzzer - iOS App for Fuzzing Images with Objective-C Code covering 15 CGCreateBitmap & CGColorSpace Functions working with Raw …☆40Apr 13, 2026Updated last week
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models"☆26Aug 20, 2025Updated 8 months ago
- MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset☆145Jan 12, 2025Updated last year
- ☆51Jan 14, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Security scanner for AI agents, MCP servers and agent skills.☆2,112Apr 13, 2026Updated last week
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis…☆88Nov 4, 2023Updated 2 years ago
- ☆41Jan 13, 2023Updated 3 years ago
- ☆29Aug 31, 2025Updated 7 months ago
- docker env for ios research on a mac host☆27Jun 12, 2025Updated 10 months ago
- [NeurIPS 2024] Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling☆35Nov 8, 2024Updated last year
- List of awesome starred repositories☆16Updated this week