PatchEval: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities
☆211 · Mar 11, 2026 · Updated 3 months ago
Alternatives and similar repositories for PatchEval
Users who are interested in PatchEval are comparing it to the repositories listed below.
- FLOWMATRIX: GPU-Assisted Information-Flow Analysis through Matrix-Based Representation, USENIX Security'22 ☆28 · Apr 17, 2023 · Updated 3 years ago
- A standalone execution trace library built on DynamoRIO. ☆23 · Jul 4, 2022 · Updated 3 years ago
- Source code of AsiaCCS'22 paper - RecIPE: Revisiting the Evaluation of Memory Error Defenses ☆14 · Sep 19, 2023 · Updated 2 years ago
- Bilingual (Chinese-English) resume template in LaTeX. ☆20 · Jun 4, 2024 · Updated 2 years ago
- SecCodeBench is a benchmark suite focusing on evaluating the security of code generated by large language models (LLMs). ☆125 · Jun 10, 2026 · Updated 2 weeks ago
- A model-based API Fuzzer for SMT Solvers. ☆16 · May 20, 2026 · Updated last month
- The xAST evaluation benchmark makes security tools no longer a "black box". ☆481 · May 21, 2026 · Updated last month
- GAINS: Getting stArted wIth biNary analysiS ☆32 · Feb 23, 2022 · Updated 4 years ago
- ☆13 · Oct 14, 2025 · Updated 8 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea… ☆65 · May 4, 2025 · Updated last year
- ☆15 · Mar 17, 2025 · Updated last year
- Simultaneous evaluation on both functionality and security of LLM-generated code. ☆41 · Jun 18, 2026 · Updated last week
- Java pointer analysis ☆13 · Jul 23, 2019 · Updated 6 years ago
- Sichuan University course-grabbing system (now integrated into Draven-System) ☆15 · Mar 4, 2020 · Updated 6 years ago
- Huazhong University of Science and Technology network security course project: a stateful inspection firewall for Linux ☆11 · Oct 17, 2022 · Updated 3 years ago
- [2023 TDSC] Pre-trained Model-based Automated Software Vulnerability Repair: How Far are We? ☆25 · Jun 2, 2023 · Updated 3 years ago
- ☆12 · Sep 27, 2018 · Updated 7 years ago
- Tencent Cloud Hackathon - Intelligent Penetration Challenge, first edition, Top 9 ☆505 · Apr 25, 2026 · Updated 2 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆341 · Dec 18, 2025 · Updated 6 months ago
- Add a user while bypassing interception by security products such as 360 and Huorong ☆15 · Feb 15, 2022 · Updated 4 years ago
- A V8 Sandbox Escape Technique. ☆21 · Feb 8, 2025 · Updated last year
- To detect logic bugs in graph database engines by mutating graph query patterns. ICSE'24. ☆36 · Jan 24, 2024 · Updated 2 years ago
- Code for the "Predictive Context-sensitive Fuzzing" NDSS'24 paper ☆30 · Feb 29, 2024 · Updated 2 years ago
- AIxCC: automated vulnerability repair via LLMs, search, and static analysis ☆13 · Jul 16, 2024 · Updated last year
- KernJC: Automated Vulnerable Environment Generation for Linux Kernel Vulnerabilities | 🏆 Best Practical Paper Award of RAID 2024 ☆88 · Oct 15, 2025 · Updated 8 months ago
- Holistic Concolic Execution for Dynamic Web Applications via Symbolic Interpreter Analysis (IEEE S&P 2024) ☆17 · Oct 3, 2024 · Updated last year
- Building the strongest AI security documentation ☆121 · Mar 14, 2026 · Updated 3 months ago
- ☆12 · Feb 20, 2021 · Updated 5 years ago
- ☆99 · Jun 15, 2026 · Updated last week
- MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP cl… ☆34 · Sep 1, 2025 · Updated 9 months ago
- A neurosymbolic framework for vulnerability detection in code ☆399 · Apr 19, 2026 · Updated 2 months ago
- Slime is a vulnerability scanner that combines many excellent security tools; it focuses on composing those tools rather than implementing any stage of the scanning process itself. ☆17 · Sep 9, 2022 · Updated 3 years ago
- LLM-Powered Code Security Scanning ☆22 · Apr 2, 2025 · Updated last year
- Leveraging revolutionary Agent and Phi-2 technology, Graph Detective uncovers concealed linkages and discerns patterns, enabling pinpoint… ☆10 · Apr 21, 2024 · Updated 2 years ago
- ☆14 · Sep 4, 2025 · Updated 9 months ago
- A portable framework to map DFG (dataflow graph, representing an application) on spatial accelerators. ☆41 · Oct 31, 2022 · Updated 3 years ago
- Bypass for the hardening against usage of tagWnd as a kernel read/write primitive ☆32 · Mar 22, 2017 · Updated 9 years ago
- [USENIX Security '25] My ZIP isn’t your ZIP: Identifying and Exploiting Semantic Gaps Between ZIP Parsers ☆39 · Mar 20, 2026 · Updated 3 months ago
- Implementation of QFuzz. ☆17 · Dec 3, 2021 · Updated 4 years ago