PatchEval: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities
☆202Mar 11, 2026Updated last month
Alternatives and similar repositories for PatchEval
Users that are interested in PatchEval are comparing it to the libraries listed below.
- FLOWMATRIX: GPU-Assisted Information-Flow Analysis through Matrix-Based Representation, USENIX Security'22☆28Apr 17, 2023Updated 3 years ago
- A standalone execution trace library built on DynamoRIO.☆23Jul 4, 2022Updated 3 years ago
- Bilingual (Chinese-English) resume template in LaTeX.☆20Jun 4, 2024Updated last year
- SecCodeBench is a benchmark suite focusing on evaluating the security of code generated by large language models (LLMs).☆111Mar 13, 2026Updated last month
- The xAST evaluation benchmark, which makes security tools no longer a "black box".☆473Apr 14, 2026Updated 2 weeks ago
- A practical fuzzing tool for SMT solvers☆11Nov 26, 2025Updated 5 months ago
- GAINS: Getting stArted wIth biNary analysiS☆32Feb 23, 2022Updated 4 years ago
- ☆28Sep 15, 2024Updated last year
- ☆13Oct 14, 2025Updated 6 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…☆65May 4, 2025Updated last year
- ☆15Mar 17, 2025Updated last year
- Learning graph-based code representations for source-level functional similarity detection. ICSE'23☆64Mar 27, 2023Updated 3 years ago
- Golang eBPF RASP☆10Jul 19, 2023Updated 2 years ago
- Simultaneous evaluation on both functionality and security of LLM-generated code.☆36Mar 6, 2026Updated last month
- IoTvulCode: AI-enabled vulnerability detection in software products designed for IoT applications☆18May 10, 2024Updated last year
- ☆21Aug 25, 2024Updated last year
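- View and format recently copied JSON from the menu bar☆12Feb 18, 2021Updated 5 years ago
- Huazhong University of Science and Technology network security course project: a stateful inspection firewall for Linux☆11Oct 17, 2022Updated 3 years ago
- Sichuan University course-grabbing system (now integrated into Draven-System)☆14Mar 4, 2020Updated 6 years ago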
- This is the source code for the 60min django CMS demo via https://www.django-cms.org/en/test-django-cms/☆12Mar 31, 2022Updated 4 years ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆333Dec 18, 2025Updated 4 months ago
- Add a user while bypassing interception by security products such as 360 and Huorong☆15Feb 15, 2022Updated 4 years ago
- A V8 Sandbox Escape Technique.☆21Feb 8, 2025Updated last year
- Code for the "Predictive Context-sensitive Fuzzing" NDSS'24 paper☆30Feb 29, 2024Updated 2 years ago
- Holistic Concolic Execution for Dynamic Web Applications via Symbolic Interpreter Analysis (IEEE S&P 2024)☆16Oct 3, 2024Updated last year
- KernJC: Automated Vulnerable Environment Generation for Linux Kernel Vulnerabilities | 🏆 Best Practical Paper Award of RAID 2024☆84Oct 15, 2025Updated 6 months ago
- Kubernetes rootkit☆34Dec 18, 2023Updated 2 years ago
- ☆12Feb 20, 2021Updated 5 years ago
- ☆94Mar 6, 2026Updated last month
- A neurosymbolic framework for vulnerability detection in code☆360Apr 19, 2026Updated 2 weeks ago
- MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP cl…☆32Sep 1, 2025Updated 8 months ago
- ☆10Jan 21, 2022Updated 4 years ago
- A set of CodeQL/Joern queries to find vulnerabilities☆67May 22, 2021Updated 4 years ago
- LLM-Powered Code Security Scanning☆21Apr 2, 2025Updated last year
- Leveraging revolutionary Agent and Phi-2 technology, Graph Detective uncovers concealed linkages and discerns patterns, enabling pinpoint…☆10Apr 21, 2024Updated 2 years ago
- A portable framework to map DFG (dataflow graph, representing an application) on spatial accelerators.☆41Oct 31, 2022Updated 3 years ago
- [USENIX Security '25] My ZIP isn’t your ZIP: Identifying and Exploiting Semantic Gaps Between ZIP Parsers☆38Mar 20, 2026Updated last month
- Bypass for the hardening against usage of tagWnd as a kernel read/write primitive☆32Mar 22, 2017Updated 9 years ago
- Implementation of QFuzz.☆17Dec 3, 2021Updated 4 years ago