bytedance/PatchEval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bytedance/PatchEval)

bytedance / PatchEval

PatchEval: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities

☆212

Alternatives and similar repositories for PatchEval

Users that are interested in PatchEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ljiahao / TeLL
View on GitHub
TeLL: Log Level Suggestions via Modeling Multi-Level Code Block Information, ISSTA'22
☆14Jul 14, 2022Updated 4 years ago
mimicji / FlowMatrix
View on GitHub
FLOWMATRIX: GPU-Assisted Information-Flow Analysis through Matrix-Based Representation, USENIX Security'22
☆28Apr 17, 2023Updated 3 years ago
melynx / peekaboo
View on GitHub
An standalone execution trace library built on DynamoRIO.
☆23Jul 4, 2022Updated 4 years ago
CGCL-codes / MavenEcoSysResearch
View on GitHub
☆45Sep 8, 2023Updated 2 years ago
mimicji / GAINS
View on GitHub
GAINS: Getting stArted wIth biNary analysiS
☆32Feb 23, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
livecvebench / LiveCVEBench-Preview
View on GitHub
☆23Mar 27, 2026Updated 3 months ago
CGCL-codes / PathEval
View on GitHub
This is an evaluation set for the problem of directed/targeted test input generation. We use it to benchmark the ability of Large Languag…
☆34Mar 11, 2025Updated last year
mimicji / Bilingual-Resume-Template
View on GitHub
Bilingual Resume Template in Latex. 中英双语Latex简历模板
☆20Jun 4, 2024Updated 2 years ago
livecvebench / CVE-Factory
View on GitHub
CVE-Factory
☆146Mar 27, 2026Updated 3 months ago
CGCL-codes / HistFuzz
View on GitHub
A practical fuzzing tool for SMT solvers
☆11Nov 26, 2025Updated 7 months ago
YuanchengJiang / recipe-benchmark
View on GitHub
Source code of AsiaCCS'22 paper - RecIPE: Revisiting the Evaluation of Memory Error Defenses
☆14Sep 19, 2023Updated 2 years ago
Icegrave0391 / Palantir
View on GitHub
PalanTír: Optimizing Attack Provenance with Hardware-enhanced System Observability, ACM CCS'22
☆25Nov 11, 2024Updated last year
alipay / ant-application-security-testing-benchmark
View on GitHub
xAST评价体系，让安全工具不再“黑盒”. The xAST evaluation benchmark makes security tools no longer a "black box".
☆485May 21, 2026Updated 2 months ago
uiuc-kang-lab / cve-bench
View on GitHub
CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities
☆254Jan 14, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PurCL / ASE
View on GitHub
A continuously updated collection of papers on agentic SE
☆631Updated this week
ise-uiuc / KNighter
View on GitHub
[SOSP'25] Automatic checker synthesis for system-level static analysis
☆181Oct 26, 2025Updated 8 months ago
YuanchengJiang / GraphGenie
View on GitHub
To detect logic bugs in graph database engines by mutating graph query patterns. ICSE'24.
☆37Jan 24, 2024Updated 2 years ago
alibaba / sec-code-bench
View on GitHub
SecCodeBench is a benchmark suite focusing on evaluating the security of code generated by large language models (LLMs).
☆130Jun 10, 2026Updated last month
CGCL-codes / VulLLM
View on GitHub
An implementation of the ACL 2024 Findings paper "Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tu…
☆77Oct 29, 2025Updated 8 months ago
SEC-bench / SEC-bench
View on GitHub
Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]
☆87Jan 27, 2026Updated 5 months ago
Snakinya / MCPCorpus
View on GitHub
MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP cl…
☆34Sep 1, 2025Updated 10 months ago
jun-zeng / Tailor
View on GitHub
Learning graph-based code representations for source-level functional similarity detection. ICSE'23
☆63Mar 27, 2023Updated 3 years ago
ConcoLLMic / ConcoLLMic
View on GitHub
ConcoLLMic: the first language- and theory-agonistic concolic execution engine via LLM agents
☆147May 25, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
seclab-ucr / BugLens
View on GitHub
The repo of "BugLens"
☆42Jul 4, 2026Updated 2 weeks ago
CGCL-codes / VFFinder
View on GitHub
☆13Oct 14, 2025Updated 9 months ago
Feng-Jay / GiantRepair
View on GitHub
Artifact for TOSEM Submission: GiantRepair
☆12Jun 26, 2024Updated 2 years ago
ucsb-mlsec / SeCodePLT
View on GitHub
☆15Sep 24, 2025Updated 9 months ago
PurCL / RepoAudit
View on GitHub
An autonomous LLM-agent for large-scale, repository-level code auditing
☆422Mar 12, 2026Updated 4 months ago
uw-pluverse / pluverse-latex-style-guide
View on GitHub
☆26Jun 4, 2026Updated last month
VMnK-Run / MARVEL
View on GitHub
[ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training
☆11Sep 13, 2024Updated last year
ecolab-nus / lisa
View on GitHub
A portable framework to map DFG (dataflow graph, representing an application) on spatial accelerators.
☆41Oct 31, 2022Updated 3 years ago
cla7aye15I4nd / PatchAgent
View on GitHub
[USENIX Security 25] PatchAgent is a LLM-based practical program repair agent that mimics human expertise.
☆128Feb 25, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
iris-sast / cwe-bench-java
View on GitHub
A manually vetted dataset for security vulnerability detection in Java projects
☆107Aug 12, 2025Updated 11 months ago
zzjas / anypoc
View on GitHub
Generates executable Proof-of-Concept for any bug in any project. AI agents discover and reproduce vulnerabilities — verified, not halluc…
☆27May 5, 2026Updated 2 months ago
pascal-lab / Tai-e-assignments
View on GitHub
Tai-e assignments for static program analysis
☆1,225Aug 28, 2025Updated 10 months ago
pascal-lab / Tai-e
View on GitHub
An easy-to-learn/use static analysis framework for Java and Android
☆1,790Jun 28, 2026Updated 3 weeks ago
soarsmu / VulMaster_
View on GitHub
☆19May 27, 2025Updated last year
php / flowfusion
View on GitHub
A Dataflow-Driven and Automated Fuzzer for the PHP Interpreter
☆48Jun 19, 2025Updated last year
DaweiX / UIHash
View on GitHub
UIHash: Detecting Similar Android UIs through Grid-Based Visual Appearance Representation, USENIX Security '24
☆12Dec 5, 2024Updated last year