thunlp/DebugBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thunlp/DebugBench)

thunlp / DebugBench

The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".

☆87

Alternatives and similar repositories for DebugBench

Users that are interested in DebugBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NEUIR / COAST
View on GitHub
Official repository for the paper "COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis".
☆18Feb 19, 2025Updated last year
multimodal-art-projection / CodeCriticBench
View on GitHub
☆16Nov 1, 2025Updated 8 months ago
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
iSEngLab / AwesomeLLM4APR
View on GitHub
[TOSEM 2026]A Systematic Literature Review on Large Language Models for Automated Program Repair
☆244May 1, 2026Updated 2 months ago
wanghanbinpanda / Large-Language-Models-for-Code
View on GitHub
Large Language Models(LLMs) of Code
☆20Apr 23, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
afortunado-aceptado / Rudra
View on GitHub
This repo is for our submission for ICSE 2025.
☆20Jun 12, 2024Updated 2 years ago
ruocwang / llm-symbolic-program
View on GitHub
Official implementation: Large Language Models are Interpretable Learners - Google
☆13Jun 29, 2024Updated 2 years ago
sfeng-m / REAL4MWP
View on GitHub
Code for EMNLP 2021 Paper "Recall and Learn: A Memory-augmented Solver for Math Word Problems".
☆16Oct 20, 2022Updated 3 years ago
IntelligentDDS / LogReducer
View on GitHub
☆15Jan 7, 2023Updated 3 years ago
ReliableCoding / REPEAT
View on GitHub
☆10Apr 15, 2023Updated 3 years ago
pkuzqh / ICSE23Repair
View on GitHub
An implementation of Tare.
☆12Feb 23, 2024Updated 2 years ago
FloridSleeves / LLMDebugger
View on GitHub
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)
☆587Sep 10, 2024Updated last year
ASSERT-KTH / cigar
View on GitHub
Efficient APR with LLMs http://arxiv.org/pdf/2402.06598
☆16May 28, 2024Updated 2 years ago
ridgesai / ridges-old
View on GitHub
☆12May 30, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zkx06111 / ALGO
View on GitHub
☆36May 25, 2023Updated 3 years ago
TencentARC / Plot2Code
View on GitHub
☆23Aug 17, 2024Updated last year
gitbugactions / gitbug-java
View on GitHub
A Reproducible Benchmark of Recent Java Bugs
☆52Aug 19, 2025Updated 11 months ago
shentianxiao / FiLM
View on GitHub
☆13Oct 18, 2023Updated 2 years ago
aorwall / moatless-tree-search
View on GitHub
☆141Jun 6, 2025Updated last year
bytedance / FullStackBench
View on GitHub
Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"
☆122May 7, 2025Updated last year
GhabiX / SRepair
View on GitHub
✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug
☆79Apr 23, 2026Updated 2 months ago
ARiSE-Lab / CYCLE_OOPSLA_24
View on GitHub
Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"
☆10Mar 8, 2024Updated 2 years ago
zlwang-cs / LASER-release
View on GitHub
Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework
☆14May 31, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jyi / ITSP
View on GitHub
Artifacts of FSE-2017 paper on an Intelligent Tutoring System for Programming
☆27May 21, 2019Updated 7 years ago
CUHK-Shenzhen-SE / D4C
View on GitHub
[ICSE'25] Aligning the Objective of LLM-based Program Repair
☆24Mar 8, 2025Updated last year
kwaipilot / SWE-Compass
View on GitHub
☆18Mar 28, 2026Updated 3 months ago
gonglinyuan / safim
View on GitHub
☆48May 6, 2025Updated last year
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
XiangJinyu / APrompt
View on GitHub
An automatic prompt iteration and optimization generator suitable for any scenario
☆16Jan 31, 2025Updated last year
secure-foundations / human-eval-verus
View on GitHub
☆27Updated this week
kaiiiz / NTU-Computer-Security-2021-Fall
View on GitHub
台大計算機安全 (交大程式安全) 2021 Fall
☆25Feb 17, 2022Updated 4 years ago
google-research / runtime-error-prediction
View on GitHub
This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti…
☆25Nov 18, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Feng-Jay / GiantRepair
View on GitHub
Artifact for TOSEM Submission: GiantRepair
☆12Jun 26, 2024Updated 2 years ago
ntunlp / xCodeEval
View on GitHub
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
☆90Sep 17, 2024Updated last year
BAAI-WuDao / Code
View on GitHub
“悟道”源代码
☆21Aug 24, 2021Updated 4 years ago
lt-asset / REPOCOD
View on GitHub
For our ACL25 Paper: Can Language Models Replace Programmers? RepoCod Says ‘Not Yet’ - by Shanchao Liang and Yiran Hu and Nan Jiang and L…
☆27Aug 27, 2025Updated 10 months ago
LiveCodeBench / LiveCodeBench
View on GitHub
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
☆911Jul 16, 2025Updated last year
THUDM / NaturalCodeBench
View on GitHub
NaturalCodeBench (Findings of ACL 2024)
☆70Oct 14, 2024Updated last year
evo-eval / evoeval
View on GitHub
EvoEval: Evolving Coding Benchmarks via LLM
☆84Apr 6, 2024Updated 2 years ago