ASSERT-KTH / repairbench
Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/
☆10Updated 2 weeks ago
Alternatives and similar repositories for repairbench:
Users that are interested in repairbench are comparing it to the libraries listed below
- Automatic Repair Framework with LLMs ❤️ https://arxiv.org/pdf/2409.18952☆21Updated this week
- RepoQA: Evaluating Long-Context Code Understanding☆107Updated 5 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆68Updated 7 months ago
- Large Language Models Meet NL2Code: A Survey☆36Updated 5 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated 2 weeks ago
- Advancing LLM with Diverse Coding Capabilities☆69Updated 9 months ago
- EvoEval: Evolving Coding Benchmarks via LLM☆68Updated last year
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆12Updated 8 months ago
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆41Updated 8 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆139Updated 8 months ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆55Updated 9 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆45Updated 3 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆159Updated 3 weeks ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆83Updated last month
- Harness used to benchmark aider against SWE Bench benchmarks☆70Updated 10 months ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆38Updated 2 weeks ago
- ☆91Updated 9 months ago
- ☆31Updated 2 weeks ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆12Updated 3 months ago
- Updated 7 months ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆62Updated 3 years ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆15Updated last year
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆156Updated 8 months ago
- ⚙️ A tool for collecting executable code datasets with GitHub Actions ⚙️☆21Updated this week
- ☆109Updated 9 months ago
- Code and Data for: Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming☆32Updated last year
- ☆60Updated 11 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆71Updated 3 months ago
- ☆92Updated 7 months ago