JetBrains-Research / EnvBench
[DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup
☆12 Updated last week
Alternatives and similar repositories for EnvBench
Users who are interested in EnvBench are comparing it to the repositories listed below.
- Pip compatible CodeBLEU metric implementation available for linux/macos/win ☆93 Updated 2 months ago
- A multi-lingual program repair benchmark set based on the Quixey Challenge ☆118 Updated 2 years ago
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair ☆71 Updated last year
- ☆30 Updated 2 years ago
- ☆23 Updated 8 months ago
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories ☆60 Updated 9 months ago
- EvoEval: Evolving Coding Benchmarks via LLM ☆72 Updated last year
- A Reproducible Benchmark of Recent Java Bugs ☆38 Updated last month
- TeCo: an ML+Execution model for test completion ☆31 Updated 11 months ago
- BugsInPy: Benchmarking Bugs in Python Projects ☆101 Updated 10 months ago
- [ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing ☆9 Updated 3 months ago
- Benchmark ClassEval for class-level code generation. ☆143 Updated 7 months ago
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan ☆60 Updated 7 months ago
- ☆25 Updated 2 years ago
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G… ☆25 Updated 2 years ago
- Large Language Models for Software Engineering ☆230 Updated this week
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models". ☆77 Updated 10 months ago
- ☆59 Updated 4 months ago
- ☆23 Updated last year
- ✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug ☆65 Updated last year
- Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main) ☆26 Updated 2 months ago
- We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e… ☆23 Updated 2 years ago
- ☆33 Updated last year
- Dianshu-Liao / AAA-Code-Generation-Framework-for-Code-Repository-Local-Aware-Global-Aware-Third-Party-Aware ☆19 Updated last year
- ☆22 Updated 4 months ago
- ☆47 Updated 11 months ago
- A Systematic Literature Review on Large Language Models for Automated Program Repair ☆185 Updated 6 months ago
- ☆28 Updated 2 years ago
- This repo is for our submission for ICSE 2025. ☆20 Updated 11 months ago
- Refactory: Re-factoring based Program Repair applied to Programming Assignments ☆39 Updated 2 years ago