mahimanzum / FixEval
We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of execution based evaluation compared to suboptimal match based evaluation metrics like BLEU, CodeBLEU, Syntax Match, Exact Match etc.
☆22Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for FixEval
- Source Code Data Augmentation for Deep Learning: A Survey.☆60Updated 5 months ago
- ☆40Updated 2 months ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆27Updated 4 months ago
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆66Updated last year
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆53Updated 3 months ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆59Updated 2 years ago
- ☆17Updated 2 years ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆14Updated last year
- ☆19Updated last year
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)☆57Updated 2 years ago
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Updated 3 years ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆20Updated last year
- ☆29Updated last year
- Replication package for ICSE2022 paper: On the Evaluation of Neural Code Summarization☆27Updated 2 years ago
- ☆46Updated 2 years ago
- This is the official implement for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases''☆14Updated last year
- Official implementation of our ICSE 2023 paper on Automatic Code Generation.☆23Updated last year
- ☆12Updated 8 months ago
- ☆43Updated 2 years ago
- The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".☆57Updated 4 months ago
- Improving Machine Translation Systems via Isotopic Replacement☆12Updated last year
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆48Updated 8 months ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Updated 2 years ago
- ☆5Updated last year
- ☆117Updated last year
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆23Updated 2 months ago
- ☆12Updated 2 months ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆20Updated last year
- Program Transformation Tool for Java Methods☆11Updated 2 years ago