mahimanzum / FixEval
We introduce FixEval, a dataset for competitive programming bug fixing, along with a comprehensive test suite, and show the necessity of execution-based evaluation compared to suboptimal match-based evaluation metrics such as BLEU, CodeBLEU, Syntax Match, and Exact Match.
☆23Updated 2 years ago
Alternatives and similar repositories for FixEval
Users who are interested in FixEval are comparing it to the libraries listed below.
- ☆43Updated 3 months ago
- Code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Updated 5 months ago
- ☆17Updated 2 years ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆31Updated 11 months ago
- For our ACL25 Paper: Can Language Models Replace Programmers? RepoCod Says ‘Not Yet’ - by Shanchao Liang and Yiran Hu and Nan Jiang and L…☆19Updated last week
- Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework"☆62Updated 3 years ago
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆74Updated 4 months ago
- This is the official implementation for the paper "Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases"☆14Updated last year
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Updated 4 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Updated 2 years ago
- ☆47Updated 2 years ago
- Replication package for ICSE2022 paper: On the Evaluation of Neural Code Summarization☆28Updated 2 years ago
- [NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning☆22Updated 6 months ago
- Source Code Data Augmentation for Deep Learning: A Survey.☆64Updated 11 months ago
- Contains the code and data for our #ICSE2022 paper titled "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15Updated 3 years ago
- Reinforcement Learning for Repository-Level Code Completion☆33Updated 9 months ago
- ☆12Updated last year
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆39Updated 2 months ago
- ☆20Updated 2 years ago
- ☆28Updated 2 years ago
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆82Updated 8 months ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆74Updated last year
- A collection of practical code generation tasks and tests from open source projects. Complementary to HumanEval by OpenAI.☆24Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆53Updated last year
- ☆41Updated 2 years ago
- ☆5Updated 2 years ago
- Dianshu-Liao / AAA-Code-Generation-Framework-for-Code-Repository-Local-Aware-Global-Aware-Third-Party-Aware☆19Updated last year
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆54Updated 10 months ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆30Updated 2 months ago
- Official implementation of our ICSE 2023 paper on Automatic Code Generation.☆27Updated last year