bdqnghi / awesome-ai4code
A collection of recent papers, benchmarks, and datasets in the AI4Code domain.
☆54 Updated 4 months ago
Related projects:
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs) ☆57 Updated 2 years ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation. ☆53 Updated last month
- ☆40 Updated 3 weeks ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" ☆46 Updated 6 months ago
- ☆42 Updated last year
- ☆46 Updated 2 years ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications ☆19 Updated last year
- Code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models" ☆12 Updated last year
- Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer ☆35 Updated 3 years ago
- Code Generation as a Dual Task of Code Summarization. ☆30 Updated 3 years ago
- MODIT: On Multi-Modal Learning of Editing Source Code. ☆19 Updated 3 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation" ☆65 Updated 8 months ago
- Replication package for evaluation of code generation metrics ☆13 Updated last year
- Baselines for all tasks from Long Code Arena benchmarks 🏟️ ☆19 Updated last week
- Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation" ☆9 Updated 2 years ago
- ☆5 Updated last year
- CoditT5: Pretraining for Source Code and Natural Language Editing ☆29 Updated last year
- Source Code Data Augmentation for Deep Learning: A Survey. ☆58 Updated 3 months ago
- Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework" ☆55 Updated 2 years ago
- A large dataset of 4.2M Java source code samples with parallel descriptions, from code search and code summarization studies. ☆52 Updated 2 years ago
- ☆16 Updated 2 years ago
- A benchmark for source code summarization on the C language ☆23 Updated 3 years ago
- A Tree-Based Transformer Architecture for Code Generation. (AAAI'20) ☆90 Updated 2 years ago
- Deep Just-In-Time Inconsistency Detection Between Comments and Source Code: Artifact ☆21 Updated 2 years ago
- Contains the code and data for our #ICSE2022 paper titled "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an… ☆14 Updated 2 years ago
- VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning ☆38 Updated last year
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code ☆67 Updated 3 months ago
- Cross-Domain Deep Code Search with Few-Shot Learning ☆11 Updated last year
- Probing pre-trained source code models ☆15 Updated 2 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f… ☆22 Updated 4 years ago