code-desc / CoDesc
A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.
☆15Updated 2 years ago
Alternatives and similar repositories for CoDesc:
Users that are interested in CoDesc are comparing it to the libraries listed below
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Updated 3 years ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆12Updated 3 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Updated 2 years ago
- ☆29Updated 4 years ago
- ☆11Updated 4 years ago
- ☆40Updated 2 years ago
- ☆13Updated last year
- Code and dataset for paper C4: Contrastive Cross-Language Code Clone Detection☆26Updated 2 years ago
- Program Transformation Tool for Java Methods☆11Updated 2 years ago
- [ICSE 2021] - InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees☆89Updated 3 years ago
- ☆24Updated 2 years ago
- NLQF is a tool to filter query-appropriate comments for building high-quality code search datasets.☆17Updated 3 years ago
- Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.☆10Updated 2 years ago
- ☆11Updated last year
- Reproduce the results of Tree-based Convolutional Neural Network (TBCNN)☆38Updated last year
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆25Updated 2 years ago
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆12Updated 4 months ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Updated 2 years ago
- mwcvitkovic / Open-Vocabulary-Learning-on-Source-Code-with-a-Graph-Structured-Cache--Code-PreprocessorLibrary for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Updated 6 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Updated 2 years ago
- ☆33Updated 2 years ago
- ☆12Updated 3 years ago
- code for "Learning to Represent Programs with Heterogeneous Graphs"☆12Updated 2 years ago
- Sequence-to-Sequence Learning for End-to-End Program Repair (IEEE TSE 2019). Open-science repo. http://arxiv.org/pdf/1901.01808☆82Updated last year
- Code for the paper: "Adversarial Examples for Models of Code"☆17Updated 4 years ago
- Towards Robustness of Deep Program Processing Models – Detection, Estimation and Enhancement☆19Updated 2 years ago
- This repository is the replication package of the ICSE22 paper "FIRA: Fine-Grained Graph-Based Code Change Representation for Automated C…☆31Updated 2 years ago
- [FSE 2019] Learning Cross-Language API Mappings with Little Knowledge☆18Updated last year
- Replication package for ICSE2022 paper: On the Evaluation of Neural Code Summarization☆28Updated 2 years ago
- ☆55Updated 2 years ago