A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.
☆15Feb 24, 2022Updated 4 years ago
Alternatives and similar repositories for CoDesc
Users that are interested in CoDesc are comparing it to the libraries listed below
Sorting:
- This repo is the benchmark for source code summarization on C language☆26Mar 18, 2021Updated 5 years ago
- Program Translator AI built on Pytorch☆15Dec 19, 2019Updated 6 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆55Feb 24, 2022Updated 4 years ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆15Dec 2, 2021Updated 4 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated last month
- This is the code repository for our ICPC 2021 paper "Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting"☆24Jan 3, 2023Updated 3 years ago
- Jigsaw Dataset: Natural language to Python Pandas code☆55Dec 18, 2022Updated 3 years ago
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…☆66Dec 3, 2021Updated 4 years ago
- Code Generation as a Dual Task of Code Summarization.☆30Jun 28, 2021Updated 4 years ago
- This is the official implement for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases''☆14Oct 4, 2023Updated 2 years ago
- ☆13Jul 6, 2023Updated 2 years ago
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 9 months ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆18Dec 8, 2022Updated 3 years ago
- Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems☆16Feb 21, 2023Updated 3 years ago
- ☆30Nov 23, 2020Updated 5 years ago
- Set of tools to help working with "Big Code"☆42Apr 28, 2022Updated 3 years ago
- ☆15Sep 29, 2025Updated 5 months ago
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆27Aug 12, 2022Updated 3 years ago
- Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed☆12May 29, 2021Updated 4 years ago
- Code for the ICPC 2020 paper Improved Source Code Summarization via a Graph Neural Network☆68Apr 9, 2021Updated 4 years ago
- Implementation of "Automatic Source Code Summarization with Extended Tree-LSTM"☆36Nov 22, 2022Updated 3 years ago
- ☆23Aug 6, 2020Updated 5 years ago
- Description: We want to create a deep Neural Network that can automatically generate comments for code snippets passed to it. The motiva…☆44Nov 16, 2022Updated 3 years ago
- ☆26Jul 19, 2022Updated 3 years ago
- code for "Retrieve and Refine: Exemplar-based Neural Comment Generation"☆15Mar 27, 2021Updated 4 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15May 18, 2022Updated 3 years ago
- NNGen, a simple baseline for commit message generation from diffs.☆15Nov 25, 2022Updated 3 years ago
- ☆35Updated this week
- ☆20Jul 26, 2023Updated 2 years ago
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models☆35Apr 18, 2023Updated 2 years ago
- Funcom Source Code Summarization Tool - Public Release☆34Apr 26, 2024Updated last year
- ☆15Oct 11, 2023Updated 2 years ago
- PROGEX (Program Graph Extractor); a cross platform tool for extracting graphical program representations from software source code☆89Aug 8, 2021Updated 4 years ago
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- Repository for the code of the "A Convolutional Attention Network for Extreme Summarization of Source Code" paper☆120Jul 19, 2016Updated 9 years ago
- LaTeX Template for Fudan University School of Computer Science 2024☆11May 21, 2024Updated last year
- This repository contains the dataset of our ISSTA 2018 paper: An Empirical Study on TensorFlow Program Bugs.☆29May 20, 2020Updated 5 years ago
- 上海房产信息实录,分析每个小区优劣,包括地段、地铁、学区等☆10Apr 7, 2019Updated 6 years ago
- This repository contains the artifacts accompanied by the paper "Fair Preprocessing"☆13Jul 20, 2021Updated 4 years ago