A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.
☆15Feb 24, 2022Updated 4 years ago
Alternatives and similar repositories for CoDesc
Users that are interested in CoDesc are comparing it to the libraries listed below
Sorting:
- This repo is the benchmark for source code summarization on C language☆26Mar 18, 2021Updated 4 years ago
- Program Translator AI built on Pytorch☆15Dec 19, 2019Updated 6 years ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆14Dec 2, 2021Updated 4 years ago
- Jigsaw Dataset: Natural language to Python Pandas code☆55Dec 18, 2022Updated 3 years ago
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 9 months ago
- Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems☆16Feb 21, 2023Updated 3 years ago
- ☆15Sep 29, 2025Updated 5 months ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆18Dec 8, 2022Updated 3 years ago
- ☆23Aug 6, 2020Updated 5 years ago
- This is the code repository for our ICPC 2021 paper "Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting"☆24Jan 3, 2023Updated 3 years ago
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆27Aug 12, 2022Updated 3 years ago
- Fast tokenization and structural analysis of any programming language☆62Jan 14, 2025Updated last year
- ☆26Jul 19, 2022Updated 3 years ago
- Code for the ICPC 2020 paper Improved Source Code Summarization via a Graph Neural Network☆68Apr 9, 2021Updated 4 years ago
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models☆35Apr 18, 2023Updated 2 years ago
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- This repository contains the dataset of our ISSTA 2018 paper: An Empirical Study on TensorFlow Program Bugs.☆29May 20, 2020Updated 5 years ago
- SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems☆10Apr 11, 2025Updated 10 months ago
- ☆11Jul 25, 2020Updated 5 years ago
- This repo is the artifact of FUEL☆13Dec 2, 2025Updated 2 months ago
- Added functionality to the cml python package☆14Feb 4, 2026Updated 3 weeks ago
- The impact of text pre-processing methods on the performance of deep learning models for the toxic comments classification☆10Jan 12, 2021Updated 5 years ago
- The official implementation of EMNLP 2021 paper "#HowYouTagTweets: Learning User Hashtagging Preferences via Personalized Topic Attention…☆11Feb 21, 2023Updated 3 years ago
- Implementation of "Automatic Source Code Summarization with Extended Tree-LSTM"☆36Nov 22, 2022Updated 3 years ago
- An Abstractive Summarization(for Datasets in English format) Implementation with Transformer and Pointer-generator☆12Dec 31, 2020Updated 5 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- ☆242Feb 14, 2024Updated 2 years ago
- Description: We want to create a deep Neural Network that can automatically generate comments for code snippets passed to it. The motiva…☆44Nov 16, 2022Updated 3 years ago
- Autoencoder for multi-label classification using Google's Tensorflow framework and MDMR for feature selection.☆10Aug 31, 2017Updated 8 years ago
- ☆10Aug 25, 2020Updated 5 years ago
- TheDeepChecker: Dynamic Debugger for Neural Networks Training Programs☆10Nov 2, 2022Updated 3 years ago
- Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed☆12May 29, 2021Updated 4 years ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- This repository contains the artifacts accompanied by the paper "Fair Preprocessing"☆13Jul 20, 2021Updated 4 years ago
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 4 months ago
- LaTeX Template for Fudan University School of Computer Science 2024☆11May 21, 2024Updated last year
- ☆11Jul 28, 2021Updated 4 years ago