bdqnghi/awesome-ai4code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bdqnghi/awesome-ai4code)

bdqnghi / awesome-ai4code

A collection of recent papers, benchmarks and datasets of AI4Code domain.

☆61

Alternatives and similar repositories for awesome-ai4code

Users that are interested in awesome-ai4code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bdqnghi / bi-tbcnn
View on GitHub
Bilateral Neural Network implementation in Tensorflow
☆53Mar 23, 2019Updated 7 years ago
bdqnghi / SAR_API_mapping
View on GitHub
[FSE 2019] Learning Cross-Language API Mappings with Little Knowledge
☆19Jul 6, 2023Updated 3 years ago
bdqnghi / ggnn.tensorflow
View on GitHub
Tensorflow implementation of Gated Graph Neural Network for Source Code Classification
☆42Jan 12, 2021Updated 5 years ago
bdqnghi / tbcnn.tensorflow
View on GitHub
Reproduce the results of Tree-based Convolutional Neural Network (TBCNN)
☆39Mar 25, 2023Updated 3 years ago
bdqnghi / hierarchical-programming-language-mapping
View on GitHub
[ICSE'18] Hierarchical Learning of Cross-Language Mappings through Distributed Vector Representations for Code
☆22May 18, 2018Updated 8 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
google-research-datasets / great
View on GitHub
The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…
☆22Aug 19, 2020Updated 5 years ago
bdqnghi / ast-node-encoding
View on GitHub
A tool to convert nodes in an Abstract Syntax Tree into vector embeddings
☆78Apr 25, 2022Updated 4 years ago
bdqnghi / infercode
View on GitHub
[ICSE 2021] - InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees
☆92Aug 8, 2025Updated 11 months ago
FSoft-AI4Code / DocChecker
View on GitHub
DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment
☆15Jan 23, 2024Updated 2 years ago
chengjunyan1 / GN-Transformer-AST
View on GitHub
Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".
☆17May 25, 2025Updated last year
yuewang-cuhk / awesome-programming-language-pretraining-papers
View on GitHub
Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)
☆60Dec 17, 2021Updated 4 years ago
WM-SEMERU / dl4se
View on GitHub
A Systematic Literature Review of Deep Learning in Software Engineering
☆20Aug 28, 2024Updated last year
FSoft-AI4Code / RepoExec
View on GitHub
[NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…
☆45Jan 8, 2026Updated 6 months ago
zkcpku / HiT-hierarchy-transformer
View on GitHub
code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"
☆12Dec 13, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
DeepSoftwareAnalytics / CodeSumEvaluation
View on GitHub
Replication package for ICSE2022 paper: On the Evaluation of Neural Code Summarization
☆29Sep 20, 2022Updated 3 years ago
bayesgroup / code_transformers
View on GitHub
Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…
☆67Dec 3, 2021Updated 4 years ago
FSoft-AI4Code / CodeCapybara
View on GitHub
Open-source Self-Instruction Tuning Code LLM
☆172Apr 26, 2023Updated 3 years ago
FSoft-AI4Code / CodeMMLU
View on GitHub
[ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.
☆29Apr 21, 2025Updated last year
SEKE-Adversary / MHM
View on GitHub
Generating Adversarial Examples for Holding Robustness of Source Code Processing Models
☆17Dec 2, 2021Updated 4 years ago
FSoft-AI4Code / CodeText-parser
View on GitHub
⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file
☆53Jul 1, 2024Updated 2 years ago
swtheing / WizardCoder_Instruct_Generator
View on GitHub
Generate the WizardCoder Instruct from the CodeAlpaca
☆21Jun 27, 2023Updated 3 years ago
salesforce / CodeRL
View on GitHub
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…
☆573Jun 2, 2026Updated last month
NougatCA / SPT-Code
View on GitHub
☆49Nov 19, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LRNavin / AutoComments
View on GitHub
Description: We want to create a deep Neural Network that can automatically generate comments for code snippets passed to it. The motiva…
☆44Nov 16, 2022Updated 3 years ago
mdrafiqulrabin / tnpa-generalizability
View on GitHub
IST'21 & SANER'22: Semantic-Preserving Program Transformations
☆31Oct 25, 2022Updated 3 years ago
Fsoft-AIC / NCO-LLM
View on GitHub
[ICLR 2025 - Workshop AgenticAI Oral] Large Language Models powered Neural Solvers for Generalized Vehicle Routing Problems
☆27May 29, 2025Updated last year
mahimanzum / FixEval
View on GitHub
We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…
☆26Aug 31, 2022Updated 3 years ago
saltudelft / ml4se
View on GitHub
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
☆733Nov 6, 2025Updated 8 months ago
microsoft / ReACC
View on GitHub
Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“
☆67Apr 18, 2022Updated 4 years ago
microsoft / JigsawDataset
View on GitHub
Jigsaw Dataset: Natural language to Python Pandas code
☆55Dec 18, 2022Updated 3 years ago
saikat107 / Codit
View on GitHub
☆13Jul 6, 2023Updated 3 years ago
Pengyu03 / LLM-Commit-Message-Generation
View on GitHub
This repository contains source code and a high-quality test dataset for "Automated Commit Message Generation with Large Language Models.…
☆10Nov 6, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yueyueL / ReliableLM4Code
View on GitHub
Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; L…
☆30Dec 14, 2023Updated 2 years ago
eth-sri / TFix
View on GitHub
☆71May 12, 2022Updated 4 years ago
microsoft / neurips21-self-supervised-bug-detection-and-repair
View on GitHub
Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021
☆111Aug 30, 2022Updated 3 years ago
google-deepmind / codesembench
View on GitHub
☆16Mar 22, 2024Updated 2 years ago
modit-team / MODIT
View on GitHub
MODIT: On Multi-Modal Learning of Editing Source Code.
☆20Apr 24, 2021Updated 5 years ago
mwcvitkovic / Open-Vocabulary-Learning-on-Source-Code-with-a-Graph-Structured-Cache--Code-Preprocessor
View on GitHub
Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…
☆21Oct 22, 2018Updated 7 years ago
9erxis / DietCode
View on GitHub
☆20Mar 6, 2023Updated 3 years ago