saltudelft / ml4seView external linksLinks
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
☆731Nov 6, 2025Updated 3 months ago
Alternatives and similar repositories for ml4se
Users that are interested in ml4se are comparing it to the libraries listed below
Sorting:
- Large Language Models for Software Engineering☆259Jul 24, 2025Updated 6 months ago
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆313Feb 6, 2026Updated last week
- CodeXGLUE☆1,800Apr 23, 2024Updated last year
- Source Code Data Augmentation for Deep Learning: A Survey.☆66Jun 15, 2024Updated last year
- ☆49Jul 24, 2022Updated 3 years ago
- [TOSEM 2026]A Systematic Literature Review on Large Language Models for Automated Program Repair☆230Jan 18, 2026Updated 3 weeks ago
- CodeBERT☆2,729Jul 9, 2023Updated 2 years ago
- A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries☆351Mar 25, 2021Updated 4 years ago
- VulRepair: A T5-Based Automated Software Vulnerability Repair☆84May 13, 2025Updated 9 months ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆58Mar 20, 2024Updated last year
- Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks☆256Jan 19, 2024Updated 2 years ago
- ☆223Jul 25, 2024Updated last year
- Simplified Source Code Pre-Training for Vulnerability Detection☆113Dec 4, 2025Updated 2 months ago
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis…☆84Nov 4, 2023Updated 2 years ago
- A continuously updated collection of CodeLLM papers maintained by PurCL group @ Purdue☆599Jan 14, 2026Updated last month
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆75May 3, 2024Updated last year
- open science repo of "Neural Transfer Learning for Repairing Security Vulnerabilities in C Code" https://arxiv.org/pdf/2104.08308☆63Feb 23, 2024Updated last year
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Feb 7, 2025Updated last year
- VulnerabilityDetectionResearch☆93Mar 22, 2022Updated 3 years ago
- [TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.☆3,215Feb 1, 2026Updated last week
- [SCIS 2025] A Survey on Large Language Models for Software Engineering☆309Feb 6, 2025Updated last year
- ☆29Oct 29, 2022Updated 3 years ago
- ☆41Jan 13, 2023Updated 3 years ago
- Statement-level deep learning model for automated software vulnerability detection in C/C++ (Accepted in MSR 2022)☆75Jun 24, 2022Updated 3 years ago
- ☆52Nov 19, 2022Updated 3 years ago
- [ICSE 2021] - InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees☆89Aug 8, 2025Updated 6 months ago
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024)☆77Aug 5, 2025Updated 6 months ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆79Jan 21, 2024Updated 2 years ago
- ☆61Dec 21, 2023Updated 2 years ago
- Collect simple coverage information in memory.☆11Oct 6, 2022Updated 3 years ago
- This is the official repository for VulHawk.☆74Mar 28, 2023Updated 2 years ago
- An awesome & curated list of binary code similarity papers☆597Jan 5, 2026Updated last month
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan☆63Oct 16, 2024Updated last year
- Seq2seq Type Inference using Static Analysis and CodeT5☆32Jul 9, 2023Updated 2 years ago
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,687Oct 2, 2025Updated 4 months ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆186Mar 1, 2022Updated 3 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆19Dec 12, 2022Updated 3 years ago
- Replication Package for "Natural Attack for Pre-trained Models of Code", ICSE 2022☆51Nov 7, 2025Updated 3 months ago