A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
☆729Nov 6, 2025Updated 4 months ago
Alternatives and similar repositories for ml4se
Users that are interested in ml4se are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Large Language Models for Software Engineering☆258Jul 24, 2025Updated 8 months ago
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆317Updated this week
- ☆48Nov 19, 2025Updated 4 months ago
- CodeXGLUE☆1,808Apr 23, 2024Updated last year
- Source Code Data Augmentation for Deep Learning: A Survey.☆66Jun 15, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆49Jul 24, 2022Updated 3 years ago
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆75May 3, 2024Updated last year
- CodeBERT☆2,742Jul 9, 2023Updated 2 years ago
- [TOSEM 2026]A Systematic Literature Review on Large Language Models for Automated Program Repair☆231Updated this week
- Simplified Source Code Pre-Training for Vulnerability Detection☆115Dec 4, 2025Updated 3 months ago
- ☆226Jul 25, 2024Updated last year
- ☆41Jan 13, 2023Updated 3 years ago
- A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries☆357Mar 25, 2021Updated 5 years ago
- [SCIS 2025] A Survey on Large Language Models for Software Engineering☆316Feb 6, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- VulRepair: A T5-Based Automated Software Vulnerability Repair☆84May 13, 2025Updated 10 months ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆58Mar 20, 2024Updated 2 years ago
- open science repo of "Neural Transfer Learning for Repairing Security Vulnerabilities in C Code" https://arxiv.org/pdf/2104.08308☆63Feb 23, 2024Updated 2 years ago
- [TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.☆3,260Mar 5, 2026Updated 3 weeks ago
- Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks☆256Jan 19, 2024Updated 2 years ago
- VulnerabilityDetectionResearch☆94Mar 22, 2022Updated 4 years ago
- This is the official repository for VulHawk.☆76Mar 28, 2023Updated 2 years ago
- ☆29Oct 29, 2022Updated 3 years ago
- Seq2seq Type Inference using Static Analysis and CodeT5☆32Jul 9, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆292Feb 7, 2025Updated last year
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆19Dec 12, 2022Updated 3 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- An awesome & curated list of binary code similarity papers☆600Jan 5, 2026Updated 2 months ago
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan☆63Oct 16, 2024Updated last year
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis…☆86Nov 4, 2023Updated 2 years ago
- ☆22Mar 21, 2024Updated 2 years ago
- A continuously updated collection of CodeLLM papers maintained by PurCL group @ Purdue☆614Jan 14, 2026Updated 2 months ago
- ☆16Aug 16, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Statement-level deep learning model for automated software vulnerability detection in C/C++ (Accepted in MSR 2022)☆76Jun 24, 2022Updated 3 years ago
- Replication Package for "Natural Attack for Pre-trained Models of Code", ICSE 2022☆52Nov 7, 2025Updated 4 months ago
- Replication package for "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection", ICSE 2024.☆75Sep 24, 2024Updated last year
- Code for the paper "A Lightweight Framework for Function Name Reassignment Based on Large-Scale Stripped Binaries"☆15Jul 3, 2021Updated 4 years ago
- ☆54Nov 19, 2022Updated 3 years ago
- Repository for PrimeVul Vulnerability Detection Dataset☆228Sep 7, 2024Updated last year
- ☆23Mar 25, 2023Updated 3 years ago