saltudelft / ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
☆693Updated 6 months ago
Alternatives and similar repositories for ml4se:
Users that are interested in ml4se are comparing it to the libraries listed below
- Large Language Models for Software Engineering☆204Updated this week
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆282Updated 2 weeks ago
- A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries☆258Updated 3 years ago
- Repository for PrimeVul Vulnerability Detection Dataset☆93Updated 4 months ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor…☆136Updated last year
- Large Language Models for Software Engineering: A Systematic Literature Review☆59Updated 5 months ago
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆69Updated 8 months ago
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan☆59Updated 3 months ago
- A collection of datasets for machine learning for big code☆46Updated 3 years ago
- A continuously updated collection of CodeLLM papers☆261Updated this week
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆180Updated 10 months ago
- ☆190Updated 5 months ago
- Pip compatible CodeBLEU metric implementation available for linux/macos/win☆72Updated this week
- Repo-Level Code generation papers☆120Updated last month
- Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.☆375Updated last month
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆289Updated 5 months ago
- ☆57Updated last year
- DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection (RAID 2023) https://surrealyz.github.io/…☆117Updated 2 months ago
- Benchmark ClassEval for class-level code generation.☆136Updated 2 months ago
- ☆101Updated 6 months ago
- Replication package for "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection", ICSE 2024.☆51Updated 3 months ago
- Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks☆205Updated last year
- A library for mining of path-based representations of code (and more)☆285Updated last year
- SeqTrans: Automatic Vulnerability Fix via Sequence to Sequence Learning☆15Updated 2 years ago
- ☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV)…☆84Updated last year
- PatchFinder: A Two-Phase Approach to Security Patch Tracing for Disclosed Vulnerabilities in Open Source Software (ISSTA 2024)☆17Updated last month
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆187Updated 2 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆70Updated last year
- [ICSE 2024 Industry Challenge Track] Official implementation of "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".☆51Updated last month
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context".☆167Updated 2 years ago