saltudelft / ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
☆709Updated 9 months ago
Alternatives and similar repositories for ml4se:
Users that are interested in ml4se are comparing it to the libraries listed below
- Large Language Models for Software Engineering☆223Updated this week
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆292Updated 3 weeks ago
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context".☆169Updated 3 years ago
- A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries☆292Updated 4 years ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor…☆149Updated last year
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆70Updated 11 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 2 months ago
- A continuously updated collection of CodeLLM papers maintained by PurCL group @ Purdue☆407Updated last week
- ☆138Updated 5 months ago
- A collection of datasets for machine learning for big code☆56Updated 3 years ago
- A library for mining of path-based representations of code (and more)☆287Updated last year
- Repo-Level Code generation papers☆167Updated 3 weeks ago
- Benchmark ClassEval for class-level code generation.☆141Updated 6 months ago
- Vul4J: A Dataset of Reproducible Java Vulnerabilities☆82Updated 2 months ago
- Large Language Models for Software Engineering: A Systematic Literature Review☆78Updated 8 months ago
- ☆204Updated 9 months ago
- ☆61Updated last year
- ☆234Updated last year
- ☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV)…☆90Updated last year
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆190Updated last year
- Pip compatible CodeBLEU metric implementation available for linux/macos/win☆88Updated 3 weeks ago
- A Systematic Literature Review on Large Language Models for Automated Program Repair☆183Updated 5 months ago
- Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks☆220Updated last year
- ☆29Updated 2 years ago
- GitHub Search: Platform used to crawl, store and present projects from GitHub, as well as any statistics related to them☆154Updated this week
- Replication package for "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection", ICSE 2024.☆61Updated 7 months ago
- A multi-lingual program repair benchmark set based on the Quixey Challenge☆114Updated 2 years ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆187Updated 3 years ago
- Extract and combine multiple source code views using tree-sitter☆132Updated 4 months ago
- Repository for PrimeVul Vulnerability Detection Dataset☆139Updated 7 months ago