saltudelft / ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
☆708Updated 8 months ago
Alternatives and similar repositories for ml4se:
Users that are interested in ml4se are comparing it to the libraries listed below
- Large Language Models for Software Engineering☆217Updated this week
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆289Updated this week
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor…☆147Updated last year
- A library for mining of path-based representations of code (and more)☆287Updated last year
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆69Updated 11 months ago
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆70Updated 2 months ago
- ☆235Updated last year
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan☆59Updated 5 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆290Updated last month
- A Systematic Literature Review on Large Language Models for Automated Program Repair☆175Updated 4 months ago
- Repo-Level Code generation papers☆154Updated this week
- Benchmark ClassEval for class-level code generation.☆138Updated 5 months ago
- Extract and combine multiple source code views using tree-sitter☆126Updated 3 months ago
- Repository for PrimeVul Vulnerability Detection Dataset☆126Updated 6 months ago
- Large Language Models for Software Engineering: A Systematic Literature Review☆71Updated 7 months ago
- CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software☆240Updated 8 months ago
- Pip compatible CodeBLEU metric implementation available for linux/macos/win☆83Updated this week
- ☆23Updated 11 months ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆187Updated 3 years ago
- ✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug☆60Updated 11 months ago
- A continuously updated collection of CodeLLM papers maintained by PurCL group @ Purdue☆373Updated last week
- A collection of datasets for machine learning for big code☆54Updated 3 years ago
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆187Updated last year
- A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries☆288Updated 4 years ago
- BugsInPy: Benchmarking Bugs in Python Projects☆94Updated 8 months ago
- A Survey on Large Language Models for Software Engineering☆229Updated last month
- Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks☆214Updated last year
- Code and dataset for paper C4: Contrastive Cross-Language Code Clone Detection☆30Updated 2 years ago
- ☆132Updated 4 months ago
- CodeXGLUE☆1,642Updated 11 months ago