CUHK-ARISE / ml4code-datasetLinks
A collection of datasets for machine learning for big code
☆61Updated 4 years ago
Alternatives and similar repositories for ml4code-dataset
Users that are interested in ml4code-dataset are comparing it to the libraries listed below
Sorting:
- A multi-lingual program repair benchmark set based on the Quixey Challenge☆130Updated 3 years ago
 - This repository is to support contributions for tools and new data entries for the D2A dataset hosted in DAX☆74Updated 3 years ago
 - Vul4J: A Dataset of Reproducible Java Vulnerabilities☆103Updated 2 months ago
 - For our ISSTA23 paper "How Effective are Neural Networks for Fixing Security Vulnerabilities?" by Yi Wu, Nan Jiang, Hung Viet Pham, Thiba…☆40Updated last year
 - [ICSE 2021] - InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees☆89Updated 2 months ago
 - ☆32Updated 3 years ago
 - Replication package for "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection", ICSE 2024.☆67Updated last year
 - VulRepair: A T5-Based Automated Software Vulnerability Repair☆81Updated 5 months ago
 - ☆39Updated 2 years ago
 - Statement-level deep learning model for automated software vulnerability detection in C/C++ (Accepted in MSR 2022)☆72Updated 3 years ago
 - Code of our paper Applying CodeBERT for Automated Program Repair of Java Simple Bugs which is accepted to MSR 2021.☆53Updated 2 years ago
 - NaturalCC: An Open-Source Toolkit for Code Intelligence☆307Updated last month
 - ☆120Updated 3 years ago
 - open science repo of "Neural Transfer Learning for Repairing Security Vulnerabilities in C Code" https://arxiv.org/pdf/2104.08308☆63Updated last year
 - Repository for PrimeVul Vulnerability Detection Dataset☆188Updated last year
 - [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆72Updated last year
 - ☆34Updated last year
 - ☆24Updated 4 years ago
 - This repository is the replication package of the ICSE22 paper "FIRA: Fine-Grained Graph-Based Code Change Representation for Automated C…☆32Updated 3 years ago
 - For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan☆62Updated last year
 - Large Language Models for Software Engineering☆252Updated 3 months ago
 - RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair http://arxiv.org/pdf/2312.15698☆36Updated last month
 - ☆25Updated 3 years ago
 - A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries☆337Updated 4 years ago
 - BugsInPy: Benchmarking Bugs in Python Projects☆116Updated 3 weeks ago
 - ☆19Updated last year
 - ☆61Updated last year
 - ☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV)…☆98Updated 2 years ago
 - ☆217Updated last year
 - Refactory: Re-factoring based Program Repair applied to Programming Assignments☆41Updated 3 years ago