CUHK-ARISE / ml4code-dataset
A collection of datasets for machine learning for big code
☆61 Updated 4 years ago
Alternatives and similar repositories for ml4code-dataset
Users interested in ml4code-dataset are comparing it to the repositories listed below
- BugsInPy: Benchmarking Bugs in Python Projects ☆113 Updated last year
- [ICSE 2021] - InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees ☆89 Updated 2 months ago
- This repository is to support contributions for tools and new data entries for the D2A dataset hosted in DAX ☆74 Updated 3 years ago
- A multi-lingual program repair benchmark set based on the Quixey Challenge ☆128 Updated 3 years ago
- ☆117 Updated 2 years ago
- Vul4J: A Dataset of Reproducible Java Vulnerabilities ☆99 Updated last month
- Code of our paper Applying CodeBERT for Automated Program Repair of Java Simple Bugs which is accepted to MSR 2021. ☆53 Updated 2 years ago
- For our ICSE21 paper "CURE: Code-Aware Neural Machine Translation for Automatic Program Repair" by Nan Jiang, Thibaud Lutellier, and Lin … ☆55 Updated 2 years ago
- ☆39 Updated 2 years ago
- Learning graph-based code representations for source-level functional similarity detection. ICSE'23 ☆53 Updated 2 years ago
- Replication package for "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection", ICSE 2024. ☆67 Updated last year
- Code and data for paper "Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree". ☆67 Updated 3 years ago
- Repository for PrimeVul Vulnerability Detection Dataset ☆184 Updated last year
- ☆34 Updated last year
- For our ISSTA20 paper "CoCoNuT: Combining Context-Aware Neural Translation Models using Ensemble for Program Repair" by Thibaud Lutellier… ☆62 Updated 2 years ago
- ☆32 Updated 3 years ago
- ☆49 Updated 2 years ago
- Refactory: Re-factoring based Program Repair applied to Programming Assignments ☆41 Updated 3 years ago
- Statement-level deep learning model for automated software vulnerability detection in C/C++ (Accepted in MSR 2022) ☆72 Updated 3 years ago
- open science repo of "Neural Transfer Learning for Repairing Security Vulnerabilities in C Code" https://arxiv.org/pdf/2104.08308 ☆63 Updated last year
- ☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV)… ☆96 Updated 2 years ago
- ☆18 Updated last year
- A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries ☆332 Updated 4 years ago
- Large Language Models for Software Engineering ☆246 Updated 2 months ago
- VulRepair: A T5-Based Automated Software Vulnerability Repair ☆81 Updated 4 months ago
- ☆61 Updated last year
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan ☆62 Updated 11 months ago
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair ☆69 Updated last year
- Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks ☆237 Updated last year
- ☆60 Updated 2 years ago