CUHK-ARISE / ml4code-dataset
A collection of datasets for machine learning for big code
☆46 · Updated 3 years ago
Alternatives and similar repositories for ml4code-dataset:
Users interested in ml4code-dataset are comparing it to the repositories listed below.
- ☆40 · Updated 2 years ago
- ☆33 · Updated 2 years ago
- open science repo of "Neural Transfer Learning for Repairing Security Vulnerabilities in C Code" https://arxiv.org/pdf/2104.08308 ☆57 · Updated 10 months ago
- Probing pre-trained source code models ☆15 · Updated 2 years ago
- ☆111 · Updated 2 years ago
- This repository is the replication package of the ICSE22 paper "FIRA: Fine-Grained Graph-Based Code Change Representation for Automated C… ☆31 · Updated 2 years ago
- [ICSE 2021] - InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees ☆89 · Updated 3 years ago
- Vul4J: A Dataset of Reproducible Java Vulnerabilities ☆75 · Updated 4 months ago
- For our ISSTA23 paper "How Effective are Neural Networks for Fixing Security Vulnerabilities?" by Yi Wu, Nan Jiang, Hung Viet Pham, Thiba… ☆29 · Updated last year
- Code and dataset for paper C4: Contrastive Cross-Language Code Clone Detection ☆25 · Updated 2 years ago
- An implementation of the ACL 2024 Findings paper "Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tu… ☆23 · Updated 7 months ago
- For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan ☆59 · Updated 3 months ago
- Bugs.jar: A Large-scale, Diverse Dataset of Bugs for Java Program Repair ☆52 · Updated 6 years ago
- A Repository of Real, Recent Java Bugs ☆13 · Updated 2 weeks ago
- ☆49 · Updated 2 years ago
- ☆15 · Updated 9 months ago
- Refactory: Re-factoring based Program Repair applied to Programming Assignments ☆38 · Updated 2 years ago
- This repository is to support contributions for tools and new data entries for the D2A dataset hosted in DAX ☆65 · Updated 2 years ago
- ☆29 · Updated 3 years ago
- ☆24 · Updated 2 years ago
- ☆55 · Updated 2 years ago
- VulRepair: A T5-Based Automated Software Vulnerability Repair ☆70 · Updated last year
- For our ICSE23 paper "KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair" by Nan Jiang, Thibaud Lutellier, Yiling… ☆30 · Updated last year
- ☆45 · Updated 2 years ago
- Replication package for "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection", ICSE 2024. ☆51 · Updated 3 months ago
- Code of our paper Applying CodeBERT for Automated Program Repair of Java Simple Bugs which is accepted to MSR 2021. ☆51 · Updated 2 years ago
- Hoppity ☆59 · Updated 4 years ago
- An Extensible Java Bug Benchmark for Automatic Program Repair Studies ☆33 · Updated 10 months ago
- MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset ☆57 · Updated this week
- ☆26 · Updated 11 months ago