Utilities used by the Deep Program Understanding team
☆104Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for dpu-utils
Users that are interested in dpu-utils are comparing it to the libraries listed below
Sorting:
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- Code for "Generative Code Modeling with Graphs" (ICLR'19)☆172Dec 8, 2022Updated 3 years ago
- the code for three models introduced in DYNAMIC NEURAL PROGRAM EMBEDDINGS FOR PROGRAM REPAIR (ICLR 18)☆33Jun 30, 2018Updated 7 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Aug 19, 2020Updated 5 years ago
- An IntelliJ IDEA plugin that allows to get suggestions for better method names☆10Dec 4, 2019Updated 6 years ago
- ☆10Aug 25, 2020Updated 5 years ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.☆22Dec 22, 2021Updated 4 years ago
- Set of tools to help working with "Big Code"☆42Apr 28, 2022Updated 3 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Jun 17, 2022Updated 3 years ago
- Code to reproduce the experiments in the paper Open Vocabulary Learning on Source Code with a Graph-Structured Cache☆21Apr 15, 2019Updated 6 years ago
- Sequence-to-Sequence Learning for End-to-End Program Repair (IEEE TSE 2019). Open-science repo. http://arxiv.org/pdf/1901.01808☆86Jun 9, 2023Updated 2 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆85May 10, 2021Updated 4 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Oct 17, 2019Updated 6 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Feb 7, 2025Updated last year
- ☆50Feb 12, 2020Updated 6 years ago
- ☆45Jun 22, 2022Updated 3 years ago
- Implementation of 'A Convolutional Attention Network for Extreme Summarization of Source Code'☆15Mar 14, 2019Updated 6 years ago
- Deep learning code semantic similarity☆66Jun 11, 2019Updated 6 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆87Apr 5, 2022Updated 3 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Oct 4, 2022Updated 3 years ago
- A javac plugin for extracting a feature graph for plugging in to machine learning models☆28Jan 20, 2021Updated 5 years ago
- A Tool for Mining Rich Abstract Syntax Trees from Code☆61Oct 24, 2025Updated 4 months ago
- A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Eva…☆32Aug 11, 2020Updated 5 years ago
- CD4Py: Code De-Duplication for Python☆23Dec 13, 2020Updated 5 years ago
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…☆66Dec 3, 2021Updated 4 years ago
- Tracking events, CfPs, abstracts, slides, and all other even related things☆22Oct 4, 2019Updated 6 years ago
- 🔍 Code Search Tools & Experiments☆12Updated this week
- Code search model based the self-attention☆12Oct 16, 2020Updated 5 years ago
- DeepBugs is a framework for learning bug detectors from an existing code corpus.☆152Apr 7, 2021Updated 4 years ago
- Tree-based Autofolding Software Summarization Algorithm☆43Jul 30, 2016Updated 9 years ago
- Code for "CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning" (WWW 2019)☆37Apr 21, 2020Updated 5 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- CodRep 2019 edition.☆20Nov 12, 2019Updated 6 years ago
- A tool for mining commits from Git repositories and diffs to automatically extract code change pattern instances and features with ast a…☆98Nov 13, 2024Updated last year
- Finding similar repositories on GitHub☆51Jan 12, 2023Updated 3 years ago