Utilities used by the Deep Program Understanding team
☆104Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for dpu-utils
Users that are interested in dpu-utils are comparing it to the libraries listed below
Sorting:
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- Code for "Generative Code Modeling with Graphs" (ICLR'19)☆172Dec 8, 2022Updated 3 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- the code for three models introduced in DYNAMIC NEURAL PROGRAM EMBEDDINGS FOR PROGRAM REPAIR (ICLR 18)☆33Jun 30, 2018Updated 7 years ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.☆22Dec 22, 2021Updated 4 years ago
- Set of tools to help working with "Big Code"☆42Apr 28, 2022Updated 3 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- An IntelliJ IDEA plugin that allows to get suggestions for better method names☆10Dec 4, 2019Updated 6 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Aug 19, 2020Updated 5 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆85May 10, 2021Updated 4 years ago
- Implementation of 'A Convolutional Attention Network for Extreme Summarization of Source Code'☆15Mar 14, 2019Updated 7 years ago
- ☆50Feb 12, 2020Updated 6 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Eva…☆32Aug 11, 2020Updated 5 years ago
- Code to reproduce the experiments in the paper Open Vocabulary Learning on Source Code with a Graph-Structured Cache☆21Apr 15, 2019Updated 6 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆292Feb 7, 2025Updated last year
- ☆10Aug 25, 2020Updated 5 years ago
- Babelfish Python client☆17Nov 6, 2019Updated 6 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Jun 17, 2022Updated 3 years ago
- Sequence-to-Sequence Learning for End-to-End Program Repair (IEEE TSE 2019). Open-science repo. http://arxiv.org/pdf/1901.01808☆86Jun 9, 2023Updated 2 years ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆30Feb 5, 2026Updated last month
- Tracking events, CfPs, abstracts, slides, and all other even related things☆22Oct 4, 2019Updated 6 years ago
- CodRep 2019 edition.☆20Nov 12, 2019Updated 6 years ago
- CD4Py: Code De-Duplication for Python☆23Dec 13, 2020Updated 5 years ago
- Finding similar repositories on GitHub☆51Jan 12, 2023Updated 3 years ago
- A javac plugin for extracting a feature graph for plugging in to machine learning models☆28Jan 20, 2021Updated 5 years ago
- A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks.☆340Aug 11, 2023Updated 2 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆87Apr 5, 2022Updated 3 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- DeepBugs is a framework for learning bug detectors from an existing code corpus.☆152Apr 7, 2021Updated 4 years ago
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…☆66Dec 3, 2021Updated 4 years ago
- 58069 Java source code diffs. http://arxiv.org/pdf/1807.03200☆94Jul 21, 2019Updated 6 years ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Oct 17, 2019Updated 6 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Oct 4, 2022Updated 3 years ago
- Source code for the Naturalize project☆56Sep 5, 2015Updated 10 years ago
- A Tool for Mining Rich Abstract Syntax Trees from Code☆62Oct 24, 2025Updated 4 months ago
- sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees☆143May 22, 2019Updated 6 years ago
- evaluation dataset consisting of natural language query and code snippet pairs☆124May 3, 2024Updated last year
- Deep learning code semantic similarity☆66Jun 11, 2019Updated 6 years ago