microsoft / near-duplicate-code-detector
A simple tool for detecting near-duplicate source code
☆96Updated 3 months ago
Alternatives and similar repositories for near-duplicate-code-detector:
Users that are interested in near-duplicate-code-detector are comparing it to the libraries listed below
- Code for "Generative Code Modeling with Graphs" (ICLR'19)☆170Updated 2 years ago
- Utilities used by the Deep Program Understanding team☆102Updated last year
- A toolkit for pre-processing large source code corpora☆46Updated 2 years ago
- Your library for dynamic language modeling☆67Updated 6 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆83Updated last year
- Web queries dataset for code search☆31Updated last year
- evaluation dataset consisting of natural language query and code snippet pairs☆123Updated 8 months ago
- Code for the paper "A Structural Model for Contextual Code Changes"☆28Updated last year
- CD4Py: Code De-Duplication for Python☆22Updated 4 years ago
- Hosts our tool for mining simple "stupid'' bugs (SStuBs).☆35Updated 2 years ago
- ☆49Updated 2 years ago
- 58069 Java source code diffs. http://arxiv.org/pdf/1807.03200☆92Updated 5 years ago
- Set of tools to help working with "Big Code"☆43Updated 2 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆83Updated 3 years ago
- ☆50Updated 4 years ago
- A Typescript library for parsing Python 3 and doing basic program analysis, like forming control-flow graphs and def-use chains.☆53Updated 5 years ago
- A Python 3 module that provides functions for splitting identifiers found in source code files.☆48Updated 2 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆289Updated 5 months ago
- Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations☆61Updated 3 years ago
- Neural Code Translator provides instructions, datasets, and a deep learning infrastructure (based on seq2seq) that aims at learning code …☆39Updated 5 years ago
- Program error cause finder for C# - research tool☆17Updated 3 years ago
- Automatic Repair Framework that abstract repair tools and bug benchmarks☆69Updated last year
- AVATAR: Fixing Semantic Bugs with Fix Patterns of Static Analysis Violations☆25Updated 3 years ago
- NL2Type: Inferring JavaScript Function Types from Natural Language Information☆23Updated 5 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact☆33Updated 4 years ago
- the code for three models introduced in DYNAMIC NEURAL PROGRAM EMBEDDINGS FOR PROGRAM REPAIR (ICLR 18)☆32Updated 6 years ago
- Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project.☆70Updated 2 years ago
- Sequence-to-Sequence Learning for End-to-End Program Repair (IEEE TSE 2019). Open-science repo. http://arxiv.org/pdf/1901.01808☆82Updated last year
- Source code and data about our large scale study about Java annotaion in practice☆12Updated last year
- ☆13Updated 3 years ago