saltudelft / CD4Py
CD4Py: Code De-Duplication for Python
☆24 Updated 5 years ago
Alternatives and similar repositories for CD4Py
Users who are interested in CD4Py are comparing it to the repositories listed below.
- Type4Py: Deep Similarity Learning-Based Type Inference for Python ☆65 Updated 2 years ago
- ☆23 Updated 2 years ago
- Web queries dataset for code search ☆32 Updated 2 years ago
- Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project. ☆72 Updated 2 years ago
- Semantic Code Search ☆37 Updated 2 years ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so… ☆56 Updated last year
- Finding similar repositories on GitHub ☆51 Updated 2 years ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees. ☆22 Updated 3 years ago
- Utilities used by the Deep Program Understanding team ☆104 Updated 2 years ago
- A Python 3 module that provides functions for splitting identifiers found in source code files. ☆47 Updated 2 years ago
- A toolkit for pre-processing large source code corpora ☆46 Updated 3 years ago
- C# Data Extraction for "Learning to Represent Edits" ☆27 Updated 7 years ago
- Static code analysis package for Python repositories ☆33 Updated 2 years ago
- An implementation of "code2vec: Learning Distributed Representations of Code" ☆30 Updated last year
- Code for "Deep Graph Matching and Searching for Semantic Code Retrieval" ☆24 Updated 4 years ago
- ☆15 Updated 4 years ago
- A benchmark for evaluating embeddings of identifiers in source code. ☆22 Updated 4 years ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching ☆18 Updated 3 years ago
- Models and datasets for annotated code search. ☆35 Updated 2 years ago
- Code for the paper "A Structural Model for Contextual Code Changes" ☆32 Updated 2 years ago
- Evaluation of source authorship attribution tool ☆23 Updated 4 years ago
- A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Eva… ☆32 Updated 5 years ago
- Transformer-based approaches for efficient docstring generation for Python code. ☆17 Updated 4 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W… ☆87 Updated 3 years ago
- ☆10 Updated 5 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f… ☆22 Updated 5 years ago
- Stuff related to scraping the Code Review StackExchange ☆12 Updated 2 years ago
- A light-weight, extendable, high level, universal code parser built on top of tree-sitter ☆130 Updated 4 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT) ☆25 Updated 2 years ago
- ☆18 Updated 4 years ago