saltudelft / CD4Py
CD4Py: Code De-Duplication for Python
☆24Updated 4 years ago
Alternatives and similar repositories for CD4Py
Users that are interested in CD4Py are comparing it to the libraries listed below
Sorting:
- ☆15Updated 3 years ago
- ☆23Updated 2 years ago
- Web queries dataset for code search☆32Updated last year
- Type4Py: Deep Similarity Learning-Based Type Inference for Python☆63Updated last year
- Semantic Code Search☆35Updated 2 years ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆55Updated last year
- Flow graphs for Python☆26Updated 2 years ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.☆22Updated 3 years ago
- C# Data Extraction for "Learning to Represent Edits"☆26Updated 6 years ago
- Incremental Python parser for constrained generation of code by LLMs.☆16Updated 7 months ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆16Updated 4 years ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆16Updated 2 years ago
- Models and datasets for annotated code search.☆35Updated last year
- Code for the paper "A Structural Model for Contextual Code Changes"☆32Updated last year
- Stuff related to scraping the Code Review StackExchange☆11Updated 2 years ago
- Fork of the awesome function_parser library from Github's CodeSearchNet Challenge repo: https://github.com/github/CodeSearchNet/tree/mast…☆28Updated 2 years ago
- ☆43Updated 3 months ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆88Updated 3 years ago
- ☆40Updated 4 months ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆15Updated last year
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Updated 4 years ago
- PyTorch library for synthesizing programs from natural language☆18Updated 9 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆23Updated 2 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆84Updated 4 years ago
- Official implementation of our work, 'GypSum: Learning Hybrid Representations for Code Summarization'.☆14Updated 3 years ago
- Fast and robust AST parsing of any language☆39Updated 4 months ago
- ManyTypes4Py: A benchmark Python dataset for machine learning-based type inference☆23Updated 3 years ago
- Utilities used by the Deep Program Understanding team☆102Updated last year
- A toolkit for pre-processing large source code corpora☆47Updated 2 years ago
- ICSE 2021 Artifact for: Shipwright: A Human-in-the-Loop System for Dockerfile Repair.☆22Updated 4 years ago