saltudelft / CD4PyLinks
CD4Py: Code De-Duplication for Python
☆24Updated 4 years ago
Alternatives and similar repositories for CD4Py
Users that are interested in CD4Py are comparing it to the libraries listed below
Sorting:
- ☆15Updated 3 years ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.☆22Updated 3 years ago
- Type4Py: Deep Similarity Learning-Based Type Inference for Python☆63Updated last year
- Semantic Code Search☆35Updated 2 years ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆16Updated 2 years ago
- ☆23Updated 2 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15Updated 3 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆88Updated 3 years ago
- Stuff related to scraping the Code Review StackExchange☆11Updated 2 years ago
- Models and datasets for annotated code search.☆35Updated 2 years ago
- code and data for paper "BASHEXPLAINER: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT", which accepted in…☆12Updated 2 years ago
- Web queries dataset for code search☆32Updated 2 years ago
- C# Data Extraction for "Learning to Represent Edits"☆26Updated 6 years ago
- Incremental Python parser for constrained generation of code by LLMs.☆16Updated 8 months ago
- A dataset for natural language code search.☆14Updated 5 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Updated 4 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆16Updated 4 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Updated 3 years ago
- ☆10Updated 4 years ago
- BLANCA - Benchmarks for LANguage models on Coding Artifacts☆9Updated 3 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆17Updated last year
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆55Updated last year
- A Python 3 module that provides functions for splitting identifiers found in source code files.☆48Updated 2 years ago
- AVATAR: Fixing Semantic Bugs with Fix Patterns of Static Analysis Violations☆28Updated 4 years ago
- Code for "Deep Graph Matching and Searching for Semantic Code Retrieval"☆24Updated 3 years ago
- This repository contains an implementation for design patterns detection. In this task, feature engineering and ensemble learning are app…☆10Updated 2 years ago
- DataSet and source code for PyART☆11Updated 2 years ago
- Official implementation of our work, 'GypSum: Learning Hybrid Representations for Code Summarization'.☆14Updated 3 years ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆30Updated 10 months ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact☆33Updated 4 years ago