JoaoFelipe / apted
Python APTED algorithm for the Tree Edit Distance
☆91 · Updated 7 years ago
Alternatives and similar repositories for apted:
Users who are interested in apted compare it to the libraries listed below.
- APTED algorithm for the Tree Edit Distance ☆120 · Updated 7 years ago
- Tree edit distance using the Zhang Shasha algorithm ☆447 · Updated 4 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact ☆33 · Updated 4 years ago
- Django Dataset for Code Translation Tasks ☆31 · Updated 7 years ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation" ☆96 · Updated 2 years ago
- C# Data Extraction for "Learning to Represent Edits" ☆26 · Updated 6 years ago
- Code for "CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning" (WWW 2019) ☆36 · Updated 5 years ago
- super fast cpp implementation of longest common subsequence/substring ☆67 · Updated last year
- Re-implementation of "CODE2SEQ: GENERATING SEQUENCES FROM STRUCTURED REPRESENTATIONS OF CODE" ☆45 · Updated 8 months ago
- ☆19 · Updated 2 years ago
- Baseline for the Conala: Code/Natural Language Challenge ☆60 · Updated 3 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie… ☆83 · Updated 2 years ago
- Lyra: A Benchmark for Turducken-Style Code Generation ☆15 · Updated 2 years ago
- A Pre-trained BERT on StackOverflow Corpus ☆47 · Updated 4 years ago
- VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning ☆38 · Updated 2 years ago
- A plugin for code generation in PyCharm/IntelliJ using tranX ☆35 · Updated 2 years ago
- ☆22 · Updated 5 years ago
- Code for the ICLR 2019 paper "Learning to Represent Edits" ☆12 · Updated 2 years ago
- A toolkit for pre-processing large source code corpora ☆47 · Updated 2 years ago
- Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21) ☆41 · Updated 3 years ago
- Models and datasets for annotated code search. ☆35 · Updated last year
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy… ☆169 · Updated 3 years ago
- Mapping Language to Code in a Programmatic Context ☆80 · Updated 4 years ago
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs) ☆58 · Updated 3 years ago
- ☆44 · Updated 2 years ago
- Web queries dataset for code search ☆32 · Updated last year
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning ☆165 · Updated 3 years ago
- Code for generating the JuICe dataset. ☆37 · Updated 3 years ago
- Extracting six domain-specific QA datasets from MS MARCO ☆17 · Updated 5 years ago
- Code for the paper "A Structural Model for Contextual Code Changes" ☆31 · Updated last year