JoaoFelipe / apted
Python APTED algorithm for the Tree Edit Distance
☆100 Updated 8 years ago
Alternatives and similar repositories for apted
Users that are interested in apted are comparing it to the libraries listed below
- Tree edit distance using the Zhang Shasha algorithm ☆457 Updated 5 years ago
- Code for generating the JuICe dataset. ☆37 Updated 4 years ago
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension". ☆43 Updated 2 years ago
- Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing ☆42 Updated 5 years ago
- APTED algorithm for the Tree Edit Distance ☆127 Updated 7 years ago
- Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering". ☆46 Updated 3 years ago
- Code and data for "TURL: Table Understanding through Representation Learning" ☆131 Updated 2 months ago
- The unified platform for data-related resources. ☆135 Updated 2 years ago
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs) ☆59 Updated 4 years ago
- Source Code and Data for Software Domain NER ☆147 Updated 3 years ago
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data" ☆243 Updated 2 years ago
- super fast cpp implementation of longest common subsequence/substring ☆72 Updated 2 years ago
- Models and datasets for annotated code search. ☆35 Updated 2 years ago
- DuoRAT is a ServiceNow Research project that was started at Element AI. ☆56 Updated 2 years ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation" ☆97 Updated 4 months ago
- Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification" ☆416 Updated 2 years ago
- Model-based Interactive Semantic Parsing (MISP) framework ☆53 Updated last year
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy… ☆172 Updated 4 years ago
- A dataset of complex questions on semi-structured Wikipedia tables ☆181 Updated 4 years ago
- A Pre-trained BERT on StackOverflow Corpus ☆47 Updated 4 years ago
- "Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020 ☆41 Updated 5 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies. ☆55 Updated 3 years ago
- Source code of the paper "Do Syntax Trees Help Pre-trained Transformers Extract Information?" (EACL 2021) ☆75 Updated 4 years ago
- VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning ☆40 Updated 3 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact ☆33 Updated 5 years ago
- Code for the paper "A Structural Model for Contextual Code Changes" ☆32 Updated 2 years ago
- The Definition Extraction From Text corpus and relevant formatting scripts ☆81 Updated 2 years ago
- Mapping Language to Code in a Programmatic Context ☆80 Updated 5 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach. ☆57 Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020) ☆121 Updated 5 years ago