ccasimiro88 / TranslateAlignRetrieve
Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.
☆59Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TranslateAlignRetrieve
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆69Updated 3 years ago
- Transformer based translation quality estimation☆107Updated last year
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 5 months ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆39Updated 5 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆156Updated last month
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated last year
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆92Updated 2 years ago
- ☆181Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆158Updated last year
- ☆36Updated 2 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆114Updated 3 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆32Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆101Updated 2 years ago
- ☆73Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆81Updated 3 years ago
- ☆16Updated last year
- Code to reproduce the experiments from the paper.☆101Updated last year
- cLang-8 is a dataset for grammatical error correction.☆102Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆97Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆79Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆60Updated 2 years ago
- ☆12Updated 3 years ago
- ☆35Updated 2 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago