luismsgomes / mosestokenizer
☆20Updated 2 years ago
Related projects: ⓘ
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- ☆23Updated last year
- A library of translation-based text similarity measures☆25Updated 9 months ago
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆39Updated 2 years ago
- ☆28Updated 3 months ago
- ☆21Updated 5 months ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆20Updated last year
- Statistics on multilingual datasets☆17Updated 2 years ago
- ☆21Updated 2 years ago
- Pretraining scripts for BART transformer model☆11Updated last year
- ☆16Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit☆101Updated last month
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019)☆13Updated 2 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- Scripts for document-level grammatical error correction.☆16Updated 3 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 2 years ago
- GMEG☆29Updated 2 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆33Updated last year
- A repository for experiments in quality-aware decoding☆14Updated 2 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆78Updated 3 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆80Updated 3 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated last month
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei…☆31Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- ParCourE - Parallel Corpus Explorer☆12Updated 2 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Updated 3 years ago
- ☆21Updated 3 years ago
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.☆10Updated 2 years ago
- ☆19Updated last year