EdinburghNLP / opus-100-corpus
☆91Updated last year
Alternatives and similar repositories for opus-100-corpus:
Users who are interested in opus-100-corpus are comparing it to the libraries listed below
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆41Updated 3 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Python code for training models in the ACL paper, "Beyond BLEU: Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- ☆28Updated 8 months ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 4 years ago
- a tool for calculating character n-gram F score☆69Updated 2 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- ☆22Updated 4 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated 2 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆24Updated 3 years ago
- ☆20Updated 4 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources☆25Updated 5 years ago
- Terminology Dataset☆23Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- A repository with the code related to experiments around context-aware machine translation☆48Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- ☆24Updated 2 years ago
- ☆36Updated 2 years ago
- English-French MT dialogue dataset☆17Updated 2 years ago
- Code for "Simulated Multiple Reference Training Improves Low-Resource Machine Translation"☆15Updated 4 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 5 years ago
- ☆36Updated 3 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated 6 months ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Updated 2 years ago
- Code and resources for evaluating cross-lingual embedding spaces☆28Updated 4 years ago