JunjieHu / dali
Domain Adaptation of Neural Machine Translation by Lexicon Induction
☆20 Updated 5 years ago
Alternatives and similar repositories for dali:
Users who are interested in dali are comparing it to the repositories listed below:
- ☆28 Updated 7 months ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆39 Updated 4 years ago
- ☆24 Updated 2 years ago
- Python code for training models in the ACL paper, "Beyond BLEU: Training Neural Machine Translation with Semantic Similarity". ☆52 Updated 5 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020" ☆23 Updated 3 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources ☆25 Updated 5 years ago
- ☆20 Updated 4 years ago
- ☆22 Updated 4 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation ☆17 Updated 4 years ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets … ☆14 Updated 3 years ago
- ☆33 Updated 3 years ago
- Feature Decay Algorithms ☆11 Updated 10 years ago
- Dynamic data selection for neural machine translation ☆20 Updated 6 years ago
- Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear project… ☆16 Updated 3 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search ☆92 Updated 6 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus ☆46 Updated 6 years ago
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020. ☆23 Updated 3 years ago
- NMT domain adaptation papers (updating...) ☆17 Updated 5 years ago
- Source code for the AAAI 2020 long paper <Modeling Fluency and Faithfulness for Diverse Neural Machine Translation>. ☆19 Updated 4 years ago
- Larger-Context NMT ☆12 Updated 7 years ago
- Multilingual Quality Estimation and Automatic Post-editing Dataset ☆40 Updated 2 years ago
- Terminology Dataset ☆23 Updated 4 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework ☆52 Updated 4 years ago
- ☆33 Updated 4 years ago
- Scripts to preprocess training and test data and to run fast_align and giza ☆108 Updated 3 years ago
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation" ☆47 Updated 2 years ago
- ☆20 Updated last year
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu… ☆23 Updated 3 years ago
- Improved ParaBank Rewriter ☆22 Updated 4 years ago