naist-nlp / mbrs
A library for minimum Bayes risk (MBR) decoding
☆37Updated last month
Alternatives and similar repositories for mbrs:
Users that are interested in mbrs are comparing it to the libraries listed below
- Fairseq tutorial☆17Updated 2 years ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆11Updated last year
- ☆23Updated last year
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆33Updated last year
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Updated 2 years ago
- ☆13Updated 4 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- ☆23Updated 8 months ago
- Python package to augment multilingual data☆14Updated 2 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- ☆28Updated 9 months ago
- ☆20Updated 2 years ago
- A repository for experiments in quality-aware decoding☆15Updated 2 years ago
- A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT☆26Updated 4 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆81Updated 9 months ago
- Curriculum training☆17Updated 3 weeks ago
- ☆44Updated 4 years ago
- ☆22Updated 3 years ago
- ☆25Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- ☆20Updated 4 years ago
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Updated 7 months ago
- A simple library for querying the URIEL typological database.☆88Updated 11 months ago
- Tools for formatting WMT hypothesis and test sets in XML☆25Updated 2 weeks ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Updated 4 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Updated last year