google / wmt-mqm-human-evaluation
☆91 Updated last year
Alternatives and similar repositories for wmt-mqm-human-evaluation
Users that are interested in wmt-mqm-human-evaluation are comparing it to the libraries listed below
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models". ☆58 Updated 5 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆115 Updated 6 months ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory ☆82 Updated 2 years ago
- A repository with the code related to experiments around context-aware machine translation ☆51 Updated 3 years ago
- ☆25 Updated 2 years ago
- ☆21 Updated 3 years ago
- Tools for formatting WMT hypothesis and test sets in XML ☆27 Updated 5 months ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric … ☆79 Updated last year
- ☆86 Updated 2 years ago
- ☆20 Updated 2 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive… ☆22 Updated 2 years ago
- ☆23 Updated 2 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco… ☆36 Updated 3 weeks ago
- ☆20 Updated 4 years ago
- ☆33 Updated 3 years ago
- ☆45 Updated 4 years ago
- ☆24 Updated 2 years ago
- ☆38 Updated 4 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance" ☆25 Updated 2 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020" ☆23 Updated 4 years ago
- ☆43 Updated 2 years ago
- ☆15 Updated 2 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 ☆61 Updated 4 years ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation ☆26 Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization" ☆134 Updated 2 years ago
- GEMBA — GPT Estimation Metric Based Assessment ☆124 Updated last year
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian ☆23 Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark ☆59 Updated 2 years ago
- ☆28 Updated 9 months ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization ☆76 Updated 4 years ago