MicrosoftTranslator / GEMBA
GEMBA — GPT Estimation Metric Based Assessment
☆117 Updated 8 months ago
Alternatives and similar repositories for GEMBA:
Users who are interested in GEMBA are comparing it to the libraries listed below.
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆105 Updated last month
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆71 Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆100 Updated last year
- ☆34 Updated 10 months ago
- Parallel corpora for the biomedical domain ☆48 Updated 9 months ago
- A Multilingual Replicable Instruction-Following Model ☆93 Updated last year
- A tool for calculating character n-gram F-score ☆72 Updated 2 years ago
- A repository with the code related to experiments around context-aware machine translation ☆49 Updated 2 years ago
- ☆84 Updated 7 months ago
- A library of translation-based text similarity measures ☆25 Updated last year
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco… ☆34 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆123 Updated 8 months ago
- NTREX -- News Test References for MT Evaluation ☆83 Updated 10 months ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive… ☆22 Updated 2 years ago
- ☆15 Updated 2 years ago
- ☆23 Updated last year
- ☆20 Updated 2 years ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation ☆15 Updated 2 years ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric … ☆78 Updated last year
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment" ☆18 Updated 2 years ago
- ☆11 Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization" ☆120 Updated last year
- Detect hallucinated tokens for conditional sequence generation. ☆64 Updated 3 years ago
- ☆21 Updated 2 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings. ☆71 Updated 8 months ago
- First explanation metric (diagnostic report) for text generation evaluation ☆61 Updated last month
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 Updated 3 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆68 Updated last year
- How to finetune mbart using fairseq ☆24 Updated 4 years ago
- Tools for formatting WMT hypothesis and test sets in XML ☆25 Updated last week