AppraiseDev/Appraise

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AppraiseDev/Appraise)

AppraiseDev / Appraise

Appraise code used as part of WMT21 human evaluation campaign

☆30

Alternatives and similar repositories for Appraise

Users that are interested in Appraise are comparing it to the libraries listed below

Sorting:

cfedermann / Appraise
View on GitHub
Appraise evaluation system for manual evaluation of machine translation output
☆77May 7, 2021Updated 4 years ago
AppraiseDev / OCELoT
View on GitHub
Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations
☆23Nov 5, 2025Updated 3 months ago
BramVanroy / mateo-demo
View on GitHub
MAchine Translation Evaluation Online (MATEO)
☆25Jun 2, 2025Updated 8 months ago
ymoslem / MT-Tools
View on GitHub
Collection of Common Machine Translation Tools
☆11Jul 26, 2022Updated 3 years ago
wmt-conference / wmt22-news-systems
View on GitHub
☆21Feb 13, 2023Updated 3 years ago
wmt-conference / wmt21-news-systems
View on GitHub
☆26Jan 9, 2023Updated 3 years ago
Unbabel / word-level-qe-corpus-builder
View on GitHub
Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.
☆10Sep 19, 2022Updated 3 years ago
YerevaNN / PARASITE
View on GitHub
🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…
☆11Jun 8, 2021Updated 4 years ago
eval4nlp / SharedTask2021
View on GitHub
☆17Nov 23, 2021Updated 4 years ago
Helsinki-NLP / OpusFilter
View on GitHub
OpusFilter - Parallel corpus processing toolkit
☆115Feb 11, 2026Updated 2 weeks ago
TharinduDR / TransQuest
View on GitHub
Transformer based translation quality estimation
☆114Jul 20, 2023Updated 2 years ago
ZurichNLP / ContraDecode
View on GitHub
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…
☆36Aug 29, 2025Updated 6 months ago
zouharvi / subset2evaluate
View on GitHub
Find informative examples to efficiently (human)-evaluate NLG models.
☆18Feb 9, 2026Updated 2 weeks ago
MicrosoftTranslator / ToShipOrNotToShip
View on GitHub
☆20Dec 16, 2024Updated last year
dayeonki / mt_feedback
View on GitHub
(NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
☆15Apr 14, 2025Updated 10 months ago
Unbabel / OpenKiwi
View on GitHub
Open-Source Machine Translation Quality Estimation in PyTorch
☆232Jun 23, 2022Updated 3 years ago
sheffieldnlp / deepQuest
View on GitHub
Framework for neural-based Quality Estimation
☆41Sep 23, 2020Updated 5 years ago
sheffieldnlp / mlqe-pe
View on GitHub
Multilingual Quality Estimation and Automatic Post-editing Dataset
☆42Mar 24, 2022Updated 3 years ago
amazon-science / contrastive-controlled-mt
View on GitHub
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
☆22May 24, 2023Updated 2 years ago
bitextor / bicleaner
View on GitHub
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆160Jun 18, 2024Updated last year
mahfuzibnalam / terminology_evaluation
View on GitHub
☆21May 30, 2022Updated 3 years ago
luismsgomes / mosestokenizer
View on GitHub
☆20Oct 22, 2021Updated 4 years ago
BH-So / unsupervised-paraphrase-generation
View on GitHub
"Unsupervised Paraphrase Generation using Pre-trained Language Model."
☆22Aug 28, 2020Updated 5 years ago
rwth-i6 / CharacTER
View on GitHub
☆23Feb 4, 2020Updated 6 years ago
google / wmt-mqm-human-evaluation
View on GitHub
☆98Sep 25, 2025Updated 5 months ago
amazon-science / doc-mt-metrics
View on GitHub
☆26Jul 30, 2024Updated last year
Unbabel / COMET
View on GitHub
A Neural Framework for MT Evaluation
☆717Feb 5, 2026Updated 3 weeks ago
deep-spin / UA_COMET
View on GitHub
Repository for "Uncertainty-Aware Machine Translation Evaluation", accepted to Findings of EMNLP 2021.
☆34Sep 22, 2021Updated 4 years ago
ondrejklejch / MT-ComparEval
View on GitHub
Tool for comparison and evaluation of machine translation.
☆56May 17, 2022Updated 3 years ago
chikiulo / yisi
View on GitHub
YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources
☆26May 28, 2019Updated 6 years ago
thompsonb / vecalign
View on GitHub
Improved Sentence Alignment in Linear Time and Space
☆192Mar 6, 2023Updated 2 years ago
Genius1237 / TyDiP
View on GitHub
TyDiP Multilingual Politeness dataset and code
☆12Oct 15, 2023Updated 2 years ago
MicrosoftTranslator / GEMBA
View on GitHub
GEMBA — GPT Estimation Metric Based Assessment
☆146Dec 15, 2025Updated 2 months ago
boknilev / nmt-repr-analysis
View on GitHub
☆38Apr 23, 2019Updated 6 years ago
AIPHES / DiscoScore
View on GitHub
DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
☆36Jul 25, 2023Updated 2 years ago
EleanorJiang / BlonDe
View on GitHub
Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …
☆82Sep 21, 2023Updated 2 years ago
MicrosoftTranslator / NTREX
View on GitHub
NTREX -- News Test References for MT Evaluation
☆88Jun 5, 2024Updated last year
snover / terp
View on GitHub
TER-plus Machine Translation metric.
☆31May 23, 2022Updated 3 years ago
Shaddadi / veritex
View on GitHub
☆12Jun 18, 2024Updated last year