Repository for DEMETR: Diagnosing Evaluation Metrics for Translation
☆17Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for demetr
Users that are interested in demetr are comparing it to the libraries listed below.
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20May 14, 2022Updated 4 years ago
- ☆29Dec 2, 2024Updated last year
- ☆35Dec 17, 2025Updated 5 months ago
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆17Nov 20, 2025Updated 6 months ago
- ☆24Apr 2, 2024Updated 2 years ago
- ☆12Sep 1, 2021Updated 4 years ago
- Building and Using A Seed Corpus for the Human Language Project☆11Feb 9, 2018Updated 8 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper☆18Aug 19, 2023Updated 2 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆130Apr 23, 2026Updated last month
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 3 years ago
- ☆18Mar 20, 2019Updated 7 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- ☆21Dec 8, 2022Updated 3 years ago
- [ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT☆91Oct 14, 2025Updated 7 months ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2…☆21Jul 19, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆45Aug 10, 2024Updated last year
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a…☆46Jul 30, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆139Aug 2, 2023Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆12Mar 18, 2023Updated 3 years ago
- To help search, filter, and download papers from 'acl anthology' (https://aclanthology.org/).☆18Sep 12, 2024Updated last year
- ☆12Jan 30, 2023Updated 3 years ago
- ☆47Mar 25, 2025Updated last year
- human_detectors hosts the data released from the paper "People who frequently use ChatGPT for writing tasks are accurate and robust detec…☆51May 9, 2025Updated last year
- ☆54Oct 24, 2024Updated last year
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 8 months ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 3 years ago
- ☆12Jul 15, 2024Updated last year
- Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]☆27Oct 3, 2025Updated 7 months ago
- ☆22Sep 19, 2023Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Story understanding and plot analysis pilot.☆11Dec 27, 2022Updated 3 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Jul 29, 2022Updated 3 years ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"☆12Jun 11, 2025Updated 11 months ago