zouharvi / subset2evaluateLinks
Find informative examples to efficiently (human)-evaluate NLG models.
☆11Updated this week
Alternatives and similar repositories for subset2evaluate
Users that are interested in subset2evaluate are comparing it to the libraries listed below
Sorting:
- ☆28Updated 6 months ago
- ☆39Updated 3 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Updated 3 years ago
- ☆52Updated 3 years ago
- ☆9Updated 2 years ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation☆15Updated 2 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆23Updated last month
- A software for transferring pre-trained English models to foreign languages☆18Updated 2 years ago
- The geometry of multilingual language model representations (EMNLP 2022).☆21Updated 2 years ago
- A framework for evaluating Machine Translation models.☆9Updated last week
- Tool to perform paired evaluation of automatic systems☆12Updated 3 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆17Updated 2 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Updated 3 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Updated 4 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Updated 2 years ago
- ☆35Updated 3 years ago
- Repository for DISRPT2023 shared task☆17Updated 10 months ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆25Updated 2 years ago
- Appraise code used as part of WMT21 human evaluation campaign☆24Updated 3 months ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆82Updated 4 years ago
- Pretraining scripts for BART transformer model☆11Updated 2 years ago
- ☆15Updated 3 years ago
- ☆23Updated last year
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"☆28Updated 3 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆29Updated 2 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- ☆20Updated 5 months ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Updated 3 years ago
- ☆89Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆83Updated last year