tamohannes / urartu
π¦ An open-source NLP framework that offers high-level wrappers designed for effortless launch, enhanced reproducibility, superior control, and unmatched flexibility for your experiments.
β11Updated this week
Alternatives and similar repositories for urartu
Users that are interested in urartu are comparing it to the libraries listed below
Sorting:
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (hβ¦β82Updated 4 years ago
- β43Updated 2 years ago
- β10Updated 5 years ago
- β15Updated 2 years ago
- Split bib files for anthology bibliography for overleafβ10Updated 8 months ago
- This project collects methods that enhance the comparison between AMR graphs.β16Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.β143Updated 2 years ago
- β39Updated 3 years ago
- The geometry of multilingual language model representations (EMNLP 2022).β20Updated 2 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper heβ¦β23Updated 2 months ago
- β26Updated 2 years ago
- β59Updated last year
- β100Updated 2 years ago
- β89Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACLβ93Updated 3 months ago
- β20Updated 5 months ago
- β31Updated 3 months ago
- Crosslingual Reasoning through Test-Time Scalingβ14Updated this week
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021β14Updated 4 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.β10Updated this week
- β58Updated 3 years ago
- FRANK: Factuality Evaluation Benchmarkβ55Updated 2 years ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisationβ¦β10Updated 2 years ago
- Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systemsβ16Updated 3 years ago
- β29Updated 2 years ago
- Heuristic Analysis for NLI Systemsβ125Updated 4 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.β54Updated 3 years ago
- β97Updated last year
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translationβ15Updated 2 years ago
- β27Updated 5 months ago