moussaKam / FrugalScore
FrugalScore is an approach to learn a fixed, low-cost version of any expensive NLG metric while retaining most of its original performance.
☆14 Updated 2 years ago
Alternatives and similar repositories for FrugalScore:
Users who are interested in FrugalScore are comparing it to the repositories listed below.
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆41 Updated last year
- ☆33 Updated last year
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources" ☆20 Updated 4 years ago
- ☆48 Updated 2 years ago
- ☆17 Updated last year
- A repository for experiments in quality-aware decoding ☆15 Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 Updated 2 years ago
- ☆44 Updated last year
- ☆38 Updated last year
- Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022 ☆96 Updated 11 months ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 Updated 7 months ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆30 Updated last year
- PERIN is a Permutation-Invariant Semantic Parser developed for MRP 2020 ☆45 Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… ☆30 Updated 2 years ago
- Efficient Memory-Augmented Transformers ☆34 Updated 2 years ago
- ☆44 Updated 3 years ago
- ☆58 Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark ☆54 Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain cause-effect relationships. It is a QA dataset containing 9000… ☆47 Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑 ☆14 Updated 11 months ago
- ☆71 Updated 3 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization ☆71 Updated 3 years ago
- Official code repository for "Exploring Neural Models for Query-Focused Summarization". ☆50 Updated last year
- ☆25 Updated last year
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering ☆45 Updated 2 years ago
- A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering ☆44 Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆91 Updated 2 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated 2 years ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene… ☆26 Updated last year
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20… ☆31 Updated 3 years ago