microsoft / DeFacto
DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization
☆30 · Updated 2 years ago
Alternatives and similar repositories for DeFacto:
Users who are interested in DeFacto are comparing it to the repositories listed below.
- Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021 ☆9 · Updated 3 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning ☆60 · Updated last year
- ☆22 · Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆32 · Updated 2 years ago
- ☆22 · Updated 2 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics. ☆36 · Updated 2 years ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators ☆24 · Updated last year
- ☆25 · Updated 2 years ago
- Generative Retrieval Transformer ☆28 · Updated last year
- ☆15 · Updated 3 years ago
- Code for our BlackboxNLP'20 paper "BERTnesia: Investigating the capture and forgetting of knowledge in BERT" ☆9 · Updated 3 years ago
- ☆38 · Updated last year
- ☆35 · Updated last year
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting ☆17 · Updated 3 years ago
- Knowledge Infused Decoding ☆71 · Updated last year
- PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialog… ☆27 · Updated 3 years ago
- ☆48 · Updated 2 years ago
- Website for release of TellMeWhy dataset for why question answering ☆14 · Updated 2 years ago
- Query-focused summarization data ☆41 · Updated 2 years ago
- ☆33 · Updated last year
- Evaluating Machines by their Real-World Language Use ☆33 · Updated last year
- Scripts to process & score QE predictions into WMT format. ☆9 · Updated 3 years ago
- Codebase for public release of the plug-and-blend framework. ☆22 · Updated 3 years ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560 ☆58 · Updated last month
- ☆25 · Updated last year
- ☆17 · Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 · Updated 8 months ago
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering". ☆18 · Updated 2 years ago
- ☆32 · Updated last month
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering ☆16 · Updated 2 years ago