Yale-LILY / AutoACU
☆11Updated 2 months ago
Related projects:
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆41Updated last month
- ☆10Updated 9 months ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆17Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆38Updated 9 months ago
- ☆39Updated last year
- ☆32Updated last year
- https://arxiv.org/abs/2404.10917☆11Updated 3 months ago
- ☆55Updated last year
- FaVIQ: Fact Verification from Information-seeking Questions☆43Updated last year
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- ☆50Updated 2 years ago
- ☆14Updated last year
- The TechQA dataset -- http://ibm.biz/Tech_QA☆19Updated last year
- ☆33Updated last year
- Repository for the CODAH dataset☆22Updated last year
- ☆45Updated last year
- ☆14Updated 11 months ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆29Updated last year
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark☆51Updated last year
- An open source toolkit for multimodal generative conversational task assistants, helping assist people with real-world complex tasks☆35Updated 3 months ago
- Data and code for the SciFact-Open task☆24Updated 9 months ago
- ☆33Updated 3 weeks ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models to Understand The Flow of Events"☆11Updated 3 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Updated last year
- ☆57Updated 2 years ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆25Updated 3 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- ☆30Updated 2 years ago