allenai / open-mds
The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an issue if you run into any trouble!
☆32Updated last year
Alternatives and similar repositories for open-mds:
Users that are interested in open-mds are comparing it to the libraries listed below
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆19Updated 2 years ago
- ☆28Updated last year
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Updated last year
- ☆45Updated 3 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆39Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆27Updated last year
- Schema2QA Question Answering Dataset☆18Updated 2 years ago
- ☆33Updated last year
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆19Updated last year
- ☆53Updated 3 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Updated 3 years ago
- Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles☆43Updated 10 months ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Updated last year
- ☆54Updated 2 years ago
- Multidocument Summarization for Literature Review Shared Task 2022☆29Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆19Updated 2 years ago
- ☆38Updated last year
- ☆31Updated 3 years ago
- ☆77Updated last year
- Dense hybrid representations for text retrieval☆62Updated 2 years ago
- Submission archive for the MS MARCO passage ranking leaderboard☆13Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆68Updated this week
- ☆39Updated 4 months ago
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)☆36Updated last year
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆67Updated 3 years ago
- ☆86Updated last month