allenai / open-mds
The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an issue if you run into any trouble!
☆32 · Updated last year
Alternatives and similar repositories for open-mds:
Users who are interested in open-mds are comparing it to the libraries listed below.
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits ☆19 · Updated 2 years ago
- ☆38 · Updated last year
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 · Updated 2 years ago
- ☆33 · Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022 ☆29 · Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning… ☆22 · Updated 3 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22 ☆19 · Updated last year
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021 ☆29 · Updated 2 years ago
- ☆45 · Updated 3 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources" ☆20 · Updated 4 years ago
- ☆11 · Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 · Updated 8 months ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", … ☆33 · Updated 2 years ago
- ☆31 · Updated 3 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization ☆55 · Updated last year
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022) ☆76 · Updated 2 years ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi… ☆104 · Updated last year
- Schema2QA Question Answering Dataset ☆18 · Updated 2 years ago
- ☆54 · Updated 2 years ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆31 · Updated last year
- ☆28 · Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆27 · Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 · Updated 3 years ago
- Dense hybrid representations for text retrieval ☆62 · Updated 2 years ago
- ☆38 · Updated 4 months ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations… ☆19 · Updated 2 years ago
- ☆77 · Updated 11 months ago
- This repository contains the code for "How many data points is a prompt worth?" ☆48 · Updated 4 years ago
- ☆38 · Updated 2 years ago
- ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences ☆28 · Updated last year