allenai / open-mds
The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an issue if you run into any trouble!
☆31Updated last year
Alternatives and similar repositories for open-mds:
Users that are interested in open-mds are comparing it to the libraries listed below
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆18Updated 2 years ago
- ☆55Updated 2 years ago
- ☆53Updated 3 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆18Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https …☆43Updated 5 months ago
- ☆33Updated last year
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆18Updated 2 years ago
- ☆36Updated last year
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)☆34Updated last year
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆75Updated last year
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆50Updated last year
- ☆77Updated 8 months ago
- ☆37Updated last month
- ☆15Updated 3 years ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆60Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆41Updated 3 years ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆11Updated last year
- ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences☆28Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆45Updated last year
- Constrained decoding utilities for text generation using Huggingface seq2seq models☆23Updated last year
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆30Updated last year
- Official code repository for "Exploring Neural Models for Query-Focused Summarization".☆49Updated last year
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆67Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022☆28Updated 2 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 2 years ago
- ☆45Updated 2 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization☆55Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆57Updated 2 years ago