multilexsum / dataset
Multi-LexSum is an abstractive summarization dataset for U.S. civil rights lawsuits.
☆19 Updated 2 years ago
Alternatives and similar repositories for dataset:
Users who are interested in dataset are comparing it to the repositories listed below:
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆32 Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆42 Updated last year
- ☆24 Updated 4 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 Updated 6 months ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022). ☆43 Updated 2 years ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF… ☆27 Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021 ☆29 Updated 2 years ago
- FaVIQ: Fact Verification from Information-seeking Questions ☆43 Updated 2 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu). ☆20 Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 Updated 2 years ago
- ☆15 Updated 3 years ago
- Multidocument Summarization for Literature Review Shared Task 2022 ☆28 Updated 2 years ago
- ☆14 Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface. ☆29 Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆27 Updated last year
- ☆46 Updated 5 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training. ☆27 Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 Updated 3 years ago
- ☆37 Updated last year
- REALSumm: Re-evaluating Evaluation in Text Summarization ☆71 Updated 2 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22 ☆18 Updated last year
- ☆24 Updated last year
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance ☆17 Updated 2 years ago
- ☆93 Updated 11 months ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆140 Updated 2 years ago
- Repository for the CODAH dataset ☆22 Updated 2 years ago
- Schema2QA Question Answering Dataset ☆18 Updated 2 years ago
- ☆74 Updated 3 years ago
- ☆45 Updated 2 years ago