morningmoni / CiteSum
Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", EMNLP 2022
☆33Updated 2 years ago
Alternatives and similar repositories for CiteSum:
Users that are interested in CiteSum are comparing it to the libraries listed below
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- ☆33Updated last year
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆27Updated last year
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Updated last year
- Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles☆43Updated 10 months ago
- ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences☆28Updated last year
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆56Updated 2 years ago
- Official code repository for "Exploring Neural Models for Query-Focused Summarization".☆50Updated last year
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- Generating claims for zero-shot scientific fact checking☆30Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆67Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated last year
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Updated last year
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)☆47Updated 7 months ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆20Updated 4 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆75Updated 3 years ago
- The Multitask Long Document Benchmark☆38Updated 2 years ago
- ☆54Updated 2 years ago
- ☆24Updated last year
- ☆14Updated last year
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021☆29Updated 2 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆28Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year