yaolu/Multi-XScience

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yaolu/Multi-XScience)

yaolu / Multi-XScience

Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles

☆50

Alternatives and similar repositories for Multi-XScience

Users that are interested in Multi-XScience are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / PRIMER
View on GitHub
The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
☆157Nov 4, 2022Updated 3 years ago
Alex-Fabbri / Multi-News
View on GitHub
Large-scale multi-document summarization dataset and code
☆295May 8, 2023Updated 3 years ago
diegoantognini / GameWikiSum
View on GitHub
Video Games Dataset for Multi-Document Summarization
☆20Sep 20, 2025Updated 10 months ago
jacklxc / CORWA
View on GitHub
CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022
☆17May 2, 2025Updated last year
allenai / mup
View on GitHub
☆18Oct 22, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Kel-Lu / SciGen
View on GitHub
SciGen
☆25Aug 10, 2021Updated 4 years ago
AlexGidiotis / DANCER-summ
View on GitHub
Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"
☆18Jun 8, 2021Updated 5 years ago
allenai / ms2
View on GitHub
☆68Oct 5, 2022Updated 3 years ago
zacharyhorvitz / ParaGuide
View on GitHub
Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"
☆16Jul 17, 2024Updated 2 years ago
talaugust / definition-complexity
View on GitHub
☆14Jun 13, 2022Updated 4 years ago
complementizer / wcep-mds-dataset
View on GitHub
☆62Aug 20, 2024Updated last year
Wendy-Xiao / Extsumm_local_global_context
View on GitHub
This is the official code for Extractive Summarization of Long Documents by Combining Global and Local Context
☆69Oct 13, 2020Updated 5 years ago
armancohan / long-summarization
View on GitHub
Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"
☆391Mar 24, 2023Updated 3 years ago
zhongxia96 / MGSum
View on GitHub
Code for ACL'20 paper "Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization" .
☆36Feb 2, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
abhinavkashyap / sciwing
View on GitHub
SciWING is a modern toolkit for scientific document processing from WING-NUS
☆63May 1, 2023Updated 3 years ago
aviclu / CDLM
View on GitHub
☆51May 11, 2022Updated 4 years ago
guyfe / LongSumm
View on GitHub
LongSumm - Scientific Document Summarization Task
☆73Jun 30, 2022Updated 4 years ago
HLTCHKUST / CAiRE-COVID
View on GitHub
A machine learning-based system that uses state-of-the-art natural language processing (NLP) question answering (QA) techniques combined …
☆27Mar 24, 2023Updated 3 years ago
heyunh2015 / PARADE_dataset
View on GitHub
code and dataset of EMNLP 2020 paper "PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge"
☆12Nov 6, 2020Updated 5 years ago
rktamplayo / DenoiseSum
View on GitHub
[ACL2020] Unsupervised Opinion Summarization with Noising and Denoising
☆21Jul 3, 2020Updated 6 years ago
WING-NUS / SciAssist
View on GitHub
☆20Feb 17, 2024Updated 2 years ago
yzhangcs / ctc-copy
View on GitHub
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
☆20Oct 17, 2023Updated 2 years ago
Georgetown-IR-Lab / ExtendedSumm
View on GitHub
On Generating Extended Summaries of Long Documents
☆78Jan 26, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tetsu9923 / SciReviewGen
View on GitHub
Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation."
☆21Jun 4, 2023Updated 3 years ago
martiansideofthemoon / longeval-summarization
View on GitHub
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…
☆45Aug 10, 2024Updated last year
hasteck / EDLAE_NeurIPS2020
View on GitHub
☆12Oct 19, 2020Updated 5 years ago
GSidiropoulos / kgsqa_for_unseen_domains
View on GitHub
Knowledge Graph Simple Question Answering for Unseen Domains
☆13Jul 2, 2025Updated last year
metacarbon / shareAtt
View on GitHub
Beyond KV Caching: Shared Attention for Efficient LLMs
☆20Jul 19, 2024Updated 2 years ago
TysonYu / Laysumm
View on GitHub
The code repository for the paper "Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization".
☆24Nov 12, 2020Updated 5 years ago
RuifengYuan / FactExsum-coling2020
View on GitHub
Code for Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT (coling 2020)
☆16Mar 25, 2023Updated 3 years ago
verypluming / JaNLI
View on GitHub
☆17May 31, 2023Updated 3 years ago
ibivu / protein-glue
View on GitHub
Accompanying code for the ProteinGLUE method
☆13Apr 12, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
anlp-nenji / nlproceedings
View on GitHub
LaTeX document class for the proceedings of ANLP
☆21Oct 28, 2025Updated 8 months ago
psunlpgroup / MACSum
View on GitHub
Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.
☆34Jul 25, 2023Updated 2 years ago
simonepri / fever-transformers
View on GitHub
📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks
☆12Feb 21, 2020Updated 6 years ago
sdmhans / arxiv_dataset_extraction
View on GitHub
A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv
☆15Dec 7, 2020Updated 5 years ago
vzhong / e3
View on GitHub
Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.
☆48Jul 22, 2023Updated 3 years ago
deepeshhada / ReXPlug
View on GitHub
ReXPlug: Explainable Recommendation using Plug and Play Language Model, SIGIR 2021
☆10Nov 14, 2021Updated 4 years ago
wskbest / MFC-Bench
View on GitHub
☆12Oct 17, 2024Updated last year