Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles
☆50Jun 15, 2024Updated last year
Alternatives and similar repositories for Multi-XScience
Users that are interested in Multi-XScience are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Nov 4, 2022Updated 3 years ago
- Large-scale multi-document summarization dataset and code☆295May 8, 2023Updated 2 years ago
- Video Games Dataset for Multi-Document Summarization☆19Sep 20, 2025Updated 7 months ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆17May 2, 2025Updated last year
- ☆18Oct 22, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"☆18Jun 8, 2021Updated 4 years ago
- A dataset of fine-grained knowledge graphs of scientific claims☆16Sep 24, 2021Updated 4 years ago
- ☆61Aug 20, 2024Updated last year
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith☆14May 18, 2021Updated 4 years ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"☆13Feb 14, 2022Updated 4 years ago
- Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"☆390Mar 24, 2023Updated 3 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS☆63May 1, 2023Updated 3 years ago
- LongSumm - Scientific Document Summarization Task☆74Jun 30, 2022Updated 3 years ago
- A machine learning-based system that uses state-of-the-art natural language processing (NLP) question answering (QA) techniques combined …☆27Mar 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PyTorch-IRGAN is a PyTorch version implementation of the item recommendation part of IRGAN.☆12Dec 2, 2019Updated 6 years ago
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits☆40Jan 8, 2026Updated 3 months ago
- code and dataset of EMNLP 2020 paper "PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge"☆12Nov 6, 2020Updated 5 years ago
- [AAAI'21] Code and dataset for our paper: Enhancing Scientific Papers Summarization with Citation Graph☆25Oct 16, 2022Updated 3 years ago
- ☆12Feb 26, 2024Updated 2 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation."☆19Jun 4, 2023Updated 2 years ago
- ☆12Oct 19, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆20Feb 17, 2024Updated 2 years ago
- Code for Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT (coling 2020)☆16Mar 25, 2023Updated 3 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- ☆13Nov 11, 2022Updated 3 years ago
- Knowledge Graph Simple Question Answering for Unseen Domains☆13Jul 2, 2025Updated 10 months ago
- A Meta-Review Dataset for Controllable Text Generation☆29Mar 20, 2024Updated 2 years ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- Accompanying code for the ProteinGLUE method☆12Apr 12, 2022Updated 4 years ago
- Code base for "Contextualized Rewriting for Text Summarization"☆29Feb 13, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 6 months ago
- ☆13Jul 3, 2023Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Jul 25, 2023Updated 2 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.☆48Jul 22, 2023Updated 2 years ago
- Official repository for the autoPET III challenge.☆11Jan 8, 2026Updated 3 months ago
- ☆15Nov 24, 2020Updated 5 years ago