Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles
☆50Jun 15, 2024Updated last year
Alternatives and similar repositories for Multi-XScience
Users that are interested in Multi-XScience are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Nov 4, 2022Updated 3 years ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆17May 2, 2025Updated last year
- ☆18Oct 22, 2022Updated 3 years ago
- Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"☆18Jun 8, 2021Updated 4 years ago
- ☆14Jun 13, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A dataset of fine-grained knowledge graphs of scientific claims☆16Sep 24, 2021Updated 4 years ago
- ☆61Aug 20, 2024Updated last year
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith☆14May 18, 2021Updated 5 years ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"☆13Feb 14, 2022Updated 4 years ago
- This is the official code for Extractive Summarization of Long Documents by Combining Global and Local Context☆69Oct 13, 2020Updated 5 years ago
- Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"☆390Mar 24, 2023Updated 3 years ago
- Code for ACL'20 paper "Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization" .☆36Feb 2, 2021Updated 5 years ago
- ☆51May 11, 2022Updated 4 years ago
- LongSumm - Scientific Document Summarization Task☆74Jun 30, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A machine learning-based system that uses state-of-the-art natural language processing (NLP) question answering (QA) techniques combined …☆27Mar 24, 2023Updated 3 years ago
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits☆41Jan 8, 2026Updated 4 months ago
- code and dataset of EMNLP 2020 paper "PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge"☆12Nov 6, 2020Updated 5 years ago
- [AAAI'21] Code and dataset for our paper: Enhancing Scientific Papers Summarization with Citation Graph☆25Oct 16, 2022Updated 3 years ago
- [ACL2020] Unsupervised Opinion Summarization with Noising and Denoising☆21Jul 3, 2020Updated 5 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆78Jan 26, 2021Updated 5 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆45Aug 10, 2024Updated last year
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation."☆21Jun 4, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Feb 17, 2024Updated 2 years ago
- Code for Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT (coling 2020)☆16Mar 25, 2023Updated 3 years ago
- The code repository for the paper "Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization".☆24Nov 12, 2020Updated 5 years ago
- A Meta-Review Dataset for Controllable Text Generation☆29Mar 20, 2024Updated 2 years ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- ☆17May 31, 2023Updated 2 years ago
- Code base for "Contextualized Rewriting for Text Summarization"☆30Feb 13, 2023Updated 3 years ago
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 6 months ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Jul 25, 2023Updated 2 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.☆48Jul 22, 2023Updated 2 years ago
- Full paper available on Researchgate☆17Oct 21, 2018Updated 7 years ago
- ROUGE summarization evaluation metric, enhanced with use of Word Embeddings☆23Oct 8, 2018Updated 7 years ago
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- ☆12Oct 17, 2024Updated last year