allenai/open-mds

The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an issue if you run into any trouble!

☆33

Alternatives and similar repositories for open-mds

Users who are interested in open-mds are comparing it to the libraries listed below.

yale-nlp / ODSum
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆11 · Sep 20, 2024 · Updated last year
multilexsum / dataset
Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits
☆23 · Dec 15, 2022 · Updated 3 years ago
allenai / smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆35 · May 24, 2024 · Updated 2 years ago
allenai / ms2
☆68 · Oct 5, 2022 · Updated 3 years ago
allenai / mslr-shared-task
Multidocument Summarization for Literature Review Shared Task 2022
☆30 · Oct 16, 2022 · Updated 3 years ago
cylnlp / convsumx
Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation
☆19 · Mar 23, 2024 · Updated 2 years ago
allenai / PRIMER
The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
☆157 · Nov 4, 2022 · Updated 3 years ago
dot-legal / reference
Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.
☆13 · Jul 12, 2022 · Updated 4 years ago
himkt / allennlp-optuna
⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy
☆33 · Nov 23, 2021 · Updated 4 years ago
THU-BPM / ISESL-SQL
The source code of paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022.
☆15 · Jan 9, 2023 · Updated 3 years ago
dmis-lab / SeqTagQA
Sequence Tagging for Biomedical Extractive Question Answering (Bioinformatics'2020)
☆10 · Jul 3, 2023 · Updated 3 years ago
mscarey / legislice
API client for fetching and comparing passages from legislation
☆14 · Jun 29, 2026 · Updated 3 weeks ago
JSv4 / AtticusClassifier
Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus
☆14 · Jan 2, 2021 · Updated 5 years ago
sergiog95 / csabstracts
Dataset of scientific abstracts for the purpose of sentence classification
☆10 · Sep 17, 2019 · Updated 6 years ago
algox / rulii
Business Rule Engine
☆22 · Updated this week
allenai / scidocs
Dataset accompanying the SPECTER model
☆148 · Dec 19, 2022 · Updated 3 years ago
asahi417 / relbert
The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…
☆48 · Dec 2, 2024 · Updated last year
mscarey / justopinion
Download client for legal opinions
☆13 · Jun 12, 2026 · Updated last month
bettyblocks / material-ui-component-set
A Betty Blocks Component Set based on Material UI
☆25 · May 21, 2026 · Updated 2 months ago
ruiqi-zhong / DescribeDistributionalDifferences
Code for preprint: Summarizing Differences between Text Distributions with Natural Language
☆43 · Feb 24, 2023 · Updated 3 years ago
allenai / beaker-gantry
Gantry provides an API that streamlines running experiments in Beaker
☆32 · Updated this week
cceyda / lit-NER
TorchServe+Streamlit for easily serving your HuggingFace NER models
☆33 · Jul 4, 2022 · Updated 4 years ago
terrierteam / pyterrier_t5
☆17 · Apr 30, 2026 · Updated 2 months ago
tleyden / open-ocr-client
Client library for OpenOCR
☆32 · Dec 3, 2014 · Updated 11 years ago
Yale-LILY / DYLE
Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization
☆57 · Aug 2, 2023 · Updated 2 years ago
neelguha / legal-segmenter
A simple library for segmenting legal texts
☆18 · Apr 22, 2023 · Updated 3 years ago
jtonglet / Numerical-Hybrid-QA-Literature
A list of Numerical Multimodal reasoning papers and their implementation
☆11 · May 13, 2024 · Updated 2 years ago
Shen-Chenhui / MReD
A Meta-Review Dataset for Controllable Text Generation
☆28 · Mar 20, 2024 · Updated 2 years ago
ibm-hyperknowledge / hkpy
A Python module to provide software abstractions to ease accessing hyperknowledge graphs
☆11 · Dec 19, 2024 · Updated last year
nstawfik / MedSentEval
☆11 · Nov 19, 2020 · Updated 5 years ago
ShuyangCao / hibrids_summ
Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".
☆13 · May 24, 2022 · Updated 4 years ago
WGLab / Bioformer
Bioformer: an efficient BERT model for biomedical text mining
☆56 · Feb 7, 2023 · Updated 3 years ago
unitedstates / BillMap
Utilities and applications for the FlatGov project by Demand Progress
☆17 · Feb 8, 2023 · Updated 3 years ago
allenai / scirepeval
SciRepEval benchmark training and evaluation scripts
☆89 · May 5, 2026 · Updated 2 months ago
facebookresearch / quip
Official repository for the paper "Question Answering Infused Pre-training of General-Purpose Contextualized Representations" by Robin Ji…
☆16 · Aug 13, 2021 · Updated 4 years ago
zzstoatzz / raggy
scraping and querying documents for LLMs
☆24 · Oct 6, 2025 · Updated 9 months ago
MIT-LCP / 2019_toronto_health_hack
2019 Toronto Datathon https://www.tdothealthhack.com
☆11 · Oct 4, 2019 · Updated 6 years ago
yakuza8 / first-order-predicate-logic-theorem-prover
Autonomous Theorem Prover for First Order Predicate Logic
☆12 · Jun 29, 2020 · Updated 6 years ago
scalaboy / MutiTaskLearn
☆12 · Oct 5, 2022 · Updated 3 years ago