The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an issue if you run into any trouble!
☆33Jun 24, 2023Updated 2 years ago
Alternatives and similar repositories for open-mds
Users that are interested in open-mds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆21Dec 15, 2022Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated last year
- Using Huggingface to generate relation expressions☆15Jan 15, 2021Updated 5 years ago
- Code for EACL 2023 paper "LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control"☆20Feb 7, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Twin Neural Network Training with PyTorch and fast.ai and its Deployment with TorchServe on Amazon SageMaker☆11May 21, 2024Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022☆30Oct 16, 2022Updated 3 years ago
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation☆19Mar 23, 2024Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Nov 4, 2022Updated 3 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆33Nov 23, 2021Updated 4 years ago
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- Sequence Tagging for Biomedical Extractive Question Answering (Bioinformatics'2020)☆11Jul 3, 2023Updated 2 years ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Jan 2, 2021Updated 5 years ago
- Dataset accompanying the SPECTER model☆146Dec 19, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆47Dec 2, 2024Updated last year
- Download client for legal opinions☆13Jan 26, 2025Updated last year
- A Betty Blocks Component Set based on Material UI☆25Apr 21, 2026Updated 2 weeks ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Feb 24, 2023Updated 3 years ago
- EMNLP 2022: Biomedical NER for the Enterprise with Distillated BERN2 and the Kazu Framework☆11Aug 29, 2024Updated last year
- ☆16Updated this week
- Themed, fully featured PDF viewer for the Atom editor☆12Jan 28, 2026Updated 3 months ago
- A Python module to provide software abstractions to ease accessing hyperknowledge graphs☆11Dec 19, 2024Updated last year
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization☆57Aug 2, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A simple library for segmenting legal texts☆18Apr 22, 2023Updated 3 years ago
- Client library for OpenOCR☆31Dec 3, 2014Updated 11 years ago
- ☆13Apr 29, 2026Updated last week
- Official repository for the paper "Question Answering Infused Pre-training of General-Purpose Contextualized Representations" by Robin Ji…☆15Aug 13, 2021Updated 4 years ago
- ☆12Oct 5, 2022Updated 3 years ago
- Rust library for indexing and quickly searching large pretraining corpora☆31Oct 30, 2025Updated 6 months ago
- scraping and querying documents for LLMs☆24Oct 6, 2025Updated 7 months ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago
- This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC)☆17Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A library for Partially Homomorphic Encryption in Python☆12May 30, 2017Updated 8 years ago
- UNMAINTAINED: A multidocument text summarizer, the final project for CIS-530: Intro to Computational Linguistics☆40Sep 26, 2015Updated 10 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆17May 2, 2025Updated last year
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Resources for grounding protein families and complexes from text and describing their hierarchical relationships.☆18Mar 26, 2026Updated last month
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago