multilexsum/dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/multilexsum/dataset)

multilexsum / dataset

Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits

☆23

Alternatives and similar repositories for dataset

Users that are interested in dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / open-mds
View on GitHub
The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …
☆33Jun 24, 2023Updated 3 years ago
prohandler / GS-Bulk-Emails
View on GitHub
Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email
☆17Dec 11, 2024Updated last year
allenai / smashed
View on GitHub
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆35May 24, 2024Updated 2 years ago
zoranmedic / mdcr
View on GitHub
Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…
☆12Oct 21, 2022Updated 3 years ago
clinicalml / cotrain-prompting
View on GitHub
Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance
☆16Sep 23, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
allenai / author-explorer
View on GitHub
An exploration of a few AI authors and the data Semantic Scholar has about their citations.
☆31Feb 16, 2023Updated 3 years ago
allenai / scidocs
View on GitHub
Dataset accompanying the SPECTER model
☆148Dec 19, 2022Updated 3 years ago
allenai / scirepeval
View on GitHub
SciRepEval benchmark training and evaluation scripts
☆89May 5, 2026Updated 2 months ago
dot-legal / reference
View on GitHub
Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.
☆13Jul 12, 2022Updated 4 years ago
himkt / allennlp-optuna
View on GitHub
⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy
☆33Nov 23, 2021Updated 4 years ago
JoelNiklaus / LEXTREME
View on GitHub
This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP
☆26Dec 28, 2023Updated 2 years ago
kkalouli / GKR_semantic_parser
View on GitHub
The Graphical Knowledge Representation (GKR) parser: it transforms a given sentence into a layered semantic graph
☆13May 16, 2022Updated 4 years ago
mscarey / legislice
View on GitHub
API client for fetching and comparing passages from legislation
☆14Jun 29, 2026Updated 3 weeks ago
JSv4 / AtticusClassifier
View on GitHub
Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus
☆14Jan 2, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
menik1126 / Swing-Bench
View on GitHub
[ICLR2026🔥Oral] SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
☆15Feb 26, 2026Updated 4 months ago
mscarey / justopinion
View on GitHub
Download client for legal opinions
☆13Jun 12, 2026Updated last month
ruiqi-zhong / DescribeDistributionalDifferences
View on GitHub
Code for preprint: Summarizing Differences between Text Distributions with Natural Language
☆43Feb 24, 2023Updated 3 years ago
shtoshni / fast-coref
View on GitHub
Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)
☆35Jul 28, 2023Updated 2 years ago
taoyds / cosql
View on GitHub
A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases
☆14Mar 22, 2022Updated 4 years ago
terrierteam / pyterrier_t5
View on GitHub
☆17Apr 30, 2026Updated 2 months ago
UBIAI / layout_lm_tutorial
View on GitHub
☆15Jun 16, 2021Updated 5 years ago
allenai / vila
View on GitHub
Incorporating VIsual LAyout Structures for Scientific Text Classification
☆180Mar 18, 2023Updated 3 years ago
dwadden / scifact-open
View on GitHub
Data and code for the SciFact-Open task
☆29Nov 24, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
tleyden / open-ocr-client
View on GitHub
Client library for OpenOCR
☆32Dec 3, 2014Updated 11 years ago
neelguha / legal-segmenter
View on GitHub
A simple library for segmenting legal texts
☆18Apr 22, 2023Updated 3 years ago
ibm-hyperknowledge / hkpy
View on GitHub
A Python module to provide software abstractions to ease accessing hyperknowledge graphs
☆11Dec 19, 2024Updated last year
nstawfik / MedSentEval
View on GitHub
☆11Nov 19, 2020Updated 5 years ago
unitedstates / BillMap
View on GitHub
Utilities and applications for the FlatGov project by Demand Progress
☆17Feb 8, 2023Updated 3 years ago
clinicalml / teaching-to-understand-ai
View on GitHub
Code and webpages for our study on teaching humans to defer to an AI
☆12Nov 6, 2023Updated 2 years ago
Websail-NU / CODAH
View on GitHub
Repository for the CODAH dataset
☆22Oct 29, 2022Updated 3 years ago
zzstoatzz / raggy
View on GitHub
scraping and querying documents for LLMs
☆24Oct 6, 2025Updated 9 months ago
MIT-LCP / 2019_toronto_health_hack
View on GitHub
2019 Toronto Datathon https://www.tdothealthhack.com
☆11Oct 4, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kieranrcampbell / ouijaflow
View on GitHub
Probabilistic single-cell pseudotime with Edward+Tensorflow
☆12Oct 5, 2017Updated 8 years ago
ZeweiChu / DiscoEval
View on GitHub
EMNLP DiscoEval paper
☆43Nov 12, 2019Updated 6 years ago
ICLRandD / LegalHackers2019
View on GitHub
This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC)
☆17Dec 8, 2022Updated 3 years ago
northanapon / dict-definition
View on GitHub
Preprocessing scripts to read definitions and other information from dictionaries
☆23Nov 7, 2017Updated 8 years ago
Sreyan88 / DALE
View on GitHub
Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP
☆11Oct 27, 2023Updated 2 years ago
PrimerAI / blanc
View on GitHub
Human-free quality estimation of document summaries
☆97Dec 1, 2025Updated 7 months ago
OHDSI / InspectOMOP
View on GitHub
InspectOmop is a lightweight python 3 package that assists in the extraction of electronic health record(EHR) data from relational databa…
☆15Apr 14, 2026Updated 3 months ago