neelguha / legal-ml-datasetsView external linksLinks
A collection of datasets and tasks for legal machine learning
☆424Jan 4, 2026Updated last month
Alternatives and similar repositories for legal-ml-datasets
Users that are interested in legal-ml-datasets are comparing it to the libraries listed below
Sorting:
- An open science effort to benchmark legal reasoning in foundation models☆535Aug 25, 2024Updated last year
- A simple library for segmenting legal texts☆17Apr 22, 2023Updated 2 years ago
- A dataset for pretraining language models targeted for legal tasks.☆142Jun 30, 2022Updated 3 years ago
- A collection of datasets and other resources for legal text processing.☆175Oct 20, 2025Updated 3 months ago
- 📖 A curated list of LegalNLP resources from all around the web.☆301Oct 14, 2025Updated 4 months ago
- NLP Web API for Legal Text☆18Dec 23, 2022Updated 3 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆95Mar 27, 2023Updated 2 years ago
- A list of selected resources, methods, and tools dedicated to Legal Text Analytics.☆696Nov 5, 2024Updated last year
- API client for fetching and comparing passages from legislation☆14Jan 26, 2025Updated last year
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆237Jul 23, 2025Updated 6 months ago
- Implementation of different summarization algorithms applied to legal case judgements.☆217Nov 9, 2022Updated 3 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆100Apr 13, 2023Updated 2 years ago
- SALI LMSS: Legal Matter Standard Specification☆73Mar 24, 2025Updated 10 months ago
- ☆40Jul 17, 2022Updated 3 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆59Aug 18, 2022Updated 3 years ago
- CUAD (NeurIPS 2021)☆471Jul 13, 2023Updated 2 years ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- This repository is dedicated to summarizing papers related to large language models with the field of law☆280Jan 15, 2026Updated last month
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆78Jun 19, 2024Updated last year
- LexNLP by LexPredict☆764May 27, 2024Updated last year
- Instant redline with AI summary☆36Dec 7, 2025Updated 2 months ago
- KL3M training data collection and preprocessing☆20Apr 14, 2025Updated 10 months ago
- ☆111Oct 8, 2025Updated 4 months ago
- LexPredict Legal Dictionaries☆131Aug 31, 2022Updated 3 years ago
- AI + Legal APIs: A Tool-Based Retrieval Augmented Generation Workbench for Legal AI UX Research.☆124Oct 29, 2024Updated last year
- A spaCy pipeline and model for NLP on unstructured legal text.☆672Jul 16, 2024Updated last year
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies.☆148Mar 30, 2024Updated last year
- CAP database scripts.☆194Sep 10, 2024Updated last year
- A database of court reporters, tests and other experiments☆123Feb 9, 2026Updated last week
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Jan 2, 2021Updated 5 years ago
- Python libraries for extracting from data sources like Rechtspraak, ECHR, Cellar☆13Jul 2, 2025Updated 7 months ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆20Jul 24, 2023Updated 2 years ago
- Quickly go from a paper court form to a runnable, guided, step-by-step web application powered by Docassemble. Swap out branding and pre-…☆55Feb 3, 2026Updated 2 weeks ago
- Linking of legal documents to other legal documents.☆14Jun 2, 2022Updated 3 years ago
- ☆10Jul 15, 2024Updated last year
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆23Dec 28, 2023Updated 2 years ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks.☆27Sep 14, 2024Updated last year