A collection of datasets and other resources for legal text processing.
☆215Mar 15, 2026Updated last month
Alternatives and similar repositories for awesome-legal-data
Users that are interested in awesome-legal-data are comparing it to the libraries listed below.
- Legal Reference Extraction☆45Feb 13, 2026Updated 2 months ago
- German Dataset for Legal Information Retrieval☆25Feb 26, 2024Updated 2 years ago
- A list of selected resources, methods, and tools dedicated to Legal Text Analytics.☆716Nov 5, 2024Updated last year
- Open Legal Data Platform☆135Updated this week
- A collection of datasets and tasks for legal machine learning☆435Jan 4, 2026Updated 3 months ago
- A dataset for pretraining language models targeted for legal tasks.☆145Jun 30, 2022Updated 3 years ago
- Write beautifully short contracts. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- Python toolbox to load, parse and process Official Journals of the European Union (EU).☆22May 3, 2024Updated last year
- Jupyter notebook showcasing use of the Open Legal Data API☆25Dec 22, 2025Updated 3 months ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆30Nov 9, 2025Updated 5 months ago
- An open science effort to benchmark legal reasoning in foundation models☆569Mar 30, 2026Updated 2 weeks ago
- LexPredict Legal Dictionaries☆133Aug 31, 2022Updated 3 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆62Aug 18, 2022Updated 3 years ago
- 📖 A curated list of LegalNLP resources from all around the web.☆314Oct 14, 2025Updated 6 months ago
- Named entity recognition for the legal domain☆43Jun 1, 2021Updated 4 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆96Mar 27, 2023Updated 3 years ago
- A dataset of semantically related sentence pairs in the German legal domain☆10Feb 26, 2021Updated 5 years ago
- ☆31May 14, 2025Updated 11 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆250Jul 23, 2025Updated 8 months ago
- Download client for legal opinions☆13Jan 26, 2025Updated last year
- NLP Web API for Legal Text☆18Dec 23, 2022Updated 3 years ago
- LexPredict ContraxSuite☆181Feb 16, 2023Updated 3 years ago
- A simple library for segmenting legal texts☆18Apr 22, 2023Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆76May 15, 2023Updated 2 years ago
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies.☆160Mar 30, 2024Updated 2 years ago
- ☆20Jun 11, 2021Updated 4 years ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆41Jul 6, 2021Updated 4 years ago
- New scrapers☆10Feb 1, 2026Updated 2 months ago
- This is a prototype of a semi-automatic data anonymization app for German documents. ➡️ The project has moved to: https://gitlab.opencode…☆24Mar 20, 2026Updated 3 weeks ago
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆33May 25, 2022Updated 3 years ago
- KL3M training data collection and preprocessing☆21Apr 14, 2025Updated last year
- A minimal Akoma Ntoso-based legal informatics toolchain☆16Oct 25, 2023Updated 2 years ago
- LexNLP by LexPredict☆774May 27, 2024Updated last year
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆19Jun 16, 2023Updated 2 years ago
- ☆40Jul 17, 2022Updated 3 years ago
- SALI LMSS: Legal Matter Standard Specification☆77Mar 10, 2026Updated last month
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated…☆27Oct 4, 2022Updated 3 years ago
- Implementation of different summarization algorithms applied to legal case judgements.☆221Nov 9, 2022Updated 3 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆23Dec 28, 2023Updated 2 years ago