A collection of datasets and other resources for legal text processing.
☆246Apr 30, 2026Updated last month
Alternatives and similar repositories for awesome-legal-data
Users that are interested in awesome-legal-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Legal Reference Extraction☆47May 12, 2026Updated 2 weeks ago
- German Dataset for Legal Information Retrieval☆26Feb 26, 2024Updated 2 years ago
- Open Legal Data Platform☆144May 20, 2026Updated last week
- A list of selected resources, methods, and tools dedicated to Legal Text Analytics.☆719Nov 5, 2024Updated last year
- A collection of datasets and tasks for legal machine learning☆437Apr 19, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A dataset for pretraining language models targeted for legal tasks.☆145Jun 30, 2022Updated 3 years ago
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- Python toolbox to load, parse and process Official Journals of the European Union (EU).☆23May 3, 2024Updated 2 years ago
- Jupyter notebook showcases using the Open Legal Data API☆25Dec 22, 2025Updated 5 months ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆30Nov 9, 2025Updated 6 months ago
- An open science effort to benchmark legal reasoning in foundation models☆582Mar 30, 2026Updated 2 months ago
- LexPredict Legal Dictionaries☆134Aug 31, 2022Updated 3 years ago
- 📖 A curated list of LegalNLP resources from all around the web.☆324Oct 14, 2025Updated 7 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆253Jul 23, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A spaCy pipeline and model for NLP on unstructured legal text.☆689Jul 16, 2024Updated last year
- Named entity recognition for the legal domain☆43Jun 1, 2021Updated 4 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆96Mar 27, 2023Updated 3 years ago
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies.☆172Mar 30, 2024Updated 2 years ago
- A dataset of semantically related sentence pairs in the German legal domain☆10Feb 26, 2021Updated 5 years ago
- ☆31Updated this week
- Reading legal authority for the last time☆43Updated this week
- Find legal citations in any block of text☆240Oct 3, 2025Updated 7 months ago
- NLP Web API for Legal Text☆18Dec 23, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LexPredict ContraxSuite☆182Feb 16, 2023Updated 3 years ago
- A simple library for segmenting legal texts☆18Apr 22, 2023Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆76May 15, 2023Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode…☆21Mar 20, 2026Updated 2 months ago
- A Dataset of German Legal Documents for Named Entity Recognition☆177Oct 19, 2022Updated 3 years ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆43Jul 6, 2021Updated 4 years ago
- Argumentation Mining Tool for Lawyers☆15May 18, 2021Updated 5 years ago
- Neue Scraper☆10May 6, 2026Updated 3 weeks ago
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆32May 25, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- KL3M training data collection and preprocessing☆22Apr 14, 2025Updated last year
- A minimal Akoma Ntoso -based legal informatics toolchain☆16Oct 25, 2023Updated 2 years ago
- 👨🏽⚖️ LegalEngine - qqmbr team - Junction2017☆24Aug 2, 2018Updated 7 years ago
- Must-read Papers on Legal Intelligence☆497Jan 22, 2021Updated 5 years ago
- Code repo for CLERC: A Legal Precedent Dataset for Case Retrieval and Retrieval-Augmented Analysis Generation (NAACL 2025)☆29Jan 28, 2025Updated last year
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆19Jun 16, 2023Updated 2 years ago
- Regulärer Ausdruck zum Finden von Gesetzen in Texten/Regex to find German laws.☆20Jul 18, 2023Updated 2 years ago