A collection of datasets and other resources for legal text processing.
☆255 · Jun 9, 2026 · Updated last week
Alternatives and similar repositories for awesome-legal-data
Users that are interested in awesome-legal-data are comparing it to the libraries listed below.
- Legal Reference Extraction ☆48 · May 12, 2026 · Updated last month
- German Dataset for Legal Information Retrieval ☆27 · Feb 26, 2024 · Updated 2 years ago
- Open Legal Data Platform ☆152 · Updated this week
- A list of selected resources, methods, and tools dedicated to Legal Text Analytics. ☆721 · Nov 5, 2024 · Updated last year
- A collection of datasets and tasks for legal machine learning ☆437 · Apr 19, 2026 · Updated 2 months ago
- A dataset for pretraining language models targeted for legal tasks. ☆147 · Jun 30, 2022 · Updated 3 years ago
- Python toolbox to load, parse and process Official Journals of the European Union (EU). ☆24 · May 3, 2024 · Updated 2 years ago
- Jupyter notebook showcasing use of the Open Legal Data API ☆25 · Dec 22, 2025 · Updated 5 months ago
- An open science effort to benchmark legal reasoning in foundation models ☆594 · Mar 30, 2026 · Updated 2 months ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆64 · Aug 18, 2022 · Updated 3 years ago
- 📖 A curated list of LegalNLP resources from all around the web. ☆327 · Oct 14, 2025 · Updated 8 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English ☆259 · Jul 23, 2025 · Updated 10 months ago
- Named entity recognition for the legal domain ☆43 · Jun 1, 2021 · Updated 5 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆104 · Apr 13, 2023 · Updated 3 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ☆95 · Mar 27, 2023 · Updated 3 years ago
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies. ☆176 · Mar 30, 2024 · Updated 2 years ago
- A dataset of semantically related sentence pairs in the German legal domain ☆10 · Feb 26, 2021 · Updated 5 years ago
- ☆31 · Updated this week
- Reading legal authority for the last time ☆44 · Updated this week
- Find legal citations in any block of text ☆250 · Oct 3, 2025 · Updated 8 months ago
- Download client for legal opinions ☆13 · Jun 12, 2026 · Updated last week
- NLP Web API for Legal Text ☆18 · Dec 23, 2022 · Updated 3 years ago
- LexPredict ContraxSuite ☆185 · Feb 16, 2023 · Updated 3 years ago
- A simple library for segmenting legal texts ☆18 · Apr 22, 2023 · Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and software ☆78 · May 15, 2023 · Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode… ☆21 · Mar 20, 2026 · Updated 2 months ago
- A Dataset of German Legal Documents for Named Entity Recognition ☆178 · Oct 19, 2022 · Updated 3 years ago
- ☆20 · Jun 11, 2021 · Updated 5 years ago
- Argumentation Mining Tool for Lawyers ☆15 · May 18, 2021 · Updated 5 years ago
- This is a prototype of a semi-automatic data anonymization app for German documents. ➡️ The project has moved to: https://gitlab.opencode… ☆24 · Mar 20, 2026 · Updated 2 months ago
- KL3M training data collection and preprocessing ☆22 · Apr 14, 2025 · Updated last year
- A minimal Akoma Ntoso-based legal informatics toolchain ☆16 · Oct 25, 2023 · Updated 2 years ago
- LexNLP by LexPredict ☆783 · May 27, 2024 · Updated 2 years ago
- 👨🏽⚖️ LegalEngine - qqmbr team - Junction2017 ☆24 · Aug 2, 2018 · Updated 7 years ago
- Code repo for CLERC: A Legal Precedent Dataset for Case Retrieval and Retrieval-Augmented Analysis Generation (NAACL 2025) ☆28 · Jan 28, 2025 · Updated last year
- Regular expression (regex) for finding German laws in texts. ☆20 · Jul 18, 2023 · Updated 2 years ago
- ☆40 · Jul 17, 2022 · Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆27 · Oct 4, 2022 · Updated 3 years ago
- Implementation of different summarization algorithms applied to legal case judgements. ☆222 · Nov 9, 2022 · Updated 3 years ago