Breakend/PileOfLaw

A dataset for pretraining language models targeted for legal tasks.

☆148

Alternatives and similar repositories for PileOfLaw

Users interested in PileOfLaw are comparing it to the libraries listed below.

coastalcph / lex-glue
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
☆266 · Jul 23, 2025 · Updated last year
TiltonLAW / LegalWRITER
GPT-3.5-turbo + Harvard's Case Access Project
☆19 · Jun 6, 2023 · Updated 3 years ago
maastrichtlawtech / law3027-advanced-legal-analytics
📚 Materials for Advanced Legal Analytics (LAW3027) @ Maastricht University.
☆14 · May 8, 2024 · Updated 2 years ago
neelguha / legal-ml-datasets
A collection of datasets and tasks for legal machine learning
☆441 · Apr 19, 2026 · Updated 3 months ago
reglab / casehold
Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…
☆97 · Mar 27, 2023 · Updated 3 years ago
dot-legal / reference
Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.
☆13 · Jul 12, 2022 · Updated 4 years ago
jsavelka / sbd_adjudicatory_dec
☆20 · Jun 11, 2021 · Updated 5 years ago
openlegaldata / awesome-legal-data
A collection of datasets and other resources for legal text processing.
☆281 · Jul 9, 2026 · Updated 2 weeks ago
GergesBernaba1 / LawyerSys
Law Firm Management System
☆17 · Dec 19, 2025 · Updated 7 months ago
ds-modules / LEGALST-190
UC Berkeley LEGALST 190 (Data, Prediction, and Law)
☆21 · Jun 10, 2025 · Updated last year
maastrichtlawtech / awesome-legal-nlp
📖 A curated list of LegalNLP resources from all around the web.
☆334 · Oct 14, 2025 · Updated 9 months ago
bockph / Legal-Sentence-Role-Classification
This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…
☆18 · Feb 22, 2022 · Updated 4 years ago
JoelNiklaus / LegalDatasets
This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆19 · Jun 16, 2023 · Updated 3 years ago
iliaschalkidis / LegalCrawler
LegalCrawler: A tool for automated scraping of English legal corpora
☆64 · Aug 18, 2022 · Updated 3 years ago
mscarey / justopinion
Download client for legal opinions
☆13 · Jun 12, 2026 · Updated last month
HazyResearch / legalbench
An open science effort to benchmark legal reasoning in foundation models
☆614 · Mar 30, 2026 · Updated 3 months ago
maastrichtlawtech / law3025-legal-analytics
📚 Materials for Legal Analytics (LAW3025) @ Maastricht University
☆14 · Jan 27, 2026 · Updated 5 months ago
darrow-labs / LegalLens
☆10 · Jul 15, 2024 · Updated 2 years ago
beursken / PythonForLawyers
Introduction to Software Development for Lawyers
☆29 · Mar 13, 2024 · Updated 2 years ago
TorchlightLegal / Database-Build_1.0
This repo provides database architecture that provides case law for Legal Research & Machine Learning Model Study
☆12 · May 25, 2017 · Updated 9 years ago
Liquid-Legal-Institute / Legal-Text-Analytics
A list of selected resources, methods, and tools dedicated to Legal Text Analytics.
☆727 · Nov 5, 2024 · Updated last year
elliottash / robot_judge_2019
Course materials for "Building a Robot Judge: Data Science for the Law", Spring 2019
☆11 · Feb 20, 2020 · Updated 6 years ago
mscarey / AuthoritySpoke
Reading legal authority for the last time
☆44 · Jun 30, 2026 · Updated 3 weeks ago
freelawproject / free.law
The homepage for Free Law Project
☆70 · Updated this week
273v / lmss-suggestion-api
SALI LMSS Suggestion API
☆18 · Jan 5, 2024 · Updated 2 years ago
alea-institute / kl3m-data
KL3M training data collection and preprocessing
☆22 · Apr 14, 2025 · Updated last year
freelawproject / inception
Our microservice for generating embeddings from blocks of text
☆54 · Feb 20, 2026 · Updated 5 months ago
ICLRandD / Blackstone
A spaCy pipeline and model for NLP on unstructured legal text.
☆693 · Jul 16, 2024 · Updated 2 years ago
jluech / LAWrgMin
Argumentation Mining Tool for Lawyers
☆15 · May 18, 2021 · Updated 5 years ago
davidawad / lobe
Lobe is the world's first AI paralegal.
☆54 · Dec 8, 2022 · Updated 3 years ago
freelawproject / eyecite
Find legal citations in any block of text
☆263 · Updated this week
raindrum / citeurl
an extensible tool to generate hyperlinks from legal citations
☆44 · Jan 20, 2026 · Updated 6 months ago
openlegaldata / legal-reference-extraction
Legal Reference Extraction
☆49 · Jun 15, 2026 · Updated last month
freelawproject / disclosure-extractor
A financial disclosure data extraction tool.
☆22 · Aug 2, 2023 · Updated 2 years ago
Law-AI / codscomad2023tutorial
This repository contains links to different Law-AI resources such as datasets and tools.
☆19 · Jan 12, 2023 · Updated 3 years ago
Aditya-shahh / Legal-AI
An AI based legal search platform, that curates relevant legal search based on searched problem, that could be used by lawyers to speed u…
☆16 · Jan 4, 2023 · Updated 3 years ago
DotDoug / TreatiseAI
A simple GPT-3 interface to automate core legal writing tasks
☆13 · Mar 8, 2023 · Updated 3 years ago
ciarrocki / LibreLaw
A Free Database of Legal Materials
☆31 · Feb 17, 2020 · Updated 6 years ago
freelawproject / citation-regexes
A collection of regular expressions for matching citations to state, federal, and even international law
☆46 · Jul 6, 2021 · Updated 5 years ago