A dataset for pretraining language models targeted for legal tasks.
☆142 · Jun 30, 2022 · Updated 3 years ago
Alternatives and similar repositories for PileOfLaw
Users who are interested in PileOfLaw are comparing it to the repositories listed below.
- Materials for Advanced Legal Analytics (LAW3027) @ Maastricht University. ☆14 · May 8, 2024 · Updated last year
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English ☆241 · Jul 23, 2025 · Updated 7 months ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆18 · Jun 16, 2023 · Updated 2 years ago
- GPT-3.5-turbo + Harvard's Case Access Project ☆18 · Jun 6, 2023 · Updated 2 years ago
- A collection of datasets and tasks for legal machine learning ☆429 · Jan 4, 2026 · Updated 2 months ago
- KL3M training data collection and preprocessing ☆20 · Apr 14, 2025 · Updated 10 months ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R… ☆16 · Feb 22, 2022 · Updated 4 years ago
- Legalpioneer dataset ☆15 · Apr 10, 2025 · Updated 10 months ago
- An open science effort to benchmark legal reasoning in foundation models ☆541 · Aug 25, 2024 · Updated last year
- Find legal citations in any block of text ☆210 · Oct 3, 2025 · Updated 5 months ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆59 · Aug 18, 2022 · Updated 3 years ago
- A curated list of LegalNLP resources from all around the web. ☆303 · Oct 14, 2025 · Updated 4 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ☆95 · Mar 27, 2023 · Updated 2 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆100 · Apr 13, 2023 · Updated 2 years ago
- ☆20 · Jun 11, 2021 · Updated 4 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 · Aug 15, 2024 · Updated last year
- Materials for Legal Analytics (LAW3025) @ Maastricht University ☆13 · Jan 27, 2026 · Updated last month
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development ☆21 · Jul 24, 2023 · Updated 2 years ago
- A list of selected resources, methods, and tools dedicated to Legal Text Analytics. ☆699 · Nov 5, 2024 · Updated last year
- Our microservice for generating embeddings from blocks of text ☆33 · Feb 20, 2026 · Updated 2 weeks ago
- ☆10 · Jul 15, 2024 · Updated last year
- A community-curated list of awesome lawtech software and learning resources for legal technology and design. ☆30 · Oct 3, 2019 · Updated 6 years ago
- A spaCy pipeline and model for NLP on unstructured legal text. ☆674 · Jul 16, 2024 · Updated last year
- Lobe is the world's first AI paralegal. ☆51 · Dec 8, 2022 · Updated 3 years ago
- Introduction to Software Development for Lawyers ☆28 · Mar 13, 2024 · Updated last year
- A collection of regular expressions for matching citations to state, federal, and even international law ☆40 · Jul 6, 2021 · Updated 4 years ago
- Download client for legal opinions ☆13 · Jan 26, 2025 · Updated last year
- Open Legal Data Platform ☆129 · Feb 25, 2026 · Updated last week
- Course materials for "Building a Robot Judge: Data Science for the Law", Spring 2019 ☆11 · Feb 20, 2020 · Updated 6 years ago
- A Free Database of Legal Materials ☆28 · Feb 17, 2020 · Updated 6 years ago
- This repository is about an app to help lawyers process law documents and suit cases using AI agents trained with OpenAI and other LL… ☆18 · Aug 14, 2023 · Updated 2 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects. ☆41 · Nov 8, 2023 · Updated 2 years ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents ☆30 · Nov 9, 2025 · Updated 3 months ago
- LexNLP by LexPredict ☆767 · May 27, 2024 · Updated last year
- AI apps/benchmark for legaltech ☆112 · Sep 22, 2021 · Updated 4 years ago
- Argumentation Mining Tool for Lawyers ☆15 · May 18, 2021 · Updated 4 years ago
- The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB). ☆31 · Feb 24, 2026 · Updated last week
- An AI-based legal search platform that curates relevant legal search based on the searched problem and that could be used by lawyers to speed u… ☆16 · Jan 4, 2023 · Updated 3 years ago
- UC Berkeley LEGALST 190 (Data, Prediction, and Law) ☆21 · Jun 10, 2025 · Updated 8 months ago