openlegaldata / awesome-legal-data
Collection of Datasets for Legal Text Processing
☆100Updated last year
Alternatives and similar repositories for awesome-legal-data:
Users that are interested in awesome-legal-data are comparing it to the libraries listed below
- A dataset for pretraining language models targeted for legal tasks.☆131Updated 2 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆55Updated 2 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆201Updated last year
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆87Updated 2 years ago
- Legal Reference Extraction☆29Updated 8 months ago
- Mining Legal Arguments in Court Decisions - Data and software☆67Updated last year
- 📖 A curated list of LegalNLP resources from all around the web.☆265Updated last year
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies.☆107Updated last year
- ☆18Updated 3 years ago
- ☆26Updated 3 years ago
- Find legal citations in any block of text☆147Updated this week
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆71Updated 10 months ago
- LexPredict Legal Dictionaries☆117Updated 2 years ago
- German Dataset for Legal Information Retrieval☆19Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Convert legal statutes and cases from official sources (or juris) to graphs☆24Updated 4 months ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆14Updated 3 years ago
- A simple library for segmenting legal texts☆15Updated 2 years ago
- an extensible tool to generate hyperlinks from legal citations☆33Updated 6 months ago
- FOLIO: Federated Open Legal Information Ontology☆21Updated last month
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆37Updated 2 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆22Updated last year
- Open Legal Data Platform☆110Updated this week
- NLP Web API for Legal Text☆18Updated 2 years ago
- ☆38Updated 2 years ago
- Reading legal authority for the last time☆37Updated last month
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆86Updated 2 years ago
- An open science effort to benchmark legal reasoning in foundation models☆422Updated 8 months ago
- An EUR-Lex parser for Python.☆30Updated 9 months ago
- A collection of datasets and tasks for legal machine learning☆371Updated 9 months ago