Breakend / PileOfLawLinks
A dataset for pretraining language models targeted for legal tasks.
☆141Updated 3 years ago
Alternatives and similar repositories for PileOfLaw
Users that are interested in PileOfLaw are comparing it to the libraries listed below
Sorting:
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆95Updated 2 years ago
- A collection of datasets and other resources for legal text processing.☆168Updated 3 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆235Updated 6 months ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆59Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆73Updated 2 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆99Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆78Updated last year
- ☆20Updated 4 years ago
- 📖 A curated list of LegalNLP resources from all around the web.☆297Updated 3 months ago
- multimodal document analysis☆166Updated 2 months ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆16Updated 3 years ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆18Updated 2 years ago
- Implementation of different summarization algorithms applied to legal case judgements.☆217Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 3 years ago
- ☆28Updated 4 years ago
- Find legal citations in any block of text☆207Updated 4 months ago
- NLP Web API for Legal Text☆18Updated 3 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- StAtutory Reasoning Assessment☆15Updated 3 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Updated last year
- An open science effort to benchmark legal reasoning in foundation models☆531Updated last year
- ☆84Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 3 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆40Updated 3 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last month
- A collection of datasets and tasks for legal machine learning☆422Updated 3 weeks ago
- A simple library for segmenting legal texts☆17Updated 2 years ago