Breakend / PileOfLaw
A dataset for pretraining language models targeted for legal tasks.
☆134 · Updated 3 years ago
Alternatives and similar repositories for PileOfLaw
Users who are interested in PileOfLaw are comparing it to the repositories listed below
- Collection of Datasets for Legal Text Processing ☆110 · Updated 2 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ☆90 · Updated 2 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English ☆210 · Updated 2 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆54 · Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software ☆68 · Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles. ☆73 · Updated last year
- Implementation of different summarization algorithms applied to legal case judgements. ☆206 · Updated 2 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆92 · Updated 2 years ago
- multimodal document analysis ☆166 · Updated last year
- 📖 A curated list of LegalNLP resources from all around the web. ☆275 · Updated 2 years ago
- ☆18 · Updated 4 years ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R… ☆14 · Updated 3 years ago
- ☆27 · Updated 3 years ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆17 · Updated 2 years ago
- NLP Web API for Legal Text ☆18 · Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆41 · Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆97 · Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- ☆79 · Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 · Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆25 · Updated 2 years ago
- ☆38 · Updated 2 years ago
- A simple library for segmenting legal texts ☆17 · Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆220 · Updated 2 years ago
- ☆40 · Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆94 · Updated last year
- ☆80 · Updated 8 months ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆67 · Updated 2 months ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development ☆20 · Updated last year
- A collection of datasets and tasks for legal machine learning ☆388 · Updated last year