Breakend / PileOfLaw
A dataset for pretraining language models targeted for legal tasks.
☆122Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for PileOfLaw
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆187Updated last year
- Collection of Datasets for Legal Text Processing☆83Updated last year
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆86Updated last year
- LegalCrawler: A tool for automated scraping of English legal corpora☆48Updated 2 years ago
- 📖 A curated list of LegalNLP resources from all around the web.☆244Updated last year
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆86Updated last year
- Mining Legal Arguments in Court Decisions - Data and software☆64Updated last year
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆14Updated 2 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆20Updated 10 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆65Updated 5 months ago
- ☆18Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated 2 years ago
- Zero-shot evaluation on LEXGLUE tasks with GTP3.5☆27Updated last year
- NLP Web API for Legal Text☆17Updated last year
- A simple library for segmenting legal texts☆13Updated last year
- ☆25Updated 2 years ago
- Find legal citations in any block of text☆123Updated 4 months ago
- Implementation of different summarization algorithms applied to legal case judgements.☆188Updated 2 years ago
- multimodal document analysis☆160Updated 5 months ago
- 📚 Materials for Advanced Legal Analytics (LAW3027) @ Maastricht University.☆13Updated 6 months ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆38Updated last year
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆33Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆103Updated 6 months ago
- ☆35Updated 2 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated last year
- LexPredict Legal Dictionaries☆111Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆31Updated 3 years ago
- ☆82Updated 6 months ago
- ☆39Updated last year