Breakend / PileOfLaw
A dataset for pretraining language models targeted for legal tasks.
☆131 Updated 2 years ago
Alternatives and similar repositories for PileOfLaw:
Users who are interested in PileOfLaw are comparing it to the repositories listed below.
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ☆88 Updated 2 years ago
- Collection of Datasets for Legal Text Processing ☆101 Updated last year
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English ☆201 Updated last year
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles. ☆72 Updated 10 months ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆55 Updated 2 years ago
- ☆18 Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆26 Updated 2 years ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R… ☆14 Updated 3 years ago
- Find legal citations in any block of text ☆150 Updated this week
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆87 Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software ☆68 Updated last year
- ☆26 Updated 3 years ago
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer ☆37 Updated 2 years ago
- 📖 A curated list of LegalNLP resources from all around the web. ☆268 Updated last year
- Kelvin Legal Data OS - Public Examples ☆19 Updated last year
- NLP Web API for Legal Text ☆18 Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆41 Updated 2 years ago
- A simple library for segmenting legal texts ☆15 Updated 2 years ago
- ☆38 Updated 2 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP ☆22 Updated last year
- A collection of datasets and tasks for legal machine learning ☆374 Updated 10 months ago
- noslegal taxonomy facets and release notes ☆36 Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 Updated last year
- Code for DELSumm, an unsupervised summarization algorithm for legal case judgements. ☆29 Updated 2 years ago
- 📚 Materials for Advanced Legal Analytics (LAW3027) @ Maastricht University. ☆13 Updated 11 months ago
- AI + Legal APIs: A Tool-Based Retrieval Augmented Generation Workbench for Legal AI UX Research. ☆67 Updated 6 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆106 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy. ☆13 Updated 9 months ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆32 Updated 4 years ago