achouhan93 / eur-lex-sum
Dataset for cross-lingual legal text summarization from EUR-Lex document summaries
β13Updated last year
Alternatives and similar repositories for eur-lex-sum:
Users that are interested in eur-lex-sum are comparing it to the libraries listed below
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transferβ35Updated 2 years ago
- πΈοΈ A graph-augmented dense statute retriever. (EACL 2023)β21Updated last year
- Mining Legal Arguments in Court Decisions - Data and softwareβ66Updated last year
- Python toolbox to load, parse and process Official Journals of the European Union (EU).β14Updated 10 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.β96Updated last year
- β37Updated 2 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLPβ21Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.β105Updated 10 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataβ¦β86Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- β92Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.β70Updated 8 months ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!β34Updated 3 years ago
- β33Updated last year
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).β66Updated last year
- LegalCrawler: A tool for automated scraping of English legal corporaβ53Updated 2 years ago
- An EUR-Lex parser for Python.β29Updated 8 months ago
- Legal document classification with EuroVoc descriptors on 22 languages.β25Updated last year
- A dataset for pretraining language models targeted for legal tasks.β127Updated 2 years ago
- Natural language understanding benchmarks for Norwegianβ14Updated last year
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- A comprehensive benchmark for entity disambiguationβ25Updated last year
- Code and model checkpoints for the MultiVerS model for scientific claim verification.β45Updated last year
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".β22Updated 3 years ago
- β84Updated 6 months ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extractionβ¦β104Updated 8 months ago
- A Framework for Comprehensive Quantity Extractionβ20Updated 11 months ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.β23Updated 8 months ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"β54Updated 3 years ago
- Collection of Datasets for Legal Text Processingβ92Updated last year