achouhan93 / eur-lex-sum
Dataset for cross-lingual legal text summarization from EUR-Lex document summaries
β12Updated 6 months ago
Related projects: β
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transferβ32Updated 2 years ago
- π A graph-augmented dense statute retriever. (EACL 2023)β17Updated 11 months ago
- Python toolbox to load, parse and process Official Journals of the European Union (EU).β13Updated 4 months ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLPβ19Updated 8 months ago
- An EUR-Lex parser for Python.β23Updated 2 months ago
- Mining Legal Arguments in Court Decisions - Data and softwareβ63Updated last year
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataβ¦β80Updated last year
- Legal document classification with EuroVoc descriptors on 22 languages.β25Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020β62Updated 4 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2β¦β65Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an β¦β29Updated last year
- EU Regulation Corpus Compiler: A pipeline of Python programs to download EU regulatory documents from the Eur-Lex portal via the CELLAR eβ¦β14Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.β56Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.β102Updated 5 months ago
- β24Updated 8 months ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)β102Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.β96Updated last year
- Source code and data for Like a Good Nearest Neighborβ28Updated 7 months ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.β35Updated 2 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!β33Updated 3 years ago
- CMU Linguistic Annotation Backendβ14Updated 5 months ago
- A dataset for pretraining language models targeted for legal tasks.β115Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal β¦β31Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linkingβ83Updated last year
- β78Updated 4 months ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"β51Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.β28Updated 2 years ago
- Collection of Datasets for Legal Text Processingβ77Updated last year
- StAtutory Reasoning Assessmentβ11Updated last year
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).β64Updated last year