lavis-nlp / GerDaLIRLinks
German Dataset for Legal Information Retrieval
☆20Updated last year
Alternatives and similar repositories for GerDaLIR
Users that are interested in GerDaLIR are comparing it to the libraries listed below
Sorting:
- Collection of Datasets for Legal Text Processing☆105Updated last year
- ☆24Updated 3 weeks ago
- Mining Legal Arguments in Court Decisions - Data and software☆68Updated 2 years ago
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆37Updated 2 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆22Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated 2 years ago
- Legal Reference Extraction☆32Updated last month
- A dataset for pretraining language models targeted for legal tasks.☆132Updated 2 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆205Updated last year
- A Dataset of German Legal Documents for Named Entity Recognition☆169Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆68Updated 2 years ago
- A library to evaluate factual correctness of abstractive summaries.☆11Updated 2 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆90Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆32Updated 3 years ago
- ☆27Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆56Updated 2 years ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆24Updated 11 months ago
- 🔍 A statutory article retrieval dataset in French. (ACL 2022)☆40Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022☆30Updated 2 years ago
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆17Updated last year
- 🕸️ A graph-augmented dense statute retriever. (EACL 2023)☆21Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆109Updated last year
- ☆91Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated 10 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Evaluate language models using multiple choice items☆13Updated 3 weeks ago