lavis-nlp / GerDaLIRLinks
German Dataset for Legal Information Retrieval
☆20Updated last year
Alternatives and similar repositories for GerDaLIR
Users that are interested in GerDaLIR are comparing it to the libraries listed below
Sorting:
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆229Updated 4 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
- multimodal document analysis☆166Updated 2 weeks ago
- A Dataset of German Legal Documents for Named Entity Recognition☆172Updated 3 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆92Updated 2 years ago
- ☆173Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70Updated 2 years ago
- ☆63Updated 5 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆69Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆63Updated last year
- Mining Legal Arguments in Court Decisions - Data and software☆72Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆108Updated last year
- A multilingual version of MS MARCO passage ranking dataset☆144Updated 2 years ago
- ☆40Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- ☆28Updated 6 months ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Updated last year
- Coreference Resolution☆79Updated 4 years ago
- ☆370Updated last year
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆19Updated 3 weeks ago
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆40Updated 3 years ago
- A Framework for Textual Entailment based Zero Shot text classification☆153Updated last year
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆45Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated 2 years ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi…☆107Updated last year
- ☆45Updated 2 years ago
- A dataset for pretraining language models targeted for legal tasks.☆140Updated 3 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated 2 months ago