lavis-nlp / GerDaLIRLinks
German Dataset for Legal Information Retrieval
☆20Updated last year
Alternatives and similar repositories for GerDaLIR
Users that are interested in GerDaLIR are comparing it to the libraries listed below
Sorting:
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆230Updated 4 months ago
- ☆29Updated 7 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆93Updated 2 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆172Updated 3 years ago
- A multilingual version of MS MARCO passage ranking dataset☆145Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
- ☆176Updated last year
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆45Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Updated last year
- Mining Legal Arguments in Court Decisions - Data and software☆73Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- coFR: COreference resolution tool for FRench (and singletons).☆26Updated 5 years ago
- 📖 A curated list of LegalNLP resources from all around the web.☆292Updated 2 months ago
- multimodal document analysis☆166Updated last month
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70Updated 2 years ago
- A Framework for Textual Entailment based Zero Shot text classification☆153Updated last year
- GEMBA — GPT Estimation Metric Based Assessment☆134Updated last year
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 3 years ago
- A dataset for pretraining language models targeted for legal tasks.☆140Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆69Updated 2 years ago
- ☆40Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- A collection of datasets and other resources for legal text processing.☆154Updated last month
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆42Updated last year
- A Framework for Comprehensive Quantity Extraction☆20Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆198Updated 3 months ago
- ReFinED is an efficient and accurate entity linking (EL) system.☆226Updated last year
- Coreference Resolution☆79Updated 4 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated 2 months ago