lavis-nlp / GerDaLIR
German Dataset for Legal Information Retrieval
☆19Updated last year
Alternatives and similar repositories for GerDaLIR
Users who are interested in GerDaLIR compare it to the repositories listed below.
- Collection of Datasets for Legal Text Processing☆103Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆22Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Mining Legal Arguments in Court Decisions - Data and software☆68Updated 2 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆89Updated 2 years ago
- A dataset for pretraining language models targeted for legal tasks.☆131Updated 2 years ago
- ☆27Updated 2 months ago
- Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competi…☆31Updated 4 years ago
- ☆26Updated 3 years ago
- ☆24Updated this week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- A Dataset of German Legal Documents for Named Entity Recognition☆169Updated 2 years ago
- Plan and train German transformer models.☆23Updated 4 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆30Updated last month
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Zero-shot evaluation on LEXGLUE tasks with GPT-3.5☆28Updated 2 years ago
- multimodal document analysis☆164Updated 11 months ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆55Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- Evaluate language models using multiple choice items☆13Updated this week
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between…☆31Updated last year
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆37Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆17Updated last month
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- Entity linking evaluation and analysis tool☆23Updated last month
- MultiCite code and data. Models are available on Huggingface.☆31Updated 3 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year