German Dataset for Legal Information Retrieval
☆25Feb 26, 2024Updated 2 years ago
Alternatives and similar repositories for GerDaLIR
Users that are interested in GerDaLIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analysing the German Bundestag by means of Natural Language Processing with the Bundestags-Mine.☆14Dec 3, 2024Updated last year
- ☆30May 14, 2025Updated 10 months ago
- A dataset of semantically related sentence pairs in the German legal domain☆10Feb 26, 2021Updated 5 years ago
- Legal Reference Extraction☆44Feb 13, 2026Updated last month
- Regulärer Ausdruck zum Finden von Gesetzen in Texten/Regex to find German laws.☆21Jul 18, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…☆23Feb 22, 2022Updated 4 years ago
- ☆17Jun 6, 2024Updated last year
- Convert legal statutes and cases from official sources (or juris) to graphs☆30Sep 11, 2025Updated 6 months ago
- PyTorch code for JEREX: Joint Entity-Level Relation Extractor☆67Dec 9, 2021Updated 4 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆177Oct 19, 2022Updated 3 years ago
- Code repository of the NAACL'21 paper "CoRT: Complementary Rankings from Transformers"☆12Jul 7, 2021Updated 4 years ago
- bb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.☆96Mar 17, 2026Updated last week
- ☆13Apr 7, 2025Updated 11 months ago
- ☆11Jul 11, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A curated list of awesome resources to create and customize your Curriculum Vitae☆25Mar 16, 2026Updated last week
- Poems retrieval demo built with GNES framework☆14Oct 3, 2019Updated 6 years ago
- German Parliamentary Corpus (GerParCor)☆30Jan 14, 2026Updated 2 months ago
- ☆15Nov 14, 2022Updated 3 years ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- A rolling version of the Latent Dirichlet Allocation.☆13Nov 27, 2023Updated 2 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"☆12Oct 20, 2023Updated 2 years ago
- ☆47Sep 6, 2025Updated 6 months ago
- A really fast document ranking engine using BM25 and TF-IDF. Based on Python using NLP packages NLTK and spacY.☆16May 8, 2018Updated 7 years ago
- ☆15Oct 30, 2023Updated 2 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Apr 8, 2022Updated 3 years ago
- EQUATE (Evaluating Quantitative Understanding Aptitude in Textual Entailment), framework for evaluating quantitative reasoning ability in…☆14Feb 13, 2022Updated 4 years ago
- Implements SemRe-Rank: improving automatic term extraction by incorporating semantic relatedness with personalised pagerank☆16Apr 7, 2018Updated 7 years ago
- Code repo for CLERC: A Legal Precedent Dataset for Case Retrieval and Retrieval-Augmented Analysis Generation (NAACL 2025)☆25Jan 28, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- python interface for mate tools☆17Jan 23, 2018Updated 8 years ago
- daten von offenesparlament.de☆14Oct 5, 2017Updated 8 years ago
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith☆14May 18, 2021Updated 4 years ago
- morphologically informed POS tagging for German☆25Nov 14, 2025Updated 4 months ago
- Source code for Paper "Legal Feature Enhanced Semantic Matching Network for Similar Case Matching".☆15Feb 17, 2020Updated 6 years ago
- Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation☆39Mar 3, 2025Updated last year
- LexEval: A Comprehensive Benchmark for Evaluating Large Language Models in Legal Domain☆91Oct 30, 2024Updated last year