maastrichtlawtech / bsard
π A statutory article retrieval dataset in French. (ACL 2022)
β39Updated last year
Alternatives and similar repositories for bsard:
Users that are interested in bsard are comparing it to the libraries listed below
- A multilingual version of MS MARCO passage ranking datasetβ143Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.β74Updated 3 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsβ39Updated last year
- Dense hybrid representations for text retrievalβ62Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β51Updated last year
- β84Updated 7 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β93Updated 2 years ago
- β54Updated 2 years ago
- β58Updated 2 years ago
- β44Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- Automatically detect errors in annotated corpora.β47Updated last year
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatioβ¦β44Updated last year
- β97Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"β100Updated 2 years ago
- Unified Learned Sparse Retrieval Frameworkβ64Updated 10 months ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.β102Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ126Updated last year
- πΈοΈ A graph-augmented dense statute retriever. (EACL 2023)β21Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillationβ109Updated 3 years ago
- β42Updated last year
- β68Updated 3 years ago
- β45Updated 3 years ago
- β29Updated last year
- Inquisitive Parrots for Searchβ189Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal β¦β32Updated 3 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedbackβ94Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.β55Updated 11 months ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paperβ70Updated last year
- Train Dense Passage Retriever (DPR) with a single GPUβ130Updated 3 years ago