maastrichtlawtech / bsard
🔍 A statutory article retrieval dataset in French. (ACL 2022)
☆39Updated last year
Alternatives and similar repositories for bsard:
Users that are interested in bsard are comparing it to the libraries listed below
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆73Updated 2 years ago
- Inquisitive Parrots for Search☆183Updated 11 months ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆37Updated 10 months ago
- ☆55Updated 2 years ago
- Search Engines with Autoregressive Language models☆281Updated last year
- EMNLP 2021 - Pre-training architectures for dense retrieval☆244Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆328Updated last year
- Dense hybrid representations for text retrieval☆62Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆109Updated 3 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 9 months ago
- Build Text Rerankers with Deep Language Models☆253Updated 11 months ago
- Efficient Attention for Long Sequence Processing☆91Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆346Updated last year
- ☆16Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- Train Dense Passage Retriever (DPR) with a single GPU☆130Updated 3 years ago
- ☆57Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- ☆42Updated last year
- ☆41Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆107Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆99Updated 2 years ago
- Unified Learned Sparse Retrieval Framework☆63Updated 8 months ago
- Scalable training for dense retrieval models.☆273Updated last year
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆261Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆99Updated last year
- Long-context pretrained encoder-decoder models☆94Updated 2 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆68Updated last year