maastrichtlawtech / bsardLinks
π A statutory article retrieval dataset in French. (ACL 2022)
β40Updated last year
Alternatives and similar repositories for bsard
Users that are interested in bsard are comparing it to the libraries listed below
Sorting:
- A multilingual version of MS MARCO passage ranking datasetβ145Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.β76Updated 3 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsβ43Updated last year
- β43Updated 2 years ago
- Inquisitive Parrots for Searchβ191Updated this week
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.β104Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β334Updated last year
- β86Updated 2 months ago
- β98Updated 2 years ago
- Unified Learned Sparse Retrieval Frameworkβ64Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"β101Updated 2 years ago
- πΈοΈ A graph-augmented dense statute retriever. (EACL 2023)β21Updated last year
- Dense hybrid representations for text retrievalβ62Updated 2 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataβ¦β90Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held β¦β41Updated 2 years ago
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillationβ111Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)β113Updated 2 years ago
- β37Updated 2 years ago
- β59Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β53Updated last year
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."β36Updated 2 years ago
- Search Engines with Autoregressive Language modelsβ286Updated 2 years ago
- β54Updated 2 years ago
- β47Updated 3 years ago
- Retrieval-Augmented Generation battle!β51Updated 5 months ago
- β34Updated 8 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.β45Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ128Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laβ¦β48Updated last year