maastrichtlawtech / bsard
๐ A statutory article retrieval dataset in French. (ACL 2022)
โ39Updated last year
Alternatives and similar repositories for bsard:
Users that are interested in bsard are comparing it to the libraries listed below
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsโ37Updated 11 months ago
- A multilingual version of MS MARCO passage ranking datasetโ143Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.โ55Updated 10 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.โ74Updated 3 years ago
- โ55Updated 2 years ago
- Efficient Attention for Long Sequence Processingโ92Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puโฆโ40Updated 3 years ago
- โ84Updated 6 months ago
- Inquisitive Parrots for Searchโ187Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.โ51Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarizationโ155Updated 2 years ago
- Dense hybrid representations for text retrievalโ62Updated last year
- โ41Updated 3 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.โ47Updated 2 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillationsโ132Updated 9 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: โฆโ328Updated last year
- This project aims at creating a search engine based on BERT language model.โ19Updated 4 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"โ100Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laโฆโ46Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillationโ109Updated 3 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Promptingโ27Updated last year
- โ16Updated 2 years ago
- โ36Updated 2 years ago
- CLIR version of ColBERTโ67Updated 5 months ago
- ๐ฆฎ Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieโฆโ49Updated 2 years ago
- โ42Updated last year
- ๐ธ๏ธ A graph-augmented dense statute retriever. (EACL 2023)โ21Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedbackโ93Updated last year
- Using business-level retrieval system (BM25) with Python in just a few lines.โ31Updated 2 years ago
- Code to reproduce NeuralMind's submissions to COLIEE 2021 and COLIEE 2022โ24Updated 2 years ago