Powerful unsupervised domain adaptation method for dense retrieval. Requires only an unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
☆341 · Jul 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for gpl
Users that are interested in gpl are comparing it to the libraries listed below
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 · Apr 25, 2022 · Updated 3 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆2,107 · Oct 16, 2025 · Updated 5 months ago
- docTTTTTquery document expansion model ☆374 · Mar 25, 2023 · Updated 2 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 · Jul 25, 2023 · Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- Inquisitive Parrots for Search ☆200 · Jun 5, 2025 · Updated 9 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆80 · Feb 16, 2022 · Updated 4 years ago
- This repository helps you evaluate your models on the FreshStack benchmark! ☆33 · Dec 9, 2025 · Updated 3 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,699 · Dec 11, 2025 · Updated 3 months ago
- SQuARE: Software for question answering research. ☆75 · Jun 25, 2024 · Updated last year
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling ☆60 · Jul 11, 2021 · Updated 4 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 · Sep 24, 2023 · Updated 2 years ago
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation ☆115 · Jul 11, 2021 · Updated 4 years ago
- SGPT: GPT Sentence Embeddings for Semantic Search ☆873 · Feb 17, 2024 · Updated 2 years ago
- Search Engines with Autoregressive Language models ☆295 · Apr 4, 2023 · Updated 2 years ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations. ☆2,036 · Mar 9, 2026 · Updated last week
- EMNLP 2021 - Pre-training architectures for dense retrieval ☆256 · Mar 18, 2022 · Updated 4 years ago
- ☆21 · Sep 6, 2021 · Updated 4 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch ☆265 · Jan 27, 2023 · Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 · Jan 5, 2022 · Updated 4 years ago
- ☆75 · Jul 2, 2021 · Updated 4 years ago
- Scalable training for dense retrieval models. ☆298 · Jun 10, 2025 · Updated 9 months ago
- Codebase for RetroMAE and beyond. ☆272 · Jun 7, 2024 · Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini ☆352 · Dec 21, 2023 · Updated 2 years ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ☆3,799 · Oct 14, 2025 · Updated 5 months ago
- A Test Collection of Computer Science Papers for Faceted Query by Example ☆22 · Nov 28, 2021 · Updated 4 years ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning ☆772 · Apr 7, 2023 · Updated 2 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ☆734 · Jan 26, 2026 · Updated last month
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings" ☆297 · Oct 27, 2022 · Updated 3 years ago
- A toolkit for end-to-end neural ad hoc retrieval ☆97 · Aug 20, 2024 · Updated last year
- A multilingual version of MS MARCO passage ranking dataset ☆147 · Oct 19, 2023 · Updated 2 years ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆983 · May 3, 2024 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆49 · Nov 13, 2023 · Updated 2 years ago
- Collections of IR Research ☆37 · May 18, 2025 · Updated 10 months ago
- Open-Source Information Retrieval Courses @ TU Wien ☆698 · Jun 12, 2023 · Updated 2 years ago
- State-of-the-Art Text Embeddings ☆18,427 · Mar 12, 2026 · Updated last week
- Provides a common interface to many IR ranking datasets. ☆386 · Feb 20, 2026 · Updated last month
- A Python framework for performing information retrieval experiments, building on http://terrier.org/ ☆497 · Mar 2, 2026 · Updated 3 weeks ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆40 · Aug 2, 2021 · Updated 4 years ago