A powerful unsupervised domain adaptation method for dense retrieval. It requires only an unlabeled corpus and yields large improvements: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
☆340 · Jul 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for gpl
Users who are interested in gpl compare it to the libraries listed below.
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… · ☆49 · Apr 25, 2022 · Updated 3 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. · ☆2,087 · Oct 16, 2025 · Updated 4 months ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering · ☆175 · Jun 6, 2021 · Updated 4 years ago
- docTTTTTquery document expansion model · ☆374 · Mar 25, 2023 · Updated 2 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. · ☆47 · Jul 25, 2023 · Updated 2 years ago
- Inquisitive Parrots for Search · ☆199 · Jun 5, 2025 · Updated 8 months ago
- This repository helps you evaluate your models on the FreshStack benchmark! · ☆33 · Dec 9, 2025 · Updated 2 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. · ☆80 · Feb 16, 2022 · Updated 4 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch · ☆265 · Jan 27, 2023 · Updated 3 years ago
- Efficient few-shot learning with Sentence Transformers · ☆2,688 · Dec 11, 2025 · Updated 2 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… · ☆41 · Jan 5, 2022 · Updated 4 years ago
- Search Engines with Autoregressive Language models · ☆295 · Apr 4, 2023 · Updated 2 years ago
- SGPT: GPT Sentence Embeddings for Semantic Search · ☆873 · Feb 17, 2024 · Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval · ☆256 · Mar 18, 2022 · Updated 3 years ago
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation · ☆115 · Jul 11, 2021 · Updated 4 years ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations. · ☆2,023 · Feb 21, 2026 · Updated last week
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling · ☆60 · Jul 11, 2021 · Updated 4 years ago
- ☆75 · Jul 2, 2021 · Updated 4 years ago
- SQuARE: Software for question answering research. · ☆75 · Jun 25, 2024 · Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. · ☆24 · Sep 24, 2023 · Updated 2 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini · ☆352 · Dec 21, 2023 · Updated 2 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. · ☆727 · Jan 26, 2026 · Updated last month
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) · ☆3,782 · Oct 14, 2025 · Updated 4 months ago
- A toolkit for end-to-end neural ad hoc retrieval · ☆97 · Aug 20, 2024 · Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) · ☆40 · Aug 2, 2021 · Updated 4 years ago
- Scalable training for dense retrieval models. · ☆298 · Jun 10, 2025 · Updated 8 months ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning · ☆772 · Apr 7, 2023 · Updated 2 years ago
- A Python framework for performing information retrieval experiments, building on http://terrier.org/ · ☆495 · Updated this week
- SPLADE: sparse neural search (SIGIR21, SIGIR22) · ☆979 · May 3, 2024 · Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… · ☆1,265 · Jul 24, 2025 · Updated 7 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… · ☆49 · Nov 13, 2023 · Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks · ☆926 · Sep 2, 2024 · Updated last year
- Open-Source Information Retrieval Courses @ TU Wien · ☆697 · Jun 12, 2023 · Updated 2 years ago
- Measuring if attention is explanation with ROAR · ☆22 · Mar 3, 2023 · Updated 3 years ago
- Provides a common interface to many IR ranking datasets. · ☆381 · Feb 20, 2026 · Updated last week
- A Test Collection of Computer Science Papers for Faceted Query by Example · ☆22 · Nov 28, 2021 · Updated 4 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering. · ☆1,752 · Dec 20, 2023 · Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… · ☆606 · Jun 15, 2022 · Updated 3 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) · ☆61 · Jun 12, 2023 · Updated 2 years ago