UKPLab / gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
☆323Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gpl
- Inquisitive Parrots for Search☆178Updated 8 months ago
- A multilingual version of MS MARCO passage ranking dataset☆142Updated last year
- Search Engines with Autoregressive Language models☆277Updated last year
- Build Text Rerankers with Deep Language Models☆251Updated 9 months ago
- docTTTTTquery document expansion model☆356Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆340Updated 11 months ago
- Scalable training for dense retrieval models.☆271Updated last year
- Efficient Attention for Long Sequence Processing☆89Updated 11 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- Provides a common interface to many IR ranking datasets.☆323Updated this week
- ☆333Updated 11 months ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆168Updated 3 months ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆302Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆183Updated last month
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆417Updated 2 weeks ago
- Tevatron - A flexible toolkit for neural retrieval research and development.☆524Updated last month
- A python package for benchmarking interpretability techniques on Transformers.☆212Updated last month
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆261Updated last year
- A Python Search Engine for Humans 🥸☆185Updated 6 months ago
- ☆179Updated last year
- Neural information retrieval / Semantic search / Bi-encoders☆167Updated last year
- SpanMarker for Named Entity Recognition☆401Updated 3 months ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆243Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆99Updated last year
- ☆83Updated 2 months ago
- Zero and Few shot named entity & relationships recognition☆349Updated 2 months ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆290Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆149Updated 4 months ago
- multimodal document analysis☆160Updated 5 months ago