facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆270Updated last year
Related projects ⓘ
Alternatives and complementary repositories for dpr-scale
- Inquisitive Parrots for Search☆177Updated 8 months ago
- ☆166Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆159Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆101Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆93Updated last year
- Search Engines with Autoregressive Language models☆277Updated last year
- ☆120Updated 2 months ago
- ☆179Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆147Updated 3 months ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆156Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated 2 years ago
- ☆95Updated last year
- Tevatron - A flexible toolkit for neural retrieval research and development.☆517Updated 2 weeks ago
- Build Text Rerankers with Deep Language Models☆251Updated 8 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆193Updated 8 months ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆457Updated 2 years ago
- Zero-shot Document Ranking with Large Language Models.☆95Updated 4 months ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆244Updated 2 years ago
- Fusion-in-Decoder☆550Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆188Updated 2 months ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆514Updated 11 months ago
- Dense hybrid representations for text retrieval☆61Updated last year
- ☆262Updated 10 months ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆269Updated 2 years ago
- Finetune mistral-7b-instruct for sentence embeddings☆70Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 7 months ago
- DSIR large-scale data selection framework for language model training☆227Updated 7 months ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆253Updated last year
- Train Dense Passage Retriever (DPR) with a single GPU☆128Updated 3 years ago