facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆271Updated last year
Related projects
Alternatives and complementary repositories for dpr-scale
- Inquisitive Parrots for Search☆178Updated 8 months ago
- Search Engines with Autoregressive Language models☆277Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆101Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods.☆141Updated 6 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated 2 years ago
- Tevatron - A flexible toolkit for neural retrieval research and development.☆524Updated last month
- Build Text Rerankers with Deep Language Models☆251Updated 9 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation" (EMNLP 2022)☆96Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper: https://arxiv.org/abs/2212.01349)☆156Updated last year
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆269Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆160Updated last year
- Zero-shot Document Ranking with Large Language Models.☆96Updated 4 months ago
- DSIR large-scale data selection framework for language model training☆230Updated 7 months ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆243Updated 2 years ago
- Train Dense Passage Retriever (DPR) with a single GPU☆127Updated 3 years ago
- A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"☆164Updated 2 years ago
- Dense hybrid representations for text retrieval☆62Updated last year
- Dataset collection and preprocessing framework for NLP extreme multitask learning☆149Updated 4 months ago
- Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03…☆517Updated 11 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆190Updated this week
- Unified Learned Sparse Retrieval Framework☆60Updated 6 months ago
- Finetune mistral-7b-instruct for sentence embeddings☆71Updated 6 months ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers☆122Updated 8 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆213Updated last year
- The official repository for the paper "Efficient Long-Text Understanding Using Short-Text Models" (Ivgi et al., 2022)☆67Updated last year