facebookresearch / SEAL
Search Engines with Autoregressive Language models
☆282Updated last year
Alternatives and similar repositories for SEAL:
Users who are interested in SEAL are comparing it to the repositories listed below.
- Scalable training for dense retrieval models.☆275Updated last year
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them"☆202Updated 3 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆244Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆168Updated 3 years ago
- Train Dense Passage Retriever (DPR) with a single GPU☆130Updated 3 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆271Updated 2 years ago
- Multi-hop dense retrieval for question answering☆213Updated 3 years ago
- Build Text Rerankers with Deep Language Models☆257Updated 11 months ago
- Inquisitive Parrots for Search☆184Updated 11 months ago
- A gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆347Updated last year
- EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535☆145Updated 2 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆291Updated last year
- A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"☆169Updated 2 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆262Updated 2 years ago
- Fusion-in-Decoder☆559Updated last year
- Unified Learned Sparse Retrieval Framework☆63Updated 9 months ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆99Updated 2 years ago
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆109Updated 3 years ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper: https://arxiv.org/abs/2212.01349)☆157Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year
- NAACL2021 - COIL Contextualized Lexical Retriever☆151Updated 3 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆132Updated 8 months ago
- MS MARCO (Microsoft Machine Reading Comprehension) is a large-scale dataset focused on machine reading comprehension, question answering, …☆123Updated 3 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆153Updated last year