Web-scale retrieval for knowledge-intensive NLP
☆553 · Dec 6, 2022 · Updated 3 years ago
Alternatives and similar repositories for Sphere
Users who are interested in Sphere are comparing it to the libraries listed below.
- Library for Knowledge Intensive Language Tasks ☆967 · Mar 31, 2022 · Updated 3 years ago
- The AI Knowledge Editor ☆184 · Jul 12, 2022 · Updated 3 years ago
- A library for building and serving multi-node distributed faiss indices. ☆276 · Nov 1, 2023 · Updated 2 years ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations. ☆2,023 · Feb 21, 2026 · Updated last week
- Search Engines with Autoregressive Language models ☆295 · Apr 4, 2023 · Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆206 · Aug 17, 2022 · Updated 3 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆2,087 · Oct 16, 2025 · Updated 4 months ago
- FastFormers - highly efficient transformer models for NLU ☆709 · Mar 21, 2025 · Updated 11 months ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering. ☆1,752 · Dec 20, 2023 · Updated 2 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ☆727 · Jan 26, 2026 · Updated last month
- The implementation of DeBERTa ☆2,197 · Sep 29, 2023 · Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers ☆2,688 · Dec 11, 2025 · Updated 2 months ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆979 · May 3, 2024 · Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆340 · Jul 6, 2023 · Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆606 · Jun 15, 2022 · Updated 3 years ago
- Autoregressive Entity Retrieval ☆797 · Jul 6, 2023 · Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation. ☆791 · Jul 10, 2025 · Updated 7 months ago
- A python package for benchmarking interpretability techniques on Transformers. ☆215 · Sep 29, 2024 · Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆80 · Feb 16, 2022 · Updated 4 years ago
- SGPT: GPT Sentence Embeddings for Semantic Search ☆873 · Feb 17, 2024 · Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆32 · Jan 4, 2023 · Updated 3 years ago
- ☆51 · Jun 21, 2025 · Updated 8 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 · Jul 25, 2023 · Updated 2 years ago
- Inquisitive Parrots for Search ☆199 · Jun 5, 2025 · Updated 8 months ago
- Scalable training for dense retrieval models. ☆298 · Jun 10, 2025 · Updated 8 months ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https://arxiv.org/abs/2212.01349) ☆158 · Jan 6, 2023 · Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 · Sep 2, 2024 · Updated last year
- Question-answers, collected from Google ☆132 · Jul 23, 2021 · Updated 4 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p… ☆433 · Aug 17, 2022 · Updated 3 years ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them" ☆209 · Aug 31, 2021 · Updated 4 years ago
- Ask Me Anything language model prompting ☆547 · Jul 5, 2023 · Updated 2 years ago
- ☆2,946 · Jan 15, 2026 · Updated last month
- Used for adaptive human in the loop evaluation of language and embedding models. ☆308 · Mar 1, 2023 · Updated 3 years ago
- Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks. ☆1,860 · Apr 6, 2023 · Updated 2 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models ☆56 · Jul 23, 2023 · Updated 2 years ago
- An Open-Source Package for Information Retrieval. ☆442 · Oct 7, 2022 · Updated 3 years ago
- QED: A Framework and Dataset for Explanations in Question Answering ☆119 · Aug 3, 2021 · Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Jan 17, 2023 · Updated 3 years ago