A library for building and serving multi-node distributed faiss indices.
☆276 · Nov 1, 2023 · Updated 2 years ago
Alternatives and similar repositories for distributed-faiss
Users who are interested in distributed-faiss compare it to the libraries listed below.
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆894 · Nov 4, 2025 · Updated 3 months ago
- Web-scale retrieval for knowledge-intensive NLP ☆553 · Dec 6, 2022 · Updated 3 years ago
- Some useful tips for faiss ☆629 · Sep 1, 2025 · Updated 6 months ago
- ☆69 · Feb 4, 2021 · Updated 5 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 · Feb 22, 2022 · Updated 4 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆48 · Nov 30, 2021 · Updated 4 years ago
- FastFormers - highly efficient transformer models for NLU ☆709 · Mar 21, 2025 · Updated 11 months ago
- PyTorch-based library for various kinds of representational-similarity analysis ☆24 · Jun 7, 2024 · Updated last year
- WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag… ☆1,100 · Sep 27, 2024 · Updated last year
- MEXMA: Token-level objectives improve sentence representations ☆43 · Jan 6, 2025 · Updated last year
- Library for Knowledge Intensive Language Tasks ☆965 · Mar 31, 2022 · Updated 3 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ... ☆320 · Dec 9, 2023 · Updated 2 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch ☆265 · Jan 27, 2023 · Updated 3 years ago
- Search Engines with Autoregressive Language models ☆295 · Apr 4, 2023 · Updated 2 years ago
- Collections of vector search related libraries, service and research papers ☆1,549 · Aug 6, 2024 · Updated last year
- SQuARE: Software for question answering research. ☆75 · Jun 25, 2024 · Updated last year
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,601 · Jun 10, 2025 · Updated 8 months ago
- DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets. ☆14 · Mar 9, 2022 · Updated 3 years ago
- PyTorch code for MUST ☆108 · May 1, 2025 · Updated 9 months ago
- Autoregressive Entity Retrieval ☆797 · Jul 6, 2023 · Updated 2 years ago
- DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference ☆162 · Mar 25, 2022 · Updated 3 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆2,087 · Oct 16, 2025 · Updated 4 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations. ☆2,023 · Feb 21, 2026 · Updated last week
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval ☆26 · Aug 7, 2023 · Updated 2 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images. ☆3,295 · Mar 3, 2024 · Updated last year
- Code release for SLIP Self-supervision meets Language-Image Pre-training ☆787 · Feb 9, 2023 · Updated 3 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks" ☆13 · Oct 25, 2021 · Updated 4 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆606 · Jun 15, 2022 · Updated 3 years ago
- Anh - LAION's multilingual assistant datasets and models ☆27 · Apr 5, 2023 · Updated 2 years ago
- ☆24 · Oct 23, 2020 · Updated 5 years ago
- Framework for evaluating ANNS algorithms on billion scale datasets. ☆424 · Dec 17, 2025 · Updated 2 months ago
- ☆54 · Jan 18, 2023 · Updated 3 years ago
- High performance model preprocessing library on PyTorch ☆646 · Mar 29, 2024 · Updated last year
- ⚡ A fast embedded library for approximate nearest neighbor search ☆236 · Jul 21, 2023 · Updated 2 years ago
- A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'. ☆40 · Feb 27, 2025 · Updated last year
- Code and scripts for NAACL 2022 industry track paper "Fast and Light-weight Answer Text Retrieval in Dialogue Systems". Built on top of C… ☆13 · Sep 17, 2025 · Updated 5 months ago
- Analytical solutions for logistic regression and single-layer softmax ☆12 · Jul 29, 2021 · Updated 4 years ago
- PyTorch extensions for high performance and large scale training. ☆3,400 · Apr 26, 2025 · Updated 10 months ago