facebookresearch / distributed-faiss
A library for building and serving multi-node distributed faiss indices.
☆263 · Updated last year
Alternatives and similar repositories for distributed-faiss:
Users who are interested in distributed-faiss are comparing it to the libraries listed below.
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆841 · Updated 9 months ago
- ⚡ A fast embedded library for approximate nearest neighbor search ☆226 · Updated last year
- Some useful tips for faiss ☆616 · Updated last year
- Framework for evaluating ANNS algorithms on billion scale datasets. ☆366 · Updated 3 weeks ago
- Scalable training for dense retrieval models. ☆282 · Updated 2 weeks ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆155 · Updated 3 years ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆308 · Updated last year
- Build Text Rerankers with Deep Language Models ☆260 · Updated last year
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆316 · Updated 2 months ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint ☆378 · Updated 11 months ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages. ☆177 · Updated 7 months ago
- Tevatron - A flexible toolkit for neural retrieval research and development. ☆570 · Updated last week
- ☆235 · Updated this week
- Provides a common interface to many IR ranking datasets. ☆343 · Updated this week
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s… ☆130 · Updated last year
- The pipeline for the OSCAR corpus ☆166 · Updated last year
- Fast Inference Solutions for BLOOM ☆562 · Updated 5 months ago
- Inquisitive Parrots for Search ☆188 · Updated last year
- hnsw implemented by python ☆65 · Updated 5 years ago
- DSIR large-scale data selection framework for language model training ☆242 · Updated 11 months ago
- Codebase for RetroMAE and beyond. ☆253 · Updated 9 months ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ... ☆316 · Updated last year
- Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc. ☆209 · Updated last year
- Search Engines with Autoregressive Language models ☆282 · Updated last year
- ☆410 · Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆63 · Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆208 · Updated last year
- Official code for "Binary embedding based retrieval at Tencent" ☆42 · Updated last year
- docTTTTTquery document expansion model ☆361 · Updated last year
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆59 · Updated 5 months ago