facebookresearch / distributed-faissLinks
A library for building and serving multi-node distributed faiss indices.
☆266Updated last year
Alternatives and similar repositories for distributed-faiss
Users that are interested in distributed-faiss are comparing it to the libraries listed below
Sorting:
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆854Updated last year
- Some useful tips for faiss☆622Updated last year
- ⚡ A fast embedded library for approximate nearest neighbor search☆230Updated last year
- Framework for evaluating ANNS algorithms on billion scale datasets.☆379Updated 3 weeks ago
- Scalable training for dense retrieval models.☆292Updated 3 months ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm☆158Updated 4 years ago
- hnsw implemented by python☆66Updated 6 years ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆328Updated 5 months ago
- Pure python implementation of product quantization for nearest neighbor search☆343Updated 2 weeks ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆393Updated last year
- Build Text Rerankers with Deep Language Models☆263Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆313Updated 2 years ago
- ☆411Updated last year
- Implementation of a Transformer, but completely in Triton☆266Updated 3 years ago
- Running BERT without Padding☆471Updated 3 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆64Updated last year
- Scalable PaLM implementation of PyTorch☆190Updated 2 years ago
- ☆119Updated last year
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆187Updated 10 months ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆170Updated 4 years ago
- Framework for benchmarking vector search engines☆320Updated this week
- Code repository for the paper - "Matryoshka Representation Learning"☆499Updated last year
- Inquisitive Parrots for Search☆191Updated this week
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆136Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models.☆574Updated 3 weeks ago
- Code for ECCV2018 paper: Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors☆209Updated 5 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆613Updated 2 weeks ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆316Updated last year
- experiments with inference on llama☆104Updated last year
- DSIR large-scale data selection framework for language model training☆249Updated last year