facebookresearch / distributed-faiss
A library for building and serving multi-node distributed faiss indices.
☆268 Updated last year
Alternatives and similar repositories for distributed-faiss
Users who are interested in distributed-faiss are comparing it to the libraries listed below.
- Some useful tips for faiss ☆622 Updated last year
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆867 Updated last year
- ⚡ A fast embedded library for approximate nearest neighbor search ☆232 Updated 2 years ago
- hnsw implemented by python ☆68 Updated 6 years ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆340 Updated 7 months ago
- Scalable training for dense retrieval models. ☆299 Updated last month
- Code repository for the paper - "Matryoshka Representation Learning" ☆527 Updated last year
- ☆411 Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆314 Updated 2 years ago
- Inquisitive Parrots for Search ☆193 Updated last month
- Pure python implementation of product quantization for nearest neighbor search ☆351 Updated last month
- Search Engines with Autoregressive Language models ☆290 Updated 2 years ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages. ☆190 Updated 11 months ago
- The pipeline for the OSCAR corpus ☆171 Updated last year
- A scalable & efficient active learning/data selection system for everyone. ☆214 Updated last year
- CLIR version of ColBERT ☆70 Updated last month
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆63 Updated 9 months ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆112 Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆65 Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆150 Updated last year
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto… ☆94 Updated last year
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s… ☆139 Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web ☆178 Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆155 Updated last year
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆161 Updated 4 years ago
- Scalable PaLM implementation of PyTorch ☆190 Updated 2 years ago
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks ☆372 Updated 2 years ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint ☆398 Updated last year
- experiments with inference on llama ☆104 Updated last year
- Build Text Rerankers with Deep Language Models ☆262 Updated last year