facebookresearch / distributed-faiss
A library for building and serving multi-node distributed faiss indices.
☆276 Updated 2 years ago
Alternatives and similar repositories for distributed-faiss
Users who are interested in distributed-faiss are comparing it to the libraries listed below.
- Some useful tips for faiss ☆628 Updated 4 months ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆892 Updated 2 months ago
- ⚡ A fast embedded library for approximate nearest neighbor search ☆235 Updated 2 years ago
- HNSW implemented in Python ☆72 Updated 6 years ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆345 Updated last year
- A scalable & efficient active learning/data selection system for everyone. ☆217 Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆318 Updated 2 years ago
- Scalable training for dense retrieval models. ☆298 Updated 7 months ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆172 Updated 4 years ago
- Pipeline for pulling and processing online language model pretraining data from the web ☆179 Updated 2 years ago
- ☆413 Updated 2 years ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint ☆421 Updated last year
- Inquisitive Parrots for Search ☆199 Updated 7 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆66 Updated 2 years ago
- experiments with inference on llama ☆103 Updated last year
- ☆122 Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p… ☆433 Updated 3 years ago
- Search Engines with Autoregressive Language models ☆295 Updated 2 years ago
- The pipeline for the OSCAR corpus ☆175 Updated 2 months ago
- Scalable PaLM implementation in PyTorch ☆189 Updated 3 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆593 Updated last month
- ☆252 Updated last year
- Code repository for the paper - "Matryoshka Representation Learning" ☆588 Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆113 Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆155 Updated last year
- DSIR large-scale data selection framework for language model training ☆268 Updated last year
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆174 Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆157 Updated 2 years ago
- ☆87 Updated 3 years ago
- Visualize HNSW, Faiss, and other ANN indexes ☆468 Updated 2 years ago