facebookresearch / distributed-faiss
A library for building and serving multi-node distributed faiss indices.
☆252 · Updated last year
Related projects
Alternatives and complementary repositories for distributed-faiss
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆812 · Updated 5 months ago
- Some useful tips for faiss ☆592 · Updated last year
- ⚡ A fast embedded library for approximate nearest neighbor search ☆217 · Updated last year
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages. ☆167 · Updated 3 months ago
- Framework for evaluating ANNS algorithms on billion scale datasets. ☆352 · Updated last week
- ☆411 · Updated 11 months ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆305 · Updated last year
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint ☆359 · Updated 7 months ago
- Scalable training for dense retrieval models. ☆270 · Updated last year
- Build Text Rerankers with Deep Language Models ☆251 · Updated 8 months ago
- Inquisitive Parrots for Search ☆177 · Updated 8 months ago
- Efficient, check-pointed data loading for deep learning with massive data sets. ☆205 · Updated last year
- HNSW implemented in Python ☆62 · Updated 5 years ago
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03… ☆514 · Updated 11 months ago
- Search Engines with Autoregressive Language models ☆277 · Updated last year
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto… ☆90 · Updated 4 months ago
- DSIR large-scale data selection framework for language model training ☆227 · Updated 7 months ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆140 · Updated 3 years ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆308 · Updated 5 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆206 · Updated 9 months ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆111 · Updated last year
- ☆58 · Updated last year
- cuVS - a library for vector search and clustering on the GPU ☆210 · Updated this week
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆168 · Updated 3 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆558 · Updated this week
- Tevatron - A flexible toolkit for neural retrieval research and development. ☆517 · Updated 2 weeks ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s… ☆112 · Updated last year
- Fast Inference Solutions for BLOOM ☆560 · Updated last month
- Codebase for RetroMAE and beyond. ☆237 · Updated 5 months ago
- Running BERT without Padding ☆460 · Updated 2 years ago