facebookresearch / distributed-faiss
A library for building and serving multi-node distributed faiss indices.
☆253 Updated last year
Related projects
Alternatives and complementary repositories for distributed-faiss
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆817 Updated 6 months ago
- Some useful tips for faiss ☆594 Updated last year
- ⚡ A fast embedded library for approximate nearest neighbor search ☆217 Updated last year
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆142 Updated 3 years ago
- Build Text Rerankers with Deep Language Models ☆251 Updated 9 months ago
- Framework for evaluating ANNS algorithms on billion scale datasets. ☆356 Updated this week
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆306 Updated last year
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages. ☆168 Updated 3 months ago
- Scalable training for dense retrieval models. ☆271 Updated last year
- Tevatron - A flexible toolkit for neural retrieval research and development. ☆524 Updated last month
- Inquisitive Parrots for Search ☆178 Updated 8 months ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint ☆361 Updated 7 months ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆308 Updated 5 months ago
- Search Engines with Autoregressive Language models ☆277 Updated last year
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning ☆685 Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆206 Updated 10 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆141 Updated 6 months ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s… ☆112 Updated last year
- ☆412 Updated last year
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆168 Updated 3 years ago
- The pipeline for the OSCAR corpus ☆162 Updated 11 months ago
- DSIR large-scale data selection framework for language model training ☆230 Updated 7 months ago
- Fast Inference Solutions for BLOOM ☆560 Updated last month
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆780 Updated 6 months ago
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re… ☆318 Updated last year
- Flexible classic and NeurAl Retrieval Toolkit ☆214 Updated 4 months ago
- DataComp: In search of the next generation of multimodal datasets ☆657 Updated 10 months ago
- Scaling Data-Constrained Language Models ☆321 Updated last month
- Code repository for the paper - "Matryoshka Representation Learning" ☆428 Updated 9 months ago
- ☆59 Updated last year