facebookresearch / distributed-faiss
A library for building and serving multi-node distributed faiss indices.
☆271 Updated 2 years ago
Alternatives and similar repositories for distributed-faiss
Users who are interested in distributed-faiss are comparing it to the libraries listed below.
- Some useful tips for faiss ☆624 Updated 2 months ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters. ☆875 Updated this week
- ⚡ A fast embedded library for approximate nearest neighbor search ☆234 Updated 2 years ago
- hnsw implemented by python ☆71 Updated 6 years ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆345 Updated 10 months ago
- Inquisitive Parrots for Search ☆198 Updated 5 months ago
- Scalable training for dense retrieval models. ☆297 Updated 4 months ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆168 Updated 4 years ago
- Framework for evaluating ANNS algorithms on billion scale datasets. ☆408 Updated last week
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆316 Updated 2 years ago
- Code repository for the paper - "Matryoshka Representation Learning" ☆574 Updated last year
- The pipeline for the OSCAR corpus ☆173 Updated last year
- A scalable & efficient active learning/data selection system for everyone. ☆217 Updated last year
- ☆413 Updated last year
- experiments with inference on llama ☆103 Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆65 Updated 2 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p… ☆432 Updated 3 years ago
- Pipeline for pulling and processing online language model pretraining data from the web ☆178 Updated 2 years ago
- Search Engines with Autoregressive Language models ☆293 Updated 2 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆587 Updated last week
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆113 Updated 2 years ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages. ☆197 Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆154 Updated last year
- Build Text Rerankers with Deep Language Models ☆263 Updated last year
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s… ☆145 Updated 2 years ago
- Pure python implementation of product quantization for nearest neighbor search ☆352 Updated 5 months ago
- Scalable PaLM implementation of PyTorch ☆188 Updated 2 years ago
- ☆101 Updated 2 years ago
- Implementation of a Transformer, but completely in Triton ☆276 Updated 3 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆61 Updated 2 years ago