criteo / autofaiss
Automatically create Faiss knn indices with optimal similarity search parameters.
☆850 · Updated 11 months ago
Alternatives and similar repositories for autofaiss:
Users who are interested in autofaiss are comparing it to the libraries listed below:
- A library for building and serving multi-node distributed faiss indices. ☆264 · Updated last year
- Some useful tips for faiss ☆619 · Updated last year
- OpenAI CLIP text encoders for multiple languages! ☆795 · Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch ☆863 · Updated last year
- Code repository for the paper - "Matryoshka Representation Learning" ☆487 · Updated last year
- DataComp: In search of the next generation of multimodal datasets ☆700 · Updated last year
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,684 · Updated 6 months ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,564 · Updated last year
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆837 · Updated 11 months ago
- Blazing fast framework for fine-tuning similarity learning models ☆657 · Updated 2 weeks ago
- Community-maintained faiss wheel builder ☆320 · Updated 2 months ago
- SGPT: GPT Sentence Embeddings for Semantic Search ☆865 · Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ... ☆317 · Updated last year
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint ☆387 · Updated last year
- ⚡ A fast embedded library for approximate nearest neighbor search ☆229 · Updated last year
- minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model. ☆458 · Updated last year
- CLIP-like model evaluation ☆696 · Updated 3 weeks ago
- Easily compute clip embeddings and build a clip retrieval system with them ☆2,544 · Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆573 · Updated 2 weeks ago
- WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag… ☆1,049 · Updated 6 months ago
- Library for 8-bit optimizers and quantization routines. ☆716 · Updated 2 years ago
- Code for the ALiBi method for transformer language models (ICLR 2022) ☆522 · Updated last year
- The merlin dataloader lets you rapidly load tabular data for training deep learning models with TensorFlow, PyTorch or JAX ☆418 · Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆1,780 · Updated 2 months ago
- Collections of vector search related libraries, service and research papers ☆1,484 · Updated 8 months ago
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert… ☆1,422 · Updated last month
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch ☆1,238 · Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆884 · Updated last year
- Creative interactive views of any dataset. ☆837 · Updated 4 months ago
- Robust fine-tuning of zero-shot models ☆696 · Updated 2 years ago