criteo / autofaiss
Automatically create Faiss knn indices with the most optimal similarity search parameters.
☆845Updated 10 months ago
Alternatives and similar repositories for autofaiss:
Users that are interested in autofaiss are comparing it to the libraries listed below
- A library for building and serving multi-node distributed faiss indices.☆264Updated last year
- Some useful tips for faiss☆616Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆860Updated last year
- DataComp: In search of the next generation of multimodal datasets☆692Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,560Updated last year
- Blazing fast framework for fine-tuning similarity learning models☆658Updated 2 months ago
- Code repository for the paper - "Matryoshka Representation Learning"☆474Updated last year
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆382Updated last year
- SGPT: GPT Sentence Embeddings for Semantic Search☆864Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,757Updated last month
- Unofficial faiss wheel builder☆313Updated last month
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆833Updated 11 months ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆571Updated last week
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,523Updated last month
- OpenAI CLIP text encoders for multiple languages!☆788Updated last year
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆531Updated last year
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,237Updated 2 years ago
- ⚡ A fast embedded library for approximate nearest neighbor search☆227Updated last year
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,784Updated this week
- Easily compute clip embeddings and build a clip retrieval system with them☆2,527Updated 11 months ago
- Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch☆640Updated 3 months ago
- Generative Representational Instruction Tuning☆613Updated 2 weeks ago
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a…☆862Updated this week
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍☆534Updated 9 months ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆721Updated last year
- maximal update parametrization (µP)☆1,486Updated 8 months ago
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆452Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆317Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆320Updated 3 months ago