TusKANNy / kannolo
Official repository of kANNolo.
☆26 Updated 4 months ago
Alternatives and similar repositories for kannolo
Users who are interested in kannolo are comparing it to the libraries listed below:
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al. ☆75 Updated 2 months ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆60 Updated 6 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆63 Updated last year
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net ☆15 Updated 3 weeks ago
- Tree-based indexes for neural-search ☆29 Updated last year
- NLP with Rust for Python 🦀🐍 ☆61 Updated 9 months ago
- A text embedding extension for the Polars Dataframe library. ☆24 Updated 4 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆19 Updated last week
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini, "Efficient Inverted Indexes for Approximate Retrieva… ☆56 Updated 2 weeks ago
- Inference engine for GLiNER models, in Rust ☆44 Updated this week
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆49 Updated 9 months ago
- History of Open-Source IR Systems ☆11 Updated 2 months ago
- Official code for "Binary embedding based retrieval at Tencent" ☆42 Updated last year
- Because it's there. ☆16 Updated 6 months ago
- ☆28 Updated 4 months ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi… ☆23 Updated last month
- Training code for Sparse Autoencoders on Embedding models ☆38 Updated last month
- implement llava using candle ☆14 Updated 9 months ago
- A small rust-based data loader ☆24 Updated 3 months ago
- Make triton easier ☆47 Updated 9 months ago
- ☆54 Updated 7 months ago
- Code for "Training-free Graph Neural Networks and the Power of Labels as Features" (TMLR 2024) ☆55 Updated 7 months ago
- Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation ☆55 Updated this week
- ☆11 Updated 2 months ago
- Pre-train Static Word Embeddings ☆51 Updated 3 weeks ago
- ☆12 Updated last year
- Modular Rust transformer/LLM library using Candle ☆36 Updated 10 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆18 Updated last year
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers ☆133 Updated 3 months ago
- Read and write tensorboard data using Rust ☆20 Updated last year