fabiocannizzo / FastBinarySearchLinks
Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers
☆140Updated 5 months ago
Alternatives and similar repositories for FastBinarySearch
Users that are interested in FastBinarySearch are comparing it to the libraries listed below
Sorting:
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆61Updated 8 months ago
- High-Performance SGEMM on CUDA devices☆94Updated 4 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆18Updated 8 months ago
- LLM training in simple, raw C/CUDA☆99Updated last year
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆80Updated 4 months ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps.☆35Updated 2 years ago
- Clover: Quantized 4-bit Linear Algebra Library☆112Updated 7 years ago
- Simple high-throughput inference library☆115Updated 3 weeks ago
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- Make triton easier☆47Updated 11 months ago
- Gpu benchmark☆63Updated 4 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆153Updated 3 weeks ago
- Explore training for quantized models☆18Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆268Updated last week
- A place to store reusable transformer components of my own creation or found on the interwebs☆56Updated 3 weeks ago
- Inference of Mamba models in pure C☆186Updated last year
- 🔶 Compressed bitvector/container supporting efficient random access and rank queries☆43Updated 9 months ago
- A C++ library providing fast language model queries in compressed space.☆129Updated 2 years ago
- ☆21Updated 3 months ago
- ☆108Updated last year
- extensible collectives library in triton☆87Updated 2 months ago
- Experiment of using Tangent to autodiff triton☆79Updated last year
- ☆13Updated 4 years ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆76Updated 9 months ago
- Samples of good AI generated CUDA kernels☆65Updated last week
- ☆28Updated 4 months ago
- Official software repository of L. Delfino, D. Erriquez, S. Martinico, F. M. Nardini, C. Rulli, and R. Venturini. "kANNolo: Sweet and Smo…☆31Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆60Updated last month
- Python bindings for the fast integer compression library FastPFor.☆58Updated last year
- FlexAttention w/ FlashAttention3 Support☆26Updated 8 months ago