fabiocannizzo / FastBinarySearch
Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers
☆136Updated 4 months ago
Alternatives and similar repositories for FastBinarySearch:
Users that are interested in FastBinarySearch are comparing it to the libraries listed below
- High-Performance SGEMM on CUDA devices☆90Updated 3 months ago
- LLM training in simple, raw C/CUDA☆92Updated 11 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆77Updated 3 months ago
- Make triton easier☆47Updated 10 months ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆61Updated 6 months ago
- extensible collectives library in triton☆85Updated 3 weeks ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- Clover: Quantized 4-bit Linear Algebra Library☆112Updated 6 years ago
- ☆27Updated 3 months ago
- FlexAttention w/ FlashAttention3 Support☆26Updated 6 months ago
- Inference of Mamba models in pure C☆187Updated last year
- A C++ library providing fast language model queries in compressed space.☆129Updated 2 years ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆57Updated last week
- ☆101Updated 10 months ago
- ☆21Updated last month
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 7 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆257Updated 3 weeks ago
- ☆13Updated 3 years ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm☆156Updated 4 years ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- RWKV-7: Surpassing GPT☆83Updated 5 months ago
- ☆78Updated 5 months ago
- Fast low-bit matmul kernels in Triton☆291Updated this week
- Experiment of using Tangent to autodiff triton☆78Updated last year
- Gpu benchmark☆59Updated 2 months ago
- Explore training for quantized models☆17Updated 3 months ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 3 months ago
- Experiments with BitNet inference on CPU☆53Updated last year
- TORCH_LOGS parser for PT2☆37Updated 2 weeks ago