fabiocannizzo / FastBinarySearch
Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers
☆113 Updated last year
Related projects:
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆53 Updated 3 weeks ago
- LLM training in simple, raw C/CUDA ☆79 Updated 4 months ago
- Make triton easier ☆39 Updated 3 months ago
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm ☆135 Updated 3 years ago
- ☆61 Updated 3 weeks ago
- Clover: Quantized 4-bit Linear Algebra Library ☆110 Updated 6 years ago
- benchmarking some transformer deployments ☆26 Updated last year
- If only std::set was a DBMS: collection of templated ACID in-memory exception-free thread-safe and concurrent containers in a header-only… ☆32 Updated last year
- Simple and fast low-bit matmul kernels in CUDA ☆48 Updated this week
- 🔶 Compressed bitvector/container supporting efficient random access and rank queries ☆40 Updated 2 weeks ago
- A tracing JIT compiler for PyTorch ☆12 Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆51 Updated 7 months ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps. ☆35 Updated last year
- ☆22 Updated last week
- Fast and memory-efficient exact attention ☆20 Updated 2 weeks ago
- cuVS - a library for vector search and clustering on the GPU ☆170 Updated this week
- ☆124 Updated last week
- ☆65 Updated last month
- Official code for "Binary embedding based retrieval at Tencent" ☆42 Updated 6 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆94 Updated 2 weeks ago
- ☆55 Updated 10 months ago
- Inference of Mamba models in pure C ☆176 Updated 6 months ago
- ☆12 Updated 3 years ago
- asynchronous/distributed speculative evaluation for llama3 ☆36 Updated last month
- A place to store reusable transformer components of my own creation or found on the interwebs ☆43 Updated 3 weeks ago
- Distributed preprocessing and data loading for language datasets ☆40 Updated 5 months ago
- ☆14 Updated 4 months ago
- ☆50 Updated this week
- ☆50 Updated 3 months ago
- Efficient and effective query auto-completion in C++. ☆51 Updated 11 months ago