ashvardanian / SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
β1,304Updated this week
Alternatives and similar repositories for SimSIMD:
Users that are interested in SimSIMD are comparing it to the libraries listed below
- Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, Cβ¦β2,618Updated last month
- Up to 10x faster strings for C, C++, Python, Rust, Swift & Go, leveraging NEON, AVX2, AVX-512, SVE, & SWAR to accelerate search, hashing,β¦β2,476Updated this week
- Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uβ¦β1,200Updated 2 months ago
- Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindingsβ¦β585Updated last year
- An extensible, state of the art columnar file formatβ1,152Updated this week
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-aβ¦β862Updated this week
- C++ template library for high performance SIMD based sorting algorithmsβ923Updated this week
- Performance-portable, length-agnostic SIMD with runtime dispatchβ4,493Updated last week
- Learning how to write "Less Slow" code in C++ 20, C 99, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handβ¦β495Updated last month
- β1,030Updated 4 months ago
- cuVS - a library for vector search and clustering on the GPUβ356Updated this week
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Searchβ1,288Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntaxβ1,306Updated this week
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.β385Updated last month
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extensβ¦β1,315Updated last week
- Complete implementations from "Algorithms for Modern Hardware"β739Updated 2 years ago
- Tile primitives for speedy kernelsβ2,208Updated this week
- A collection of lock-free data structures written in standard C++11β863Updated 2 months ago
- A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the webβ1,716Updated 8 months ago
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vβ¦β4,332Updated this week
- Making data lake work for time seriesβ1,160Updated 7 months ago
- Deep learning at the speed of light.β1,528Updated 3 weeks ago
- nsync is a C library that exports various synchronization primitives, such as mutexesβ1,147Updated 8 months ago
- Time series analysis for Rust, with bindings to Python and Javascriptβ480Updated this week
- Fine-grained parallelism with sub-nanosecond overhead in Zigβ817Updated 3 weeks ago
- Framework for evaluating ANNS algorithms on billion scale datasets.β366Updated last month
- β576Updated 2 weeks ago
- CLIP inference in plain C/C++ with no extra dependenciesβ487Updated 7 months ago
- Implementations of SIMD instruction sets for systems which don't natively support them.β2,610Updated 3 weeks ago
- Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuildβ2,173Updated this week