ashvardanian / SimSIMDLinks
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
β1,525Updated this week
Alternatives and similar repositories for SimSIMD
Users that are interested in SimSIMD are comparing it to the libraries listed below
Sorting:
- Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to acceleraβ¦β2,894Updated this week
- Fast Open-Source Search & Clustering engine Γ for Vectors & Arbitrary Objects Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Sβ¦β3,176Updated last week
- Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uβ¦β1,269Updated last month
- C++ template library for high performance SIMD based sorting algorithmsβ980Updated last month
- Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindingsβ¦β605Updated 2 years ago
- cuVS - a library for vector search and clustering on the GPUβ546Updated this week
- β1,057Updated 5 months ago
- Next-Gen Big Data File Formatβ506Updated last week
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-aβ¦β944Updated this week
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception haβ¦β1,864Updated last month
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Liβ¦β1,811Updated this week
- Very fast, high quality, platform-independent hashing algorithm.β657Updated 2 weeks ago
- An efficient C++17 GPU numerical computing library with Python-like syntaxβ1,356Updated this week
- nsync is a C library that exports various synchronization primitives, such as mutexesβ1,218Updated last month
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Searchβ1,509Updated 2 weeks ago
- HNSW tutorialβ152Updated last year
- Complete implementations from "Algorithms for Modern Hardware"β776Updated 2 years ago
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extensβ¦β1,527Updated last week
- nanobind: tiny and efficient C++/Python bindingsβ3,079Updated this week
- Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets β¦β1,031Updated last week
- Performance-portable, length-agnostic SIMD with runtime dispatchβ5,057Updated last week
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUsβ364Updated 5 months ago
- A novel data compression frameworkβ2,199Updated last week
- A curated list of awesome SIMD frameworks, libraries and softwareβ210Updated last year
- NVIDIA Math Libraries for the Python Ecosystemβ516Updated last month
- Lower-latency OpenMP-style minimalistic scoped thread-pool designed for 'Fork-Join' parallelism in Rust and C++, avoiding memory allocatiβ¦β260Updated last week
- Multi-platform high-performance compute language extension for Rust.β1,732Updated last week
- New file format for storage of large columnar datasets.β624Updated this week
- Minimal LLM inference in Rustβ1,013Updated 11 months ago
- Fast Static Symbol Table (FSST): efficient random-access string compressionβ466Updated last week