ashvardanian / SimSIMDLinks
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
β1,510Updated 3 weeks ago
Alternatives and similar repositories for SimSIMD
Users that are interested in SimSIMD are comparing it to the libraries listed below
Sorting:
- Fast Open-Source Search & Clustering engine Γ for Vectors & Arbitrary Objects Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Sβ¦β3,140Updated this week
- Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to acceleraβ¦β2,796Updated this week
- Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindingsβ¦β607Updated 2 years ago
- cuVS - a library for vector search and clustering on the GPUβ525Updated this week
- RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-aβ¦β933Updated this week
- C++ template library for high performance SIMD based sorting algorithmsβ977Updated last week
- An efficient C++17 GPU numerical computing library with Python-like syntaxβ1,356Updated this week
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Liβ¦β1,756Updated this week
- β1,055Updated 4 months ago
- Next-Gen Big Data File Formatβ482Updated 2 weeks ago
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception haβ¦β1,856Updated 2 weeks ago
- Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Searchβ1,486Updated this week
- Performance-portable, length-agnostic SIMD with runtime dispatchβ5,036Updated this week
- Complete implementations from "Algorithms for Modern Hardware"β774Updated 2 years ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUsβ356Updated 5 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). β¦β2,341Updated 2 months ago
- Deep learning at the speed of light.β2,527Updated this week
- nsync is a C library that exports various synchronization primitives, such as mutexesβ1,215Updated 2 weeks ago
- Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets β¦β1,027Updated last week
- HNSW tutorialβ150Updated last year
- CUDA Core Compute Librariesβ1,918Updated last week
- Library for reading and writing large multi-dimensional arrays.β1,442Updated this week
- RAPIDS Memory Managerβ619Updated this week
- β585Updated last week
- A collection of lock-free data structures written in standard C++11β927Updated 8 months ago
- A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the webβ1,737Updated last year
- A curated list of awesome SIMD frameworks, libraries and softwareβ204Updated last year
- Minimal LLM inference in Rustβ1,013Updated 11 months ago
- Super-fast Structured Outputsβ484Updated last month
- Tile primitives for speedy kernelsβ2,734Updated last week