FALCONN-LIB / FFHT
Fast Fast Hadamard Transform
☆79 Updated 3 years ago
Alternatives and similar repositories for FFHT:
Users who are interested in FFHT are comparing it to the libraries listed below.
- The Surprisingly ParalleL spArse Tensor Toolkit. ☆71 Updated 3 years ago
- fast kernel evaluation in high dimensions via hashing ☆23 Updated 4 years ago
- FRP: Fast Random Projections ☆43 Updated 4 years ago
- sparse matrix pre-processing library ☆81 Updated 11 months ago
- Distributed NMF/NTF Library ☆45 Updated 4 months ago
- Parallel Tensor Infrastructure (ParTI!) ☆28 Updated 4 years ago
- RSVDPACK: Implementations of fast algorithms for computing the low rank SVD, interpolative and CUR decompositions of a matrix, using ran… ☆90 Updated 2 years ago
- Parameterless and Universal FInding of Nearest Neighbors ☆59 Updated last month
- Implementation of fast exact k-means algorithms ☆45 Updated 5 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post: ☆12 Updated 8 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS. ☆51 Updated 7 years ago
- High-Performance Tensor Transpose library ☆194 Updated last year
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- ulmBLAS ☆106 Updated 3 years ago
- experimental python CFFI interface to NVIDIA's cuSOLVER and cuSPARSE libraries. ☆13 Updated 4 years ago
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester ☆34 Updated 2 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆72 Updated last month
- Header-only version of RedSVD ☆58 Updated 10 years ago
- fast Fourier transform on GPU in shared memory for AstroAccelerate project ☆26 Updated 4 years ago
- A GPU algorithm for sparse matrix-matrix multiplication ☆70 Updated 4 years ago
- FFLAS-FFPACK - Finite Field Linear Algebra Subroutines / Package ☆60 Updated 2 months ago
- ArrayFire's Machine Learning Library. ☆104 Updated 6 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 5 years ago
- Implementations of several fast approximate algorithms for geometric optimal transport (OT) ☆118 Updated 5 years ago
- Neural LSH [ICLR 2020] - Using supervised learning to produce better space partitions for fast nearest neighbor search. ☆73 Updated 4 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays ☆203 Updated 8 months ago
- Recursive LAPACK Collection ☆42 Updated 3 years ago
- Sketching-based Distributed Matrix Computations for Machine Learning ☆100 Updated 7 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 Updated 7 years ago
- SONG: Approximate Nearest Neighbor Search on GPU. SONG is a graph-based approximate nearest neighbor search toolbox. ☆67 Updated this week