FALCONN-LIB / FFHT
Fast Fast Hadamard Transform
☆78 · Updated 3 years ago
Alternatives and similar repositories for FFHT
Users who are interested in FFHT are comparing it to the libraries listed below:
- The Surprisingly ParalleL spArse Tensor Toolkit. ☆70 · Updated 3 years ago
- sparse matrix pre-processing library ☆82 · Updated 10 months ago
- RSVDPACK: Implementations of fast algorithms for computing the low rank SVD, interpolative and CUR decompositions of a matrix, using ran… ☆89 · Updated 2 years ago
- GraphBLAS Template Library (GBTL): C++ graph algorithms and primitives using semiring algebra as defined at graphblas.org ☆133 · Updated last year
- Distributed NMF/NTF Library ☆44 · Updated 3 months ago
- Parallel Tensor Infrastructure (ParTI!) ☆28 · Updated 4 years ago
- High-Performance Linear Algebra-based Graph Primitives on GPUs ☆222 · Updated 3 years ago
- CUDA Tensor Transpose (cuTT) library ☆51 · Updated 7 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post: ☆12 · Updated 8 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS. ☆50 · Updated 7 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays ☆203 · Updated 7 months ago
- FRP: Fast Random Projections ☆43 · Updated 4 years ago
- fast kernel evaluation in high dimensions via hashing ☆23 · Updated 4 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆68 · Updated 2 years ago
- GGNN: State of the Art Graph-based GPU Nearest Neighbor Search ☆154 · Updated last month
- PQ Fast Scan ☆60 · Updated 5 years ago
- LSH-GPU ANN package ☆93 · Updated 5 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆71 · Updated this week
- Fork of magma to include more BLAS ☆28 · Updated 8 years ago
- ☆91 · Updated 8 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 · Updated 5 years ago
- Implementation of fast exact k-means algorithms ☆46 · Updated 5 years ago
- A library of GPU kernels for sparse matrix operations. ☆260 · Updated 4 years ago
- Parameterless and Universal FInding of Nearest Neighbors ☆59 · Updated 2 weeks ago
- GPU Accelerated Subsampled Newton Method for Convex Optimization ☆8 · Updated 7 years ago
- The SparseX sparse kernel optimization library ☆40 · Updated 6 years ago
- complex tensor plugin for pytorch (deprecated) ☆47 · Updated 6 years ago
- TMAC: A Toolbox of Modern Async-Parallel, Coordinate, Splitting, and Stochastic Methods ☆46 · Updated 7 years ago
- ☆21 · Updated 4 years ago
- fast Fourier transform on GPU in shared memory for AstroAccelerate project ☆26 · Updated 4 years ago