ShadenSmith / splatt
The Surprisingly ParalleL spArse Tensor Toolkit.
☆69 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for splatt
- sparse matrix pre-processing library ☆81 · Updated 6 months ago
- Parallel Tensor Infrastructure (ParTI!) ☆28 · Updated 4 years ago
- The SparseX sparse kernel optimization library ☆39 · Updated 5 years ago
- High-Performance Machine Learning Primitives ☆10 · Updated 3 years ago
- Sparse matrix computation library for GPU ☆54 · Updated 4 years ago
- Tensor Contraction Code Generator ☆36 · Updated 7 years ago
- A package for constructing sparse tensors from CSV-like data sources. ☆10 · Updated 6 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 · Updated 4 years ago
- bhSPARSE: A Sparse BLAS Library ☆16 · Updated 9 years ago
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi ☆95 · Updated 5 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures ☆28 · Updated last year
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆45 · Updated 9 years ago
- ☆90 · Updated 7 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆69 · Updated last week
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1) ☆26 · Updated 9 years ago
- Fork of magma to include more BLAS ☆28 · Updated 8 years ago
- A general purpose library for numerical calculations with higher order tensors, Tensor-Train Decompositions / Matrix Product States and o… ☆20 · Updated 2 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays ☆201 · Updated 3 months ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 · Updated 7 years ago
- This repository contains the cuStinger data structure used for dynamic graph representation. ☆18 · Updated 5 years ago
- RSVDPACK: Implementations of fast algorithms for computing the low rank SVD, interpolative and CUR decompositions of a matrix, using ran… ☆85 · Updated 2 years ago
- CUDA Tensor Transpose (cuTT) library ☆50 · Updated 7 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆33 · Updated 5 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018 ☆71 · Updated 4 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels ☆18 · Updated 9 years ago
- Fast Fast Hadamard Transform ☆77 · Updated 2 years ago
- Hornet data structure for sparse dynamic graphs and matrices ☆80 · Updated 5 years ago
- Sketching-based Distributed Matrix Computations for Machine Learning ☆98 · Updated 6 years ago
- Distributed NMF/NTF Library ☆41 · Updated 3 weeks ago
- Matlab implementations of communication-avoiding Krylov subspace methods ☆12 · Updated 3 years ago