ksopyla / CudaDotProdLinks
Different implementation of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for multiply sparse matrix vs dense vector and sparse matrix vs another sparse matrix. It contains several cuda kernel for sparse matrix dense vector product and sparse matrix sparse matrix product.
☆16Updated 14 years ago
Alternatives and similar repositories for CudaDotProd
Users that are interested in CudaDotProd are comparing it to the libraries listed below
Sorting:
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆22Updated 7 years ago
- sparse matrix pre-processing library☆83Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 8 years ago
- Benchmarking matrix multiplication implementations☆100Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆110Updated 6 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆297Updated 6 years ago
- Efficient LDA solution on GPUs.☆24Updated 6 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 8 years ago
- GPU-specialized parameter server for GPU machine learning.☆101Updated 7 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 8 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 8 years ago
- LSH-GPU ANN package☆94Updated 6 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆179Updated 6 years ago
- High-Performance Tensor Transpose library☆200Updated 2 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆169Updated 7 years ago
- ☆93Updated 8 years ago
- Sparse matrix computation library for GPU☆56Updated 5 years ago
- The Surprisingly ParalleL spArse Tensor Toolkit.