ZhengzhongSun / Matrix-Inversion-with-CUDA
I implemented a parallel algorithm for matrix inversion based on Gauss-Jordan elimination.
☆45Updated 8 years ago
Related projects: ⓘ
- ☆42Updated 6 years ago
- ☆20Updated 5 years ago
- A shallow fork of SuiteSparse adding build files for Visual Studio and support for ACML☆100Updated 8 years ago
- C++ implementation of sparse matrix using CRS (Compressed Row Storage) format☆109Updated 4 years ago
- Conjugate Gradient for Least Squares in CUDA☆51Updated 9 years ago
- Implementation of ConjugateGradients method using C and Nvidia CUDA☆47Updated 2 years ago
- ☆88Updated 7 years ago
- Utilities for CUDA programming☆39Updated 5 years ago
- Source code examples from the Parallel Forall Blog☆94Updated 5 years ago
- A new QR decomposition algorithm implemented in CUDA☆15Updated 2 months ago
- Some CUDA design patterns and a bit of template magic for CUDA☆144Updated last year
- OpenMP tutorial☆36Updated 7 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆20Updated 6 years ago
- An implementation of parallel exclusive scan in CUDA☆57Updated 6 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆25Updated 7 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- MWE for using the Eigen library in CUDA kernels☆116Updated last year
- A few cuda examples built with cmake☆23Updated 5 years ago
- sparse matrix pre-processing library☆81Updated 4 months ago
- Implementation of the maximum network flow problem in CUDA.☆26Updated 3 years ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆40Updated 7 years ago
- PLEASE SEE THE OFFICIAL REPOSITORY. THIS IS NOT MAINTAINED ANYMORE.☆93Updated 4 years ago
- This is a tuned sparse matrix dense vector multiplication(SpMV) library☆21Updated 8 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆50Updated last year
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers…☆26Updated 5 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 5 years ago
- parallel algorithm based on cuda☆62Updated 6 years ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts☆24Updated 2 years ago
- Template for GPU accelerated python libraries☆44Updated last year
- CUDA implementation of the Jacobi method☆24Updated 9 years ago