ZhengzhongSun / Matrix-Inversion-with-CUDA
I implemented a parallel algorithm for matrix inversion based on Gauss-Jordan elimination.
☆45 Updated 9 years ago
Alternatives and similar repositories for Matrix-Inversion-with-CUDA:
Users who are interested in Matrix-Inversion-with-CUDA compare it with the libraries listed below:
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format ☆21 Updated 6 years ago
- Utilities for CUDA programming ☆40 Updated 5 years ago
- C++ implementation of sparse matrix using CRS (Compressed Row Storage) format ☆112 Updated 4 years ago
- A shallow fork of SuiteSparse adding build files for Visual Studio and support for ACML ☆100 Updated 9 years ago
- CUDA tool set for non-C++ languages that provides functionality similar to Thrust, with NVRTC at its core. ☆59 Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆150 Updated last year
- Sparse-dense matrix-matrix multiplication on GPUs ☆14 Updated 6 years ago
- An implementation of parallel exclusive scan in CUDA ☆62 Updated 7 years ago
- Efficient SpGEMM on GPU using CUDA and CSR ☆52 Updated last year
- A few CUDA examples built with CMake ☆23 Updated 5 years ago
- A tuned sparse-matrix dense-vector multiplication (SpMV) library ☆21 Updated 9 years ago
- MWE for using the Eigen library in CUDA kernels ☆118 Updated 2 years ago
- Source code examples from the Parallel Forall Blog ☆96 Updated 6 years ago
- An expression-template-based linear algebra library running completely on the GPU using CUDA ☆25 Updated 3 years ago
- CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable. ☆36 Updated 7 years ago
- Conjugate Gradient for Least Squares in CUDA ☆52 Updated 9 years ago
- MATLAB MEX wrappers to cuSPARSE (NVIDIA) ☆11 Updated 3 months ago
- Fast Fourier transform on GPU in shared memory for the AstroAccelerate project ☆26 Updated 4 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm ☆27 Updated 7 years ago
- Algorithms implemented in CUDA + resources about GPGPU ☆55 Updated 3 years ago
- Sparse Matrix-Vector Multiplication implementations in C ☆22 Updated 2 years ago
- Conjugate Gradient solver written in CUDA ☆29 Updated 5 years ago
- Corrected source for the OpenCL in Action book (work in progress) ☆64 Updated 11 years ago
- A header-only C++ library for data transfer between linear algebra libraries (Eigen, Armadillo, OpenCV, ArrayFire, LibTorch). ☆81 Updated 10 months ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆68 Updated 2 years ago
- Efficient CUDA Stream Compaction Library ☆33 Updated last year
- Introduction to CUDA programming ☆115 Updated 7 years ago