mperlet / matrix_multiplication
Parallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
☆53Updated 2 years ago
Related projects: ⓘ
- MPI Tutorial Exercises☆43Updated 10 years ago
- ☆56Updated this week
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆20Updated 6 years ago
- OpenMP tutorial☆36Updated 7 years ago
- SpMV using CUDA☆16Updated 6 years ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆15Updated 4 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )☆55Updated 6 months ago
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- Introduction to CUDA programming☆111Updated 7 years ago
- ☆22Updated 4 years ago
- matrix multiplication in CUDA☆114Updated last year
- Learn OpenMP examples step by step☆81Updated 3 years ago
- This is a tuned sparse matrix dense vector multiplication(SpMV) library☆21Updated 8 years ago
- ☆34Updated this week
- Sparse Matrix-Vector Multiplication implementations in C☆22Updated last year
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆22Updated 4 years ago
- Parallel clustering with OpenMP, MPI and CUDA☆40Updated 7 years ago
- Chai☆41Updated 9 months ago
- ☆63Updated 10 years ago
- Problem: LU Factorization using OpenMP and MPI: study of scalability.☆15Updated 10 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆20Updated 10 months ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 6 years ago
- A high performance implementation of kmeans algorithm with cuda☆18Updated 10 years ago
- Code repo for lotsofcores.com book 1, here since dropbox doesn't work for everyone☆26Updated 8 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆46Updated 3 years ago
- CompPhys - a Computational Physics repository☆82Updated 10 months ago
- IMPACT GPU Algorithms Teaching Labs☆55Updated last year
- ☆28Updated 4 years ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts☆24Updated 2 years ago
- Algorithms implemented in CUDA + resources about GPGPU☆53Updated 2 years ago