mperlet / matrix_multiplication
Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
☆56 · Updated 3 years ago
Alternatives and similar repositories for matrix_multiplication
Users who are interested in matrix_multiplication are comparing it to the libraries listed below.
- Sparse Matrix-Vector Multiplication implementations in C ☆22 · Updated 2 years ago
- Matrix multiplication in CUDA ☆123 · Updated last year
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format ☆22 · Updated 7 years ago
- This is a tuned sparse matrix dense vector multiplication (SpMV) library ☆21 · Updated 9 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.) ☆60 · Updated 3 months ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆35 · Updated 9 years ago
- MPI Tutorial Exercises ☆46 · Updated 11 years ago
- A high performance implementation of kmeans algorithm with CUDA ☆18 · Updated 10 years ago
- IMPACT GPU Algorithms Teaching Labs ☆57 · Updated 2 years ago
- Three Matrix-Multiplication-Algorithms: Generate Algorithm, Strassen Algorithm and Coppersmith-Winograd Algorithm ☆30 · Updated 3 years ago
- The SparseX sparse kernel optimization library ☆39 · Updated 6 years ago
- Fast matrix multiplication ☆29 · Updated 3 years ago
- Introduction to CUDA programming ☆122 · Updated 8 years ago
- SpMV using CUDA ☆19 · Updated 7 years ago
- A tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester ☆34 · Updated 2 years ago
- Parallel clustering with OpenMP, MPI and CUDA ☆41 · Updated 8 years ago
- Implementation of TSM2L and TSM2R: High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA ☆32 · Updated 4 years ago
- PyCUDA implementation of forward propagation for Convolutional Neural Networks ☆18 · Updated 6 years ago
- Fast Fourier transform on GPU in shared memory for AstroAccelerate project ☆26 · Updated 4 years ago
- Fork of magma to include more BLAS ☆28 · Updated 8 years ago
- ☆20 · Updated 9 years ago
- Learn OpenMP examples step by step ☆95 · Updated 5 months ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor ☆20 · Updated 7 years ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV) ☆22 · Updated 5 years ago
- ☆67 · Updated 11 years ago
- Neural Network implementation in C++ running for MNIST database. ☆55 · Updated 9 years ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files ☆16 · Updated 5 years ago
- Massively Scalable Clustering ☆23 · Updated 6 years ago
- CompPhys - a Computational Physics repository ☆90 · Updated last year
- HPC Challenge Benchmark ☆56 · Updated 2 years ago