ULAFF / LAFF-On-PfHP
Repository for "LAFF-On Programming for High Performance"
☆40 · Updated 7 months ago
Alternatives and similar repositories for LAFF-On-PfHP:
Users who are interested in LAFF-On-PfHP are comparing it to the libraries listed below:
- Slides/notes and Jupyter notebook demos for an introductory course on numerical analysis/scientific computing ☆50 · Updated 3 weeks ago
- A C++ library for computing large scale tensor contractions. ☆36 · Updated 6 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆106 · Updated last month
- Intermediate MPI lesson ☆26 · Updated last year
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆209 · Updated 2 months ago
- Training materials provided by OpenACC.org. ☆87 · Updated 6 months ago
- A variety of programming models relevant to scientists explained, with an emphasis on how programming constructs map to parts of the com… ☆60 · Updated 6 years ago
- Custom-Precision Floating-point numbers. ☆30 · Updated last month
- Resources for the introductory GPU programming course of the HPC-AI specialized master's program ☆22 · Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆198 · Updated 2 months ago
- C++ Template Linear Algebra PACKage ☆43 · Updated this week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆30 · Updated 2 months ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆83 · Updated 2 weeks ago
- Public repository for vol 2 of The Art of HPC: parallel programming ☆75 · Updated this week
- The sources for the OpenACC Programming and Best Practices Guide. ☆36 · Updated last month
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems. ☆68 · Updated 3 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction ☆65 · Updated 4 months ago
- Round matrix elements to lower precision in MATLAB ☆36 · Updated 2 years ago
- A Task-based Library for Solving Dense Nonsymmetric Eigenvalue Problems ☆23 · Updated 2 years ago
- ulmBLAS ☆104 · Updated 2 years ago
- Fast gradient evaluation in C++ based on Expression Templates. ☆94 · Updated last month
- ETH course - Solving PDEs in parallel on GPUs ☆124 · Updated 2 months ago
- Public repository for The Art of HPC volume 1: Scientific Computing ☆49 · Updated 10 months ago
- Lesson material for OpenMP-GPU workshop ☆11 · Updated last year
- Next generation library for iterative sparse solvers for ROCm platform ☆78 · Updated this week
- MagmaDNN: a simple deep learning framework in C++ ☆49 · Updated 4 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project. ☆72 · Updated last month
- H2Lib public repository ☆53 · Updated 2 years ago