ULAFF / LAFF-On-PfHP
Repository for "LAFF-On Programming for High Performance"
☆37Updated 2 months ago
Related projects: ⓘ
- Experimental Linear Algebra Performance Studies☆12Updated 7 years ago
- Slides/notes and Jupyter notebook demos for an introductory course of numerical analysis/scientific computing☆47Updated this week
- Intermediate MPI lesson☆25Updated last year
- A variety of programming models relevant to scientists explained, with an emphasis on how programming constructs map to parts of the com…☆57Updated 5 years ago
- Tensor Contraction Code Generator☆36Updated 7 years ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated 8 months ago
- MagmaDNN: a simple deep learning framework in c++☆45Updated 4 years ago
- Round matrix elements to lower precision in MATLAB☆35Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆21Updated last week
- Code repo for lotsofcores.com book 1, here since dropbox doesn't work for everyone☆26Updated 8 years ago
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆70Updated 10 months ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆73Updated 3 months ago
- ulmBLAS☆102Updated 2 years ago
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems.☆65Updated 2 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆47Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆84Updated 2 months ago
- The sources for the OpenACC Programming and Best Practices Guide.☆33Updated last month
- A Task-based Library for Solving Dense Nonsymmetric Eigenvalue Problems☆21Updated last year
- Examples from Programming in Parallel with CUDA☆101Updated last year
- Recursive LAPACK Collection☆42Updated 2 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆62Updated 2 months ago
- C++ HPC Tutorial materials☆46Updated 2 months ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆65Updated 2 weeks ago
- The ultimate memory bandwidth benchmark☆46Updated last year
- The SCMC and PSCMC programming language☆17Updated last year
- Public repository for vol 2 of The Art of HPC: parallel programming☆65Updated 5 months ago
- CSC Summer School in High-Performance Computing☆91Updated 2 months ago
- ☆40Updated this week
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆119Updated 10 months ago
- Custom-Precision Floating-point numbers.☆28Updated 3 months ago