VictorEijkhout / TheArtofHPC_pdfs
All pdfs of Victor Eijkhout's Art of HPC books and courses
☆514 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for TheArtofHPC_pdfs
- Public repository for vol 2 of The Art of HPC: parallel programming ☆66 · Updated 7 months ago
- A curated list of awesome high performance computing resources ☆663 · Updated this week
- Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks ☆67 · Updated this week
- Public repository for The Art of HPC volume 1: Scientific Computing ☆44 · Updated 7 months ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications ☆176 · Updated 2 years ago
- Important concepts in numerical linear algebra and related areas ☆728 · Updated 10 months ago
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem ☆296 · Updated 2 months ago
- Examples from Programming in Parallel with CUDA ☆108 · Updated last year
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆139 · Updated this week
- Run a parallel command inside a split tmux window ☆136 · Updated 2 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆188 · Updated this week
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs. ☆277 · Updated this week
- Numerical linear algebra software package ☆410 · Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax ☆1,220 · Updated this week
- Kernel Tuner ☆287 · Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆311 · Updated this week
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction" ☆123 · Updated this week
- ☆132 · Updated last year
- Awesome resources for GPUs ☆495 · Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆196 · Updated 2 weeks ago
- Little OpenMP Library ☆157 · Updated 2 years ago
- High-Performance FP32 Matrix Multiplication on CPU ☆301 · Updated this week
- CSC Summer School in High-Performance Computing ☆93 · Updated 4 months ago
- MLIR For Beginners tutorial ☆824 · Updated last month
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆415 · Updated this week
- Deep learning accelerator architectures requiring half the multipliers ☆263 · Updated 7 months ago
- CUDA Core Compute Libraries ☆1,286 · Updated this week
- ☆234 · Updated 8 months ago
- A collaborative effort to consolidate expert knowledge on code guidelines for the correctness, modernization, and optimization of code wr… ☆83 · Updated last week
- advanced compilers ☆755 · Updated 2 months ago