ENCCS / gpu-programming
Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks
☆69Updated 3 months ago
Alternatives and similar repositories for gpu-programming:
Users who are interested in gpu-programming are comparing it to the libraries listed below
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/ ☆30Updated 5 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆36Updated this week
- Visualization of cache-optimized matrix multiplication☆104Updated 5 years ago
- Examples from Programming in Parallel with CUDA☆122Updated last year
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆49Updated 4 months ago
- Public repository for vol 2 of The Art of HPC: parallel programming☆75Updated last week
- Public repository for The Art of HPC volume 1: Scientific Computing☆50Updated 10 months ago
- LLM training in simple, raw C/CUDA☆91Updated 9 months ago
- High-Performance SGEMM on CUDA devices☆76Updated last month
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆160Updated 2 weeks ago
- All pdfs of Victor Eijkhout's Art of HPC books and courses☆588Updated 10 months ago
- Repository with examples and exercises for OLCF and AMD's HIP training series☆14Updated last year
- ☆131Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆106Updated last month
- C++ HPC Tutorial materials☆48Updated 7 months ago
- NVIDIA tools guide☆102Updated last month
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆192Updated 2 years ago
- The CUDA target for Numba☆60Updated this week
- Atomistic Spin Simulation Framework☆65Updated 4 years ago
- High-Performance FP32 Matrix Multiplication on CPU☆333Updated this week
- Resources for the introduction to GPU programming course of the HPC-AI specialized master's program (mastère spécialisé)☆22Updated last year
- Slides/notes and Jupyter notebook demos for an introductory course of numerical analysis/scientific computing☆50Updated last month
- ☆63Updated this week
- A collaborative effort to consolidate expert knowledge on code guidelines for the correctness, modernization, and optimization of code wr…☆89Updated this week
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU☆137Updated last month
- Training examples for SYCL☆39Updated 3 weeks ago
- A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.☆100Updated 11 years ago
- A variety of programming models relevant to scientists explained, with an emphasis on how programming constructs map to parts of the com…☆60Updated 6 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆75Updated last year
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago