ENCCS / gpu-programmingLinks
Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks
☆88Updated 2 months ago
Alternatives and similar repositories for gpu-programming
Users that are interested in gpu-programming are comparing it to the libraries listed below
Sorting:
- All pdfs of Victor Eijkhout's Art of HPC books and courses☆671Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆350Updated 2 months ago
- Public repository for vol 2 of The Art of HPC: parallel programming☆86Updated last month
- LLM training in simple, raw C/CUDA☆99Updated last year
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆169Updated 2 weeks ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- Tensor library & inference framework for machine learning☆101Updated last week
- Custom PTX Instruction Benchmark☆126Updated 4 months ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆31Updated 3 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆57Updated 2 months ago
- Public repository for The Art of HPC volume 1: Scientific Computing☆59Updated last year
- Learning about CUDA by writing PTX code.☆133Updated last year
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆209Updated 3 years ago
- NVIDIA Math Libraries for the Python Ecosystem☆333Updated last week
- GPU documentation for humans☆81Updated last week
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆201Updated 5 months ago
- N-Ways to Multi-GPU Programming☆37Updated 2 years ago
- The CUDA target for Numba☆149Updated last week
- Visualization of cache-optimized matrix multiplication☆152Updated 4 months ago
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆23Updated 2 weeks ago
- Examples from Programming in Parallel with CUDA☆157Updated 2 years ago
- Algebraic enhancements for GEMM & AI accelerators☆277Updated 4 months ago
- Fast GPT-2 inference written in Fortran☆196Updated 2 months ago
- LLM inference in Fortran☆59Updated last year
- Kernel Tuner☆353Updated this week
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆565Updated this week
- NVIDIA tools guide☆138Updated 6 months ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆80Updated this week
- The Foundation for All Legate Libraries☆218Updated this week