ENCCS / gpu-programming
Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks
☆73 · Updated 4 months ago
Alternatives and similar repositories for gpu-programming:
Users who are interested in gpu-programming are comparing it to the repositories listed below:
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆51 · Updated last month
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆41 · Updated this week
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆343 · Updated last month
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/ ☆30 · Updated 6 months ago
- LLM training in simple, raw C/CUDA ☆92 · Updated 10 months ago
- GPUOcelot: A dynamic compilation framework for PTX ☆182 · Updated last month
- Some CUDA example code with READMEs. ☆93 · Updated 3 weeks ago
- Nvidia Instruction Set Specification Generator ☆253 · Updated 8 months ago
- All pdfs of Victor Eijkhout's Art of HPC books and courses ☆620 · Updated 11 months ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm ☆37 · Updated this week
- High-Performance SGEMM on CUDA devices ☆87 · Updated 2 months ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆169 · Updated 4 months ago
- Algebraic enhancements for GEMM & AI accelerators ☆274 · Updated last month
- Public repository for vol 2 of The Art of HPC: parallel programming ☆80 · Updated this week
- NVIDIA Math Libraries for the Python Ecosystem ☆255 · Updated 2 weeks ago
- Visualization of cache-optimized matrix multiplication ☆105 · Updated 2 weeks ago
- GPU documentation for humans ☆32 · Updated this week
- N-Ways to Multi-GPU Programming ☆19 · Updated last year
- AMD lab notes with code examples to demonstrate use of AMD GPUs ☆96 · Updated 9 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs ☆164 · Updated this week
- ☆242 · Updated last year
- Custom PTX Instruction Benchmark ☆120 · Updated last month
- The CUDA target for Numba ☆91 · Updated this week
- Learning about CUDA by writing PTX code. ☆125 · Updated last year
- Examples from Programming in Parallel with CUDA ☆130 · Updated 2 years ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆363 · Updated last week
- A unified framework across multiple programming platforms ☆36 · Updated 9 months ago
- Little OpenMP Library ☆159 · Updated 2 years ago
- ☆131 · Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆35 · Updated this week