ENCCS / gpu-programmingLinks
Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks
☆98Updated 3 weeks ago
Alternatives and similar repositories for gpu-programming
Users that are interested in gpu-programming are comparing it to the libraries listed below
Sorting:
- LLM training in simple, raw C/CUDA☆112Updated last year
- Public repository for vol 2 of The Art of HPC: parallel programming☆96Updated last week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆56Updated this week
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆376Updated 9 months ago
- Tensor library & inference framework for machine learning☆117Updated 4 months ago
- Fast GPT-2 inference written in Fortran☆204Updated 4 months ago
- High-Performance FP32 GEMM on CUDA devices☆117Updated last year
- Custom PTX Instruction Benchmark☆138Updated 11 months ago
- Quantum computing without the linear algebra☆78Updated 2 months ago
- LLM inference in Fortran☆65Updated last year
- NVIDIA Math Libraries for the Python Ecosystem☆544Updated 3 weeks ago
- All pdfs of Victor Eijkhout's Art of HPC books and courses☆773Updated last week
- C++ HPC Tutorial materials☆54Updated 3 months ago
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆417Updated 3 weeks ago
- HIP Python Low-level Bindings☆33Updated 2 months ago
- N-Ways to Multi-GPU Programming☆37Updated 5 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆67Updated 3 weeks ago
- ☆89Updated 2 months ago
- Public repository for The Art of HPC volume 1: Scientific Computing☆64Updated last year
- Fast and Furious AMD Kernels☆348Updated 2 weeks ago
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…☆823Updated 3 weeks ago
- Visualization of cache-optimized matrix multiplication☆157Updated 10 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆183Updated last month
- Algebraic enhancements for GEMM & AI accelerators☆287Updated 11 months ago
- Learning about CUDA by writing PTX code.☆151Updated last year
- The Foundation for All Legate Libraries☆233Updated this week
- ☆281Updated last week
- Little OpenMP Library☆170Updated 3 years ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆224Updated 3 years ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆36Updated 3 months ago