ENCCS / gpu-programming
Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks
☆90 · Updated last week
Alternatives and similar repositories for gpu-programming
Users who are interested in gpu-programming compare it to the repositories listed below.
- LLM training in simple, raw C/CUDA ☆105 · Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆364 · Updated 5 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆51 · Updated this week
- NVIDIA Math Libraries for the Python Ecosystem ☆516 · Updated last month
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆61 · Updated last week
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/ ☆33 · Updated 6 months ago
- High-Performance SGEMM on CUDA devices ☆107 · Updated 8 months ago
- All pdfs of Victor Eijkhout's Art of HPC books and courses ☆720 · Updated last year
- HIP Python Low-level Bindings ☆30 · Updated last week
- Tensor library & inference framework for machine learning ☆112 · Updated 2 weeks ago
- Custom PTX Instruction Benchmark ☆129 · Updated 7 months ago
- C++ HPC Tutorial materials ☆55 · Updated last year
- Visualization of cache-optimized matrix multiplication ☆156 · Updated 7 months ago
- Competitive GPU kernel optimization platform. ☆107 · Updated last week
- Learning about CUDA by writing PTX code. ☆143 · Updated last year
- The Foundation for All Legate Libraries ☆228 · Updated this week
- ☆271 · Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆210 · Updated 8 months ago
- Public repository for vol 2 of The Art of HPC: parallel programming ☆89 · Updated last week
- ☆136 · Updated 2 years ago
- LLM inference in Fortran ☆61 · Updated last year
- A unified framework across multiple programming platforms ☆41 · Updated 4 months ago
- monorepo for rocm libraries ☆150 · Updated this week
- Kernel Tuner ☆368 · Updated last week
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆747 · Updated this week
- Fast GPT-2 inference written in Fortran ☆198 · Updated last month
- Little OpenMP Library ☆168 · Updated 3 years ago
- N-Ways to Multi-GPU Programming ☆37 · Updated 2 months ago
- Quantum computing without the linear algebra ☆76 · Updated 4 months ago
- A variety of programming models relevant to scientists explained, with an emphasis on how programming constructs map to parts of the com… ☆63 · Updated 7 years ago