ENCCS / gpu-programmingLinks
Meta-GPU lesson covering general aspects of GPU programming as well as specific frameworks
☆89Updated 3 weeks ago
Alternatives and similar repositories for gpu-programming
Users that are interested in gpu-programming are comparing it to the libraries listed below
Sorting:
- LLM training in simple, raw C/CUDA☆104Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆356Updated 5 months ago
- NVIDIA Math Libraries for the Python Ecosystem☆387Updated 2 weeks ago
- All pdfs of Victor Eijkhout's Art of HPC books and courses☆707Updated last year
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆50Updated 2 weeks ago
- Public repository for vol 2 of The Art of HPC: parallel programming☆88Updated 3 months ago
- Public repository for The Art of HPC volume 1: Scientific Computing☆61Updated last year
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆61Updated 2 weeks ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆32Updated 5 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆173Updated this week
- Quantum computing without the linear algebra☆76Updated 3 months ago
- Tensor library & inference framework for machine learning☆110Updated 3 weeks ago
- Visualization of cache-optimized matrix multiplication☆155Updated 6 months ago
- N-Ways to Multi-GPU Programming☆37Updated last month
- Learning about CUDA by writing PTX code.☆135Updated last year
- High-Performance SGEMM on CUDA devices☆101Updated 8 months ago
- This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Departmen…☆23Updated 3 years ago
- This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too!☆76Updated this week
- HIP Python Low-level Bindings☆29Updated 4 months ago
- Machine Learning for HPC Workflows☆141Updated this week
- Data Parallel Extension for NumPy☆111Updated this week
- Matrix multiplication schemes☆197Updated 4 months ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆213Updated 3 years ago
- A variety of programming models relevant to scientists explained, with an emphasis on how programming constructs map to parts of the com…☆63Updated 6 years ago
- LLM inference in Fortran☆62Updated last year
- Fast GPT-2 inference written in Fortran☆199Updated this week
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆139Updated 8 months ago
- Custom PTX Instruction Benchmark☆127Updated 6 months ago
- Little OpenMP Library☆167Updated 2 years ago
- NVIDIA tools guide☆142Updated 8 months ago