tgautam03 / CUDA-CLinks
Simple problems implemented in CUDA C
☆22Updated 3 months ago
Alternatives and similar repositories for CUDA-C
Users that are interested in CUDA-C are comparing it to the libraries listed below
Sorting:
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆120Updated 6 months ago
- Personal notes on CUDA programming☆55Updated 2 years ago
- ☆11Updated last year
- Neural network from scratch in CUDA/C++☆82Updated 6 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆169Updated this week
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆236Updated 10 months ago
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- ☆64Updated this week
- A parallel framework for training deep neural networks☆62Updated 4 months ago
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- Custom kernels in Triton language for accelerating LLMs☆23Updated last year
- LLM training in simple, raw C/CUDA☆99Updated last year
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆31Updated 3 months ago
- This repository is an AI Bootcamp material that consist of a workflow for LLM☆93Updated 2 months ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆574Updated last week
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆188Updated last year
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated 2 years ago
- NVIDIA tools guide☆138Updated 6 months ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆21Updated last year
- N-Ways to Multi-GPU Programming☆37Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Updated this week
- Learning about CUDA by writing PTX code.☆133Updated last year
- This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler☆14Updated 2 years ago
- ☆8Updated 5 years ago
- Python package for NN generation from physics☆14Updated 2 years ago
- SC24 Deep Learning at Scale Tutorial Material☆33Updated 5 months ago
- ☆47Updated 6 months ago
- This repository collects the materials from the course "Foundations of HPC" at Data Science and Scientific Computer, University of Triest…☆14Updated 4 years ago
- ☆21Updated 4 years ago
- This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Departmen…☆23Updated 3 years ago