drkennetz / cuda_examplesLinks

Some CUDA example code with READMEs.

☆169

Alternatives and similar repositories for cuda_examples

Users that are interested in cuda_examples are comparing it to the libraries listed below

Sorting:

CisMine / Guide-NVIDIA-Tools
NVIDIA tools guide
☆143Updated 6 months ago
JanakiSubu / GPU_CUDA_100
100 days of CUDA Challenge
☆46Updated last week
CisMine / Parallel-Computing-Cuda-C
CUDA Learning guide
☆419Updated last year
rkinas / cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…
☆363Updated 5 months ago
a-hamdi / GPU
100 days of building GPU kernels!
☆477Updated 3 months ago
gevtushenko / llm.c
LLM training in simple, raw C/CUDA
☆102Updated last year
hkproj / 100-days-of-gpu
☆358Updated 3 months ago
CisMine / GPU-in-ML-DL
Apply GPU in ML and DL
☆52Updated 5 months ago
JINO-ROHIT / advanced_ml
☆59Updated last week
1y33 / 100Days
GPU Kernels
☆191Updated 3 months ago
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆383Updated 4 months ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated 2 months ago
salykova / sgemm.c
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
☆350Updated 3 months ago
tgautam03 / xGeMM
Accelerated General (FP32) Matrix Multiplication from scratch in CUDA
☆123Updated 6 months ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆414Updated last month
mlops-discord / gpu-optimization-workshop
Slides, notes, and materials for the workshop
☆328Updated last year
tugot17 / pmpp
Complete solutions to the Programming Massively Parallel Processors Edition 4
☆450Updated last month
MekkCyber / CutlassAcademy
A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS
☆205Updated 3 months ago
unixpickle / learn-ptx
Learning about CUDA by writing PTX code.
☆133Updated last year
loganwatchorn / notes-pmpp
Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)
☆53Updated 11 months ago
AdepojuJeremy / CUDA-120-DAYS--CHALLENGE
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…
☆724Updated 4 months ago
Quentin-Anthony / nanoMPI
Simple MPI implementation for prototyping or learning
☆272Updated 2 weeks ago
wentasah / mmul-anim
Visualization of cache-optimized matrix multiplication
☆153Updated 4 months ago
evintunador / triton_docs_tutorials
making the official triton tutorials actually comprehensible
☆53Updated 2 weeks ago
R100001 / Programming-Massively-Parallel-Processors
☆173Updated last year
NVIDIA / accelerated-computing-hub
NVIDIA curated collection of educational resources related to general purpose GPU programming.
☆611Updated 3 weeks ago
HenryNdubuaku / cuda-tutorials
CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.
☆187Updated last month
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆98Updated 6 months ago
andrewkchan / yalm
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
☆396Updated 2 months ago
Infatoshi / mnist-cuda
☆287Updated 6 months ago