tgautam03 / CUDA-C
Simple problems implemented in CUDA C
☆17Updated last month
Alternatives and similar repositories for CUDA-C:
Users that are interested in CUDA-C are comparing it to the libraries listed below
- Apply GPU in ML and DL☆44Updated last month
- General Matrix Multiplication using NVIDIA Tensor Cores☆11Updated 2 months ago
- Neural network from scratch in CUDA/C++☆78Updated 2 months ago
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆107Updated 2 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆16Updated this week
- ☆11Updated last year
- Personal notes on CUDA programming☆56Updated 2 years ago
- ML/DL Math and Method notes☆59Updated last year
- A parallel framework for training deep neural networks☆57Updated last week
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆30Updated 6 months ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆217Updated 6 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆167Updated last week
- Material for the SC22 Deep Learning at Scale Tutorial☆40Updated last year
- ☆8Updated 5 years ago
- LLM training in simple, raw C/CUDA☆92Updated 10 months ago
- High-Performance SGEMM on CUDA devices☆87Updated 2 months ago
- ☆41Updated 2 weeks ago
- PyTorch examples for NERSC systems☆31Updated 5 months ago
- ☆209Updated this week
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆363Updated last week
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆174Updated last year
- This is a port of Mistral-7B model in JAX☆32Updated 8 months ago
- Learning about CUDA by writing PTX code.☆124Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆60Updated this week
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆153Updated this week
- ☆14Updated last year
- GPU Kernels☆155Updated this week
- Course of Introduction to Python for Data Sciences developed at Univ. Grenoble Alpes.☆27Updated 3 years ago
- CPU and GPU tutorial examples☆13Updated last month