notY0rick / cuda_practice
My own repository containing the code I wrote to practice CUDA programming.
☆44 Updated last year
Alternatives and similar repositories for cuda_practice:
Users who are interested in cuda_practice are comparing it to the repositories listed below:
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.) ☆52 Updated 8 months ago
- Apply GPU in ML and DL ☆52 Updated 2 months ago
- An implementation of the transformer architecture as an NVIDIA CUDA kernel ☆179 Updated last year
- Small-scale distributed training of sequential deep learning models, built on NumPy and MPI. ☆130 Updated last year
- ☆47 Updated 3 weeks ago
- Learning about CUDA by writing PTX code. ☆128 Updated last year
- A C/C++ implementation of micrograd: a tiny autograd engine with a neural net on top. ☆67 Updated last year
- CUDA learning guide ☆359 Updated 10 months ago
- ☆67 Updated last year
- GPT-2 in C ☆68 Updated 3 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆178 Updated last week
- A really tiny autograd engine ☆92 Updated last year
- Some CUDA example code with READMEs. ☆94 Updated last month
- ☆51 Updated this week
- Documented and unit-tested educational deep learning framework with autograd from scratch. ☆111 Updated last year
- Small autograd engine inspired by Karpathy's micrograd and PyTorch ☆252 Updated 5 months ago
- A tiny multidimensional array implementation in C similar to NumPy, but in only one file. ☆228 Updated 8 months ago
- Multi-threaded FP32 matrix multiplication on x86 CPUs ☆348 Updated this week
- NVIDIA tools guide ☆129 Updated 3 months ago
- High-quality resources on GPU programming/architecture ☆586 Updated 9 months ago
- GPU kernels ☆160 Updated 2 weeks ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆324 Updated 2 months ago
- Custom kernels in the Triton language for accelerating LLMs ☆18 Updated last year
- Solve puzzles. Learn CUDA. ☆63 Updated last year
- Neural network from scratch in CUDA/C++ ☆78 Updated 3 months ago
- Solve puzzles to improve your tinygrad skills! ☆122 Updated last month
- Learnings and programs related to CUDA ☆379 Updated 2 months ago
- Tutorials on tinygrad ☆370 Updated last month
- ML/DL math and method notes ☆60 Updated last year
- PyTorch from scratch in pure C/CUDA and Python ☆40 Updated 6 months ago