Firojpaudel / 100_days_of_CUDALinks
Challenging myself to learn CUDA (Basics → Intermediate) these 100 days.
☆27Updated 4 months ago
Alternatives and similar repositories for 100_days_of_CUDA
Users that are interested in 100_days_of_CUDA are comparing it to the libraries listed below
Sorting:
- 100 days of building GPU kernels!☆523Updated 6 months ago
- GPU Kernels☆203Updated 6 months ago
- ☆386Updated 6 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆405Updated 8 months ago
- Apply GPU in ML and DL☆54Updated last month
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆427Updated 7 months ago
- Learnings and programs related to CUDA☆422Updated 4 months ago
- Some CUDA example code with READMEs.☆176Updated 8 months ago
- ☆215Updated 10 months ago
- CUDA Learning guide☆467Updated last year
- ☆376Updated last month
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆319Updated 2 years ago
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆793Updated 7 months ago
- This Repo consists the python note books of IITM - Mathematical Foundations for Generative AI Course,☆305Updated 3 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆564Updated 4 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆386Updated last month
- CUDA tutorials for Maths & ML tutorials with examples, covers multi-gpus, fused attention, winograd convolution, reinforcement learning.☆198Updated 4 months ago
- A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, …☆534Updated last week
- ghimiresunil / LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-InferencingLLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferenc…☆710Updated last week
- ☆14Updated 7 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 5 months ago
- ☆498Updated this week
- This repository contains an exhaustive coverage of a hands on approach to PyTorch along side powerful tools to accelerate model tuning an…☆188Updated 3 weeks ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆160Updated last year
- repo of paper implementations☆20Updated 8 months ago
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆15Updated 4 months ago
- ☆144Updated last year
- ☆157Updated last year
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆166Updated 10 months ago
- Slides, notes, and materials for the workshop☆333Updated last year