1y33/100Days

GPU Kernels

☆225

Alternatives and similar repositories for 100Days

Users who are interested in 100Days are comparing it to the repositories listed below.

a-hamdi / GPU
100 days of building GPU kernels!
☆617 · Apr 27, 2025 · Updated last year
hkproj / 100-days-of-gpu
☆440 · Apr 10, 2025 · Updated last year
rkinas / cuda-learning
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…
☆462 · Feb 22, 2025 · Updated last year
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆496 · Mar 10, 2025 · Updated last year
a-hamdi / native-sparse-attention
☆15 · Feb 23, 2025 · Updated last year
EugenHotaj / llm_parallelisms.c
LLM training parallelisms (DP, FSDP, TP, PP) in pure C
☆29 · Jan 27, 2026 · Updated 6 months ago
daemyung / practice-triton
Triangles in practice! Triton
☆16 · Feb 15, 2024 · Updated 2 years ago
WaveSpeedAI / QuantumAttention
[WIP] Better (FP8) attention for Hopper
☆33 · Feb 24, 2025 · Updated last year
leimao / CUTLASS-Examples
CUTLASS and CuTe Examples
☆137 · Nov 30, 2025 · Updated 7 months ago
hkproj / triton-flash-attention
☆257 · Jan 2, 2025 · Updated last year
julienokumu / 100DaysOfGPUProgramming
100 Days Of GPU Programming.
☆45 · Nov 7, 2025 · Updated 8 months ago
silvaxxx1 / MyLLM
"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"
☆147 · Jul 17, 2026 · Updated last week
kfish / micrograd-cpp-2023
A C++ port of karpathy/micrograd, a tiny scalar-valued autograd engine and a neural net library
☆13 · Nov 24, 2023 · Updated 2 years ago
MekkCyber / CutlassAcademy
A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS
☆268 · May 6, 2025 · Updated last year
vipulSharma18 / NCCL-From-First-Principles
NCCL communication API layer, and transport layer created from first principles.
☆16 · Aug 20, 2025 · Updated 11 months ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆440 · Jun 29, 2025 · Updated last year
shlokgpu / 100-days-cuda
This repository documents my 100-day journey of learning and writing CUDA kernels.
☆35 · Mar 29, 2026 · Updated 4 months ago
gpusgobrr / explore-gemm
Exploring how optimizations for GEMMs work
☆36 · Feb 28, 2026 · Updated 5 months ago
gpuasm / autosass
☆17 · Mar 29, 2026 · Updated 3 months ago
ageron / jupyter-synth
A Jupyter notebook to have fun with audio in Python and learn the fundamentals of audio processing
☆16 · Nov 26, 2025 · Updated 8 months ago
StuartSul / gpu-experiments
A collection of GPU experiments and benchmarks for my personal understanding and research.
☆34 · Jul 22, 2026 · Updated last week
gau-nernst / learn-cuda
Learn CUDA with PyTorch
☆357 · Jun 1, 2026 · Updated last month
JINO-ROHIT / inferGPT
a simple c++ inference engine for gpt based architecture
☆40 · Dec 10, 2025 · Updated 7 months ago
rajneel18 / 100_CUDA_Kernels
☆17 · May 6, 2025 · Updated last year
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆424 · Nov 11, 2025 · Updated 8 months ago
CisMine / Parallel-Computing-Cuda-C
CUDA Learning guide
☆567 · Jun 20, 2024 · Updated 2 years ago
SzymonOzog / GPU_Programming
☆98 · May 30, 2026 · Updated last month
gpu-mode / Triton-Puzzles
Puzzles for learning Triton
☆2,545 · Apr 1, 2026 · Updated 3 months ago
aryagxr / cuda
coding CUDA everyday!
☆77 · Feb 5, 2026 · Updated 5 months ago
Bruce-Lee-LY / cuda_auto_tune
NCU-driven iterative optimization workflow for CUDA/CUTLASS/Triton/CuTe DSL kernels.
☆24 · Apr 10, 2026 · Updated 3 months ago
cyhdmjzzy / DeepEP-Code-Analysis
☆26 · Feb 27, 2026 · Updated 5 months ago
HydraQYH / hp_rms_norm
High performance RMSNorm implementation using SM core storage (registers and shared memory)
☆30 · Jan 22, 2026 · Updated 6 months ago
pranjalssh / fast.cu
Fastest kernels written from scratch
☆587 · Sep 18, 2025 · Updated 10 months ago
saurabhaloneai / History-of-Deep-Learning
learningggggggg 🐳
☆634 · Apr 2, 2025 · Updated last year
tugot17 / pmpp
Complete solutions to the Programming Massively Parallel Processors Edition 4
☆819 · Jun 18, 2025 · Updated last year
palxx / _100_days_of_CUDA
☆11 · Aug 4, 2025 · Updated 11 months ago
SwekeR-463 / Papers-Implemented
repo of paper implementations
☆20 · Feb 25, 2025 · Updated last year
HydraQYH / expert_specialization_moe
Expert Specialization MoE Solution based on CUTLASS
☆27 · Apr 14, 2026 · Updated 3 months ago
eunomia-bpf / cupti-tutorial
Tutorials for NVIDIA CUPTI samples
☆71 · Jul 22, 2026 · Updated last week