This repository documents my 100-day journey of learning and writing CUDA kernels.
☆27Jun 25, 2025Updated 9 months ago
Alternatives and similar repositories for 100-days-cuda
Users that are interested in 100-days-cuda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Challenging myself to learn CUDA (Basics → Intermediate) these 100 days.☆32Mar 2, 2026Updated 3 weeks ago
- Developed a high-performance trading engine using Rust, leveraging its powerful features for low-level systems programming. Engineered to…☆23Nov 9, 2024Updated last year
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆73Feb 18, 2026Updated last month
- learning & making kernels in cuda / triton☆22Aug 24, 2025Updated 7 months ago
- "Torture the data and it will confess to anything."―Ronald Coase☆18Jan 8, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Setting up a public Github repo with files to create Claude Reader projects.☆25Feb 6, 2026Updated last month
- ☆31Jun 22, 2025Updated 9 months ago
- ☆10Aug 27, 2022Updated 3 years ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- Advancing TTP Analysis: Harnessing the Power of Large Language Models with Retrieval Augmented Generation☆11May 14, 2024Updated last year
- LLM query engine to retrieve augmented responses from json files.☆15Oct 12, 2023Updated 2 years ago
- The Tifinagh Hand-written Letters Dataset☆12Feb 17, 2024Updated 2 years ago
- ☆23Jul 11, 2025Updated 8 months ago
- 100 days of building GPU kernels!☆581Apr 27, 2025Updated 11 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆37Oct 29, 2025Updated 5 months ago
- Training framework for Large Behavioral Models☆27Sep 17, 2025Updated 6 months ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆26Mar 18, 2026Updated last week
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- coding CUDA everyday!☆74Feb 5, 2026Updated last month
- Implementation of 12 AI agents evaluation techniques☆39Jul 31, 2025Updated 7 months ago
- Machine Learning in Darija☆24Jul 10, 2020Updated 5 years ago
- ☆16Feb 24, 2026Updated last month
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Towards Physics-informed Deep Learning for Turbulent Flow Prediction☆28Oct 18, 2021Updated 4 years ago
- From a+b to sparsemax(QK^T)V in Triton!☆28Jun 19, 2025Updated 9 months ago
- an IDE for the Julia programming language☆82Updated this week
- ☆130Mar 14, 2026Updated 2 weeks ago
- Contains my solutions for various online judge problems, organized in the worst possible way☆14Jul 25, 2015Updated 10 years ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year
- Python script to automatically create sigma rules from The hive observables☆25Mar 17, 2019Updated 7 years ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- ☆10Dec 23, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - …☆24Sep 27, 2022Updated 3 years ago
- Apply GPU in ML and DL☆67Mar 23, 2026Updated last week
- A curation of awesome portfolio website ideas for developers and designers to draw inspiration from. Raise a pull request to add more. 💜…☆17Apr 15, 2025Updated 11 months ago
- A comprehensive hands-on project for learning GPU programming with CUDA and HIP, covering fundamental concepts through advanced optimizat…☆35Nov 20, 2025Updated 4 months ago
- implement GPT-OSS 20B & 120B C++ inference from scratch on AMD GPUs☆170Oct 25, 2025Updated 5 months ago
- Certified robustness of deep neural networks☆19Aug 20, 2024Updated last year
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 3 weeks ago