ShlokVFX / 100-days-cuda
This repository documents my 100-day journey of learning and writing CUDA kernels.
☆22Jun 25, 2025Updated 7 months ago
Alternatives and similar repositories for 100-days-cuda
Users who are interested in 100-days-cuda are comparing it to the repositories listed below.
- A bot for storing emeralds, built specifically for the Wasteland (廢土) server☆16Mar 20, 2024Updated last year
- ☆10Dec 23, 2023Updated 2 years ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 6 months ago
- ☆23Jul 11, 2025Updated 7 months ago
- This repository contains an analysis of the effects of COVID-19 on trade trends up to December 2021. The dataset used provides daily trad…☆15Aug 16, 2023Updated 2 years ago
- The Vulkan GPU radix sort implementation from Google Fuchsia, but with CMake☆12Jan 13, 2023Updated 3 years ago
- ☆15Oct 30, 2025Updated 3 months ago
- Tutorial for (PyTorch) + (C++) + (Metal shader)☆16Oct 25, 2025Updated 3 months ago
- RyuseiLight is a beautiful, lightweight and extensible syntax highlighter.☆15Aug 9, 2021Updated 4 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆17Feb 9, 2026Updated last week
- This repository is a centralized knowledge base designed to help first-year CS students get started with programming, development, comput…☆67Oct 24, 2025Updated 3 months ago
- My leetcode solutions☆11Jan 11, 2023Updated 3 years ago
- Fork of Rust concurrent hash map benchmarks to include leapfrog map.☆14Mar 13, 2022Updated 3 years ago
- 100 days of building GPU kernels!☆570Apr 27, 2025Updated 9 months ago
- Implementation of 12 AI agents evaluation techniques☆35Jul 31, 2025Updated 6 months ago
- A straightforward method to reduce your LLM inference API costs and token usage.☆21May 18, 2025Updated 8 months ago
- ☆96Updated this week
- ☆17Feb 9, 2024Updated 2 years ago
- Install `wasm-bindgen` by downloading the executable☆12Mar 3, 2023Updated 2 years ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
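- DBMS, the 2022 course project for "Introduction to Database Systems" in the Department of Computer Science at Tsinghua University; supports parsing and executing basic SQL.☆12Jan 12, 2023Updated 3 years ago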
- An experimental Progressive Web App based on Svelte-kit☆18Jul 6, 2022Updated 3 years ago
- A command-line tool for converting SVG images to PDF files☆17Mar 29, 2025Updated 10 months ago
- A macOS application that converts speech to text using OpenAI's Whisper model running locally. Press the Globe/Function key to start reco…☆25Nov 28, 2025Updated 2 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 2 months ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆46Jan 21, 2026Updated 3 weeks ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year
- ☆16Feb 9, 2026Updated last week
- Implementation of sentence embeddings with BERT in Rust, using the Burn library.☆20Sep 2, 2023Updated 2 years ago
- MFCC implementation with detailed comments.☆17Nov 26, 2020Updated 5 years ago
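- Major lab project for Tsinghua University's "Principles of Computer Organization" course: a five-stage pipelined RISC-V processor. "Three weeks of hard work to build a computer."☆21Mar 11, 2023Updated 2 years ago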
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆60Jan 26, 2026Updated 3 weeks ago
- learning & making kernels in cuda / triton☆22Aug 24, 2025Updated 5 months ago
- Computer Assisted Police Sketching Using Generative Adversarial Networks (PYTHON-3)☆20Jun 28, 2019Updated 6 years ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Jan 25, 2025Updated last year
- Nitro-T is a family of text-to-image diffusion models focused on highly efficient training.☆38Jul 10, 2025Updated 7 months ago
- A curated list of replacements for existing software written in Rust☆24May 28, 2021Updated 4 years ago
- Benchmarking guide for the Azure AI Infrastructure.☆39Feb 5, 2026Updated last week
- ☆19Mar 3, 2025Updated 11 months ago