SwekeR-463 / 100kernels
100 days of learning & making kernels in cuda / triton
☆20Updated 2 weeks ago
Alternatives and similar repositories for 100kernels:
Users that are interested in 100kernels are comparing it to the libraries listed below
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆213Updated 2 months ago
- GPU Kernels☆157Updated this week
- ☆40Updated 2 weeks ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆52Updated 7 months ago
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 8 months ago
- ☆32Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆169Updated last week
- Question paper of courses taught at IISC as part of MTech AI curriculum☆58Updated 3 months ago
- Setting up Vscode to work with Pytorch in C/C++ with CUDA support☆25Updated last month
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆250Updated 4 months ago
- Coding an LLM and its building blocks from scratch.☆22Updated this week
- A really tiny autograd engine☆90Updated 11 months ago
- ☆212Updated this week
- Rust Implementation of micrograd☆51Updated 8 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 5 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆60Updated last week
- End-to-End LLM Guide☆104Updated 8 months ago
- ☆142Updated 2 months ago
- making the official triton tutorials actually comprehensible☆21Updated last week
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆313Updated 2 weeks ago
- Making of cuda kernel☆14Updated this week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆27Updated this week
- Learnings and programs related to CUDA☆370Updated last month
- Learning about CUDA by writing PTX code.☆125Updated last year
- Apply GPU in ML and DL☆48Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 10 months ago
- ☆29Updated last week
- ☆99Updated 7 months ago
- 100 days of building GPU kernels!☆321Updated this week
- Inference Llama 2 in C++☆44Updated 11 months ago