dlsyscourse / public_notebooksLinks
☆58Updated 7 months ago
Alternatives and similar repositories for public_notebooks
Users that are interested in public_notebooks are comparing it to the libraries listed below
Sorting:
- ☆36Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆69Updated 4 years ago
- ☆159Updated last year
- ☆89Updated 9 months ago
- ☆174Updated 5 months ago
- Collection of kernels written in Triton language☆132Updated 2 months ago
- ☆14Updated last month
- From-scratch diffusion model implemented in PyTorch.☆93Updated last year
- ring-attention experiments☆144Updated 8 months ago
- ☆49Updated 11 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆45Updated 11 months ago
- Cataloging released Triton kernels.☆238Updated 5 months ago
- ML/DL Math and Method notes☆61Updated last year
- Personal solutions to the Triton Puzzles☆19Updated 11 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆119Updated 8 months ago
- PyTorch centric eager mode debugger☆47Updated 6 months ago
- ☆298Updated 6 months ago
- 📑 Dive into Big Model Training☆114Updated 2 years ago
- A minimal implementation of vllm.☆44Updated 10 months ago
- The simplest but fast implementation of matrix multiplication in CUDA.☆36Updated 11 months ago
- ☆13Updated 3 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆134Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆51Updated last year
- ☆170Updated last year
- Make triton easier☆46Updated last year
- the public repo for stats205 scribe notes at Stanford University☆13Updated 4 years ago
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆70Updated last year
- ☆12Updated last month
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago