dlsyscourse / public_notebooks
☆58Updated 5 months ago
Alternatives and similar repositories for public_notebooks
Users that are interested in public_notebooks are comparing it to the libraries listed below
Sorting:
- ☆32Updated last year
- ☆155Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆66Updated 4 years ago
- A minimal implementation of vllm.☆40Updated 9 months ago
- Collection of kernels written in Triton language☆122Updated last month
- ring-attention experiments☆142Updated 6 months ago
- Cataloging released Triton kernels.☆221Updated 4 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- Imperative deep learning framework with customized GPU and CPU backend☆29Updated last year
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆124Updated this week
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆132Updated last year
- ☆32Updated 2 months ago
- the public repo for stats205 scribe notes at Stanford University☆13Updated 3 years ago
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆70Updated 11 months ago
- PyTorch centric eager mode debugger☆47Updated 5 months ago
- Personal solutions to the Triton Puzzles☆18Updated 9 months ago
- ☆204Updated 3 weeks ago
- Make triton easier☆47Updated 11 months ago
- A bunch of kernels that might make stuff slower 😉☆40Updated this week
- ☆65Updated 6 months ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- ML/DL Math and Method notes☆60Updated last year
- ☆26Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆44Updated this week
- ☆168Updated last year
- ☆79Updated 10 months ago
- Code release for book "Efficient Training in PyTorch"☆65Updated last month
- Custom kernels in Triton language for accelerating LLMs☆19Updated last year
- ☆36Updated 5 months ago
- Learning material for CMU10-714: Deep Learning System☆248Updated last year