mlops-discord / gpu-optimization-workshop
Slides, notes, and materials for the workshop
☆306 Updated 5 months ago
Related projects
Alternatives and complementary repositories for gpu-optimization-workshop
- An ML Systems Onboarding list ☆545 Updated this week
- ☆65 Updated 4 months ago
- End-to-End LLM Guide ☆97 Updated 4 months ago
- ☆133 Updated 9 months ago
- Building blocks for foundation models. ☆394 Updated 10 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆107 Updated last year
- GPU programming related news and material links ☆1,237 Updated last month
- UNet diffusion model in pure CUDA ☆584 Updated 4 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆483 Updated 3 weeks ago
- Puzzles for learning Triton ☆1,135 Updated this week
- ☆152 Updated this week
- Cataloging released Triton kernels. ☆134 Updated 2 months ago
- The Tensor (or Array) ☆411 Updated 3 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆715 Updated last month
- CUDA Learning guide ☆253 Updated 5 months ago
- Notes on quantization in neural networks ☆58 Updated 11 months ago
- The Multilayer Perceptron Language Model ☆523 Updated 3 months ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆165 Updated this week
- What would you do with 1000 H100s... ☆903 Updated 10 months ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024 ☆233 Updated 6 months ago
- Best practices & guides on how to write distributed pytorch training code ☆286 Updated 2 weeks ago
- Applied AI experiments and examples for PyTorch ☆166 Updated 2 weeks ago
- Transform datasets at scale. Optimize datasets for fast AI model training. ☆367 Updated this week
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆93 Updated last month
- ☆224 Updated 4 months ago
- How to install CUDA & cuDNN for Machine Learning ☆19 Updated 4 months ago
- For optimization algorithm research and development. ☆449 Updated this week
- This repo contains my solutions to “Introduction to Machine Learning Interviews” by Chip Huyen. ☆135 Updated 4 months ago
- Alex Krizhevsky's original code from Google Code ☆190 Updated 8 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆252 Updated last year