epoch-research / Compute-TrendsLinks
Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".
☆41Updated 3 years ago
Alternatives and similar repositories for Compute-Trends
Users that are interested in Compute-Trends are comparing it to the libraries listed below
Sorting:
- Make triton easier☆47Updated last year
- A collection of reproducible inference engine benchmarks☆32Updated 3 months ago
- ☆28Updated 6 months ago
- ☆125Updated last year
- ☆37Updated 2 years ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 7 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last week
- Training material for IPU users: tutorials, feature examples, simple applications☆86Updated 2 years ago
- Example ML projects that use the Determined library.☆32Updated 10 months ago
- Samples of good AI generated CUDA kernels☆86Updated 2 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆71Updated last month
- Write a fast kernel and run it on Discord. See how you compare against the best!☆48Updated last week
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆72Updated last year
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- train with kittens!☆62Updated 9 months ago
- ☆74Updated 4 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆102Updated last year
- PyTorch centric eager mode debugger☆47Updated 7 months ago
- ☆27Updated last year
- ☆108Updated 11 months ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆141Updated last year
- A parallel framework for training deep neural networks☆63Updated 4 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆66Updated 8 months ago
- A list of awesome neural symbolic papers.☆47Updated 3 years ago
- Experiment of using Tangent to autodiff triton☆80Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Updated 10 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆59Updated last week
- ML model training for edge devices☆165Updated last year
- Personal solutions to the Triton Puzzles☆19Updated last year