epoch-research / Compute-TrendsLinks
Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".
☆42Updated 3 years ago
Alternatives and similar repositories for Compute-Trends
Users that are interested in Compute-Trends are comparing it to the libraries listed below
Sorting:
- Make triton easier☆47Updated last year
- Personal solutions to the Triton Puzzles☆20Updated last year
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 9 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆57Updated this week
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆33Updated 5 months ago
- Train, tune, and infer Bamba model☆132Updated 3 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆96Updated last month
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆60Updated 3 weeks ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- Experiment of using Tangent to autodiff triton☆81Updated last year
- ☆28Updated 8 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆68Updated 9 months ago
- train with kittens!☆62Updated 10 months ago
- ☆38Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆72Updated 2 months ago
- ☆42Updated last week
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆36Updated last year
- Samples of good AI generated CUDA kernels☆90Updated 3 months ago
- A parallel framework for training deep neural networks☆63Updated 6 months ago
- PyTorch centric eager mode debugger☆48Updated 9 months ago
- ☆27Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆107Updated last year
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- ☆21Updated 6 months ago
- ☆74Updated 5 months ago
- FlexAttention w/ FlashAttention3 Support☆27Updated 11 months ago
- ML model training for edge devices☆166Updated last year