epoch-research / Compute-TrendsLinks
Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".
☆43Updated 3 years ago
Alternatives and similar repositories for Compute-Trends
Users that are interested in Compute-Trends are comparing it to the libraries listed below
Sorting:
- A collection of reproducible inference engine benchmarks☆38Updated 8 months ago
- Make triton easier☆50Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 5 months ago
- ☆12Updated 4 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆66Updated 2 weeks ago
- ☆71Updated 9 months ago
- train with kittens!☆63Updated last year
- Personal solutions to the Triton Puzzles☆20Updated last year
- A Data-Centric Compiler for Machine Learning☆85Updated 3 weeks ago
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- Samples of good AI generated CUDA kernels☆99Updated 7 months ago
- Training material for IPU users: tutorials, feature examples, simple applications☆87Updated 2 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆69Updated 3 weeks ago
- ☆38Updated 2 years ago
- ☆28Updated 11 months ago
- ☆83Updated last month
- Home for OctoML PyTorch Profiler☆114Updated 2 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆125Updated 3 months ago
- ☆129Updated last year
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 11 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆52Updated 10 months ago
- ☆115Updated last year
- ☆219Updated 11 months ago
- ML/DL Math and Method notes☆65Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Updated last year
- Train, tune, and infer Bamba model☆137Updated 7 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Updated 2 years ago
- ☆120Updated last year
- Experiment of using Tangent to autodiff triton☆81Updated last year