epoch-research / Compute-Trends
Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".
☆40Updated 3 years ago
Alternatives and similar repositories for Compute-Trends:
Users that are interested in Compute-Trends are comparing it to the libraries listed below
- Make triton easier☆47Updated 9 months ago
- Personal solutions to the Triton Puzzles☆18Updated 7 months ago
- ☆25Updated last week
- ☆61Updated 2 weeks ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆64Updated 3 months ago
- ☆26Updated last month
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated last year
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 2 months ago
- ☆12Updated 3 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆47Updated last year
- A list of awesome neural symbolic papers.☆45Updated 2 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆43Updated 7 months ago
- Framework to reduce autotune overhead to zero for well known deployments.☆62Updated 2 weeks ago
- ☆21Updated last week
- ☆25Updated last year
- Low-Rank Llama Custom Training☆21Updated 11 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆47Updated 2 weeks ago
- GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU☆22Updated last week
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- ☆100Updated 6 months ago
- Explore training for quantized models☆16Updated 2 months ago
- Unit Scaling demo and experimentation code☆16Updated last year
- Boosting 4-bit inference kernels with 2:4 Sparsity☆67Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- ☆26Updated last year
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated last year
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆26Updated 3 months ago
- FlexAttention w/ FlashAttention3 Support☆26Updated 5 months ago