epoch-research / Compute-Trends
Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".
☆37Updated 2 years ago
Alternatives and similar repositories for Compute-Trends:
Users that are interested in Compute-Trends are comparing it to the libraries listed below
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆32Updated 9 months ago
- Make triton easier☆44Updated 7 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆43Updated 6 months ago
- ☆21Updated this week
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆24Updated 2 months ago
- Low-Rank Llama Custom Training☆21Updated 10 months ago
- Personal solutions to the Triton Puzzles☆18Updated 6 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated 10 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆63Updated last month
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated last month
- ☆36Updated last year
- ☆45Updated last year
- TensorRT LLM Benchmark Configuration☆12Updated 6 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems☆99Updated last week
- ☆54Updated last week
- ☆58Updated 8 months ago
- Unit Scaling demo and experimentation code☆16Updated 10 months ago
- Some microbenchmarks and design docs before commencement☆12Updated 3 years ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- ☆25Updated last year
- A safetensors extension to efficiently store sparse quantized tensors on disk☆66Updated this week
- GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU☆21Updated 3 months ago
- MLPerf™ logging library☆32Updated 3 weeks ago
- TORCH_LOGS parser for PT2☆30Updated this week
- A list of awesome neural symbolic papers.☆44Updated 2 years ago
- ☆38Updated last year
- Benchmarking PyTorch 2.0 different models☆21Updated last year
- GPU operators for sparse tensor operations☆30Updated 10 months ago
- ☆97Updated 5 months ago