LukasHedegaard / pytorch-benchmarkLinks
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption
☆109Updated 2 years ago
Alternatives and similar repositories for pytorch-benchmark
Users that are interested in pytorch-benchmark are comparing it to the libraries listed below
Sorting:
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Updated 2 weeks ago
- Torch Distributed Experimental☆117Updated last year
- ☆160Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- This repository contains the experimental PyTorch native float8 training UX☆227Updated last year
- pytorch-profiler☆50Updated 2 years ago
- A research library for pytorch-based neural network pruning, compression, and more.☆162Updated 3 years ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆218Updated this week
- Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer☆114Updated 2 years ago
- The Triton backend for the PyTorch TorchScript models.☆171Updated last week
- ☆170Updated 2 years ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆129Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆114Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆46Updated last week
- ☆208Updated 4 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago
- ML model training for edge devices☆168Updated 2 years ago
- Recent Advances on Efficient Vision Transformers☆55Updated 3 years ago
- A block oriented training approach for inference time optimization.☆34Updated last year
- Model compression for ONNX☆99Updated last year
- ☆34Updated 7 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Updated 2 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆73Updated 3 years ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆131Updated last year
- Implementation of a Transformer, but completely in Triton☆278Updated 3 years ago
- ☆36Updated last year
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆247Updated 2 weeks ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆125Updated last year
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆51Updated 2 years ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆118Updated last year