LukasHedegaard / pytorch-benchmark
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption
☆87Updated last year
Related projects: ⓘ
- Collection of SOTA efficient computer vision models for embedded applications, with pre-trained weights and training recipes☆74Updated last week
- ☆186Updated 2 years ago
- Recent Advances on Efficient Vision Transformers☆46Updated last year
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆118Updated 9 months ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 11 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆145Updated this week
- A library for researching neural networks compression and acceleration methods.☆134Updated 3 weeks ago
- ☆66Updated this week
- Torch Distributed Experimental☆115Updated last month
- A code generator from ONNX to PyTorch code☆132Updated last year
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆50Updated 2 months ago
- PyTorch Pruning Example☆46Updated last year
- Simplification of pruned models for accelerated inference | SoftwareX https://doi.org/10.1016/j.softx.2021.100907☆34Updated last year
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆30Updated last year
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆60Updated last year
- Visualizer for PyTorch image models☆40Updated 3 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆42Updated last year
- Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer☆98Updated last year
- A research library for pytorch-based neural network pruning, compression, and more.☆161Updated last year
- ☆27Updated last year
- Dynamic Neural Architecture Search Toolkit☆28Updated 3 months ago
- ☆151Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆69Updated 2 years ago
- ☆42Updated 7 months ago
- Demystify RAM Usage in Multi-Process Data Loaders☆171Updated last year
- Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.☆119Updated 3 weeks ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆298Updated this week
- [ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"☆143Updated last year
- Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)☆207Updated this week
- ☆56Updated last year