zhijian-liu / torchprofileLinks

A general and accurate MACs / FLOPs profiler for PyTorch models

☆624

Alternatives and similar repositories for torchprofile

Users that are interested in torchprofile are comparing it to the libraries listed below

Sorting:

1adrianb / pytorch-estimate-flops
Estimate/count FLOPS for a given neural network using pytorch
☆305Updated 3 years ago
ucbrise / actnn
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
☆200Updated 2 years ago
JJGO / shrinkbench
PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.
☆430Updated 2 years ago
amirgholami / ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆280Updated last year
idstcv / ZenNAS
☆226Updated 3 years ago
kaiyuyue / torchshard
Slicing a PyTorch Tensor Into Parallel Shards
☆299Updated last month
NVlabs / Taylor_pruning
Pruning Neural Networks with Taylor criterion in Pytorch
☆320Updated 5 years ago
aojunzz / NM-sparsity
☆236Updated 2 years ago
mit-han-lab / hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
☆336Updated last year
NVIDIA / PyProf
A GPU performance profiling tool for PyTorch models
☆503Updated 4 years ago
Tiiiger / QPyTorch
Low Precision Arithmetic Simulation in PyTorch
☆282Updated last year
lucaslie / torchprune
A research library for pytorch-based neural network pruning, compression, and more.
☆162Updated 2 years ago
awwong1 / torchprof
PyTorch layer-by-layer model profiler
☆606Updated 4 years ago
MingSun-Tse / Efficient-Deep-Learning
Collection of recent methods on (deep) neural network compression and acceleration.
☆948Updated 4 months ago
chrischoy / pytorch-custom-cuda-tutorial
Tutorial for building a custom CUDA function for Pytorch
☆519Updated 6 years ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆282Updated 4 years ago
ppwwyyxx / RAM-multiprocess-dataloader
Demystify RAM Usage in Multi-Process Data Loaders
☆196Updated 2 years ago
VITA-Group / TENAS
[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, …
☆169Updated 3 years ago
frgfm / torch-scan
Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)
☆219Updated 4 months ago
facebookresearch / AttentiveNAS
code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"
☆104Updated 3 years ago
PhilJd / contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
☆290Updated 4 years ago
Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆443Updated 2 years ago
JiahuiYu / slimmable_networks
Slimmable Networks, AutoSlim, and Beyond, ICLR 2019, and ICCV 2019
☆923Updated 2 years ago
walkerning / aw_nas
aw_nas: A Modularized and Extensible NAS Framework
☆250Updated last year
csyhhu / Awesome-Deep-Neural-Network-Compression
Summary, Code for Deep Neural Network Quantization
☆552Updated last month
marcoancona / TorchPruner
On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…
☆167Updated 5 years ago
cedrickchee / awesome-ml-model-compression
Awesome machine learning model compression research papers, quantization, tools, and learning material.
☆527Updated 10 months ago
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆277Updated 7 months ago
mit-han-lab / amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆443Updated last year
mit-han-lab / haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
☆392Updated 4 years ago