adityaiitb / pyprof2

PyProf2: PyTorch Profiling tool

☆82

Alternatives and similar repositories for pyprof2

Users who are interested in pyprof2 are comparing it to the libraries listed below.

stevenygd / SWALP
Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".
☆62Updated 6 years ago
pytorch / extension-script
Example repository for custom C++/CUDA operators for TorchScript
☆114Updated 2 years ago
ARM-software / scalpel
This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…
☆41Updated 6 years ago
NVlabs / tensorcom
☆109Updated 4 years ago
Cerebras / online-normalization
Online Normalization for Training Neural Networks (Companion Repository)
☆83Updated 4 years ago
hongyi-zhang / Fixup
A Re-implementation of Fixed-update Initialization
☆152Updated 6 years ago
NVlabs / condensa
Programmable Neural Network Compression
☆148Updated 3 years ago
ducha-aiki / LSUV-pytorch
Simple implementation of the LSUV initialization in PyTorch
☆58Updated last year
zhuwenxi / pytorch-profiling-tool
☆54Updated 7 years ago
suvojit-0x55aa / mixed-precision-pytorch
Training with FP16 weights in PyTorch
☆79Updated 5 years ago
moskomule / shampoo.pytorch
An implementation of shampoo
☆77Updated 7 years ago
noahgolmant / pytorch-lars
"Layer-wise Adaptive Rate Scaling" in PyTorch
☆87Updated 4 years ago
sniklaus / pytorch-extension
an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors
☆118Updated 2 months ago
ppwwyyxx / FRN-on-common-ImageNet-baseline
Filter Response Normalization tested on better ImageNet baselines.
☆35Updated 5 years ago
ag14774 / diffdist
☆62Updated 5 years ago
ildoonet / remote-dataloader
PyTorch DataLoader processed in multiple remote computation machines for heavy data processings
☆67Updated 5 years ago
TezRomacH / layer-to-layer-pytorch
PyTorch implementation of L2L execution algorithm
☆107Updated 2 years ago
cybertronai / imagenet18
Train ImageNet in 18 minutes on AWS
☆133Updated last year
jongwook / tfrecord_lite
Make TFRecord Usable Again
☆88Updated 2 years ago
alexfjw / prunnable-layers-pytorch
Prunable nn layers for pytorch.
☆48Updated 7 years ago
jwfromm / Riptide
Simple Training and Deployment of Fast End-to-End Binary Networks
☆157Updated 3 years ago
NVlabs / webloader
Efficient DataLoader for PyTorch and Keras for loading datasets from web servers and object stores.
☆30Updated 5 years ago
tensorpack / dataflow
Efficient Data Loading Pipeline in Pure Python
☆212Updated 4 years ago
noahgolmant / pytorch-lr-dropout
"Learning Rate Dropout" in PyTorch
☆34Updated 5 years ago
szagoruyko / binary-wide-resnet
PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)
☆126Updated 6 years ago
utsaslab / MONeT
MONeT framework for reducing memory consumption of DNN training
☆173Updated 4 years ago
BayesWatch / pytorch-prunes
Code for https://arxiv.org/abs/1810.04622
☆141Updated 5 years ago
Separius / CudaRelativeAttention
custom cuda kernel for {2, 3}d relative attention with pytorch wrapper
☆43Updated 5 years ago
MatthieuCourbariaux / 8-bit-deep-learning
Training neural networks with 8-bit computations
☆28Updated 9 years ago
pgmmpk / tfrecord
Python way to Read/Write TFRecords
☆64Updated 7 years ago