JiJingYu / delta_orthogonal_init_pytorch

Delta Orthogonal Initialization for PyTorch

☆18

Alternatives and similar repositories for delta_orthogonal_init_pytorch

Users who are interested in delta_orthogonal_init_pytorch are comparing it to the libraries listed below.

minhtannguyen / SRSGD
Code base for SRSGD.
☆28 Updated 5 years ago
alecwangcq / EigenDamage-Pytorch
Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934
☆113 Updated 5 years ago
hongyanz / TRADES-smoothing
[JMLR] TRADES + random smoothing for certifiable robustness
☆14 Updated 4 years ago
toiaydcdyywlhzvlob / backpack
This repository is no longer maintained. Check
☆81 Updated 5 years ago
cc-hpc-itwm / GradVis
☆39 Updated 5 years ago
noahgolmant / pytorch-lars
"Layer-wise Adaptive Rate Scaling" in PyTorch
☆87 Updated 4 years ago
alinlab / lookahead_pruning
Lookahead: A Far-sighted Alternative of Magnitude-based Pruning (ICLR 2020)
☆33 Updated 4 years ago
VITA-Group / Orthogonality-in-CNNs
[NeurIPS '18] "Can We Gain More from Orthogonality Regularizations in Training Deep CNNs?" Official Implementation.
☆129 Updated 3 years ago
moskomule / shampoo.pytorch
An implementation of shampoo
☆77 Updated 7 years ago
anokland / local-loss
PyTorch code for training neural networks without global back-propagation
☆165 Updated 5 years ago
hongyi-zhang / Fixup
A Re-implementation of Fixed-update Initialization
☆152 Updated 6 years ago
lolemacs / soft-sharing
Implementation of soft parameter sharing for neural networks
☆69 Updated 4 years ago
stevenygd / SWALP
Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".
☆62 Updated 6 years ago
mpezeshki / Gradient_Starvation
Gradient Starvation: A Learning Proclivity in Neural Networks
☆61 Updated 4 years ago
huangleiBuaa / OthogonalWN
This project is the Torch implementation of our accepted AAAI 2018 paper: orthogonal weight normalization method for solving orthogonali…
☆57 Updated 5 years ago
ducha-aiki / LSUV-pytorch
Simple implementation of the LSUV initialization in PyTorch
☆58 Updated last year
eladhoffer / norm_matters
☆23 Updated 6 years ago
erogol / Net2Net
Net2Net implementation on PyTorch for any possible vision layers.
☆38 Updated 7 years ago
tbung / pytorch-revnet
Implementation of the reversible residual network in pytorch
☆105 Updated 3 years ago
eladhoffer / fix_your_classifier
☆34 Updated 6 years ago
lottery-ticket / rewinding-iclr20-public
☆70 Updated 5 years ago
vcl-iisc / ZSKD
Zero-Shot Knowledge Distillation in Deep Networks
☆67 Updated 3 years ago
BayesWatch / deficient-efficient
Successfully training approximations to full-rank matrices for efficiency in deep learning.
☆17 Updated 4 years ago
evcu / pytorchpruner
☆22 Updated 7 years ago
JingzhaoZhang / why-clipping-accelerates
A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…
☆46 Updated 5 years ago
briancheung / superposition
☆45 Updated 5 years ago
BayesWatch / pytorch-blockswap
Code for BlockSwap (ICLR 2020).
☆33 Updated 4 years ago
BayesWatch / pytorch-moonshine
Cheap distillation for convolutional neural networks.
☆33 Updated 6 years ago
asteroidhouse / self-tuning-networks
Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088
☆53 Updated 6 years ago
Healbadbad / curveball-pytorch
An implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" in PyTorch
☆21 Updated 7 years ago