JiJingYu / delta_orthogonal_init_pytorch
Delta Orthogonal Initialization for PyTorch
☆18Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for delta_orthogonal_init_pytorch
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆44Updated 4 years ago
- Code base for SRSGD.☆28Updated 4 years ago
- This project is the Torch implementation of our accepted AAAI 2018 paper : orthogonal weight normalization method for solving orthogonali…☆57Updated 4 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness☆14Updated 4 years ago
- Computing various norms/measures on over-parametrized neural networks☆49Updated 5 years ago
- ☆19Updated 5 years ago
- The Deep Weight Prior, ICLR 2019☆44Updated 3 years ago
- ☆23Updated 5 years ago
- ☆34Updated 5 years ago
- Repository with code for paper "Inhibited Softmax for Uncertainty Estimation in Neural Networks"☆25Updated 5 years ago
- Reproducible code for Augmentation paper☆18Updated 5 years ago
- This repository is no longer maintained. Check☆82Updated 4 years ago
- Bootstrap Your Own Latent (BYOL) pytorch implementation using DistributedDataParallel.☆28Updated last year
- Lookahead: A Far-sighted Alternative of Magnitude-based Pruning (ICLR 2020)☆33Updated 4 years ago
- pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…☆17Updated 4 years ago
- [ICLR 2019] ProbGAN: Towards Probabilistic GAN with Theoretical Guarantees☆32Updated 4 years ago
- An implementation of shampoo☆74Updated 6 years ago
- Implementation of Information Dropout☆39Updated 7 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 3 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- Implementation of "Variational Dropout and the Local Reparameterization Trick" paper with Pytorch☆50Updated 7 years ago
- PyTorch Implementations of Dropout Variants☆87Updated 6 years ago
- ICML 2020, Estimating Generalization under Distribution Shifts via Domain-Invariant Representations☆21Updated 4 years ago
- Mean Absolute Error Does Not Treat Examples Equally and Gradient Magnitude’s Variance Matters☆30Updated 4 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 5 years ago
- Riemannian approach to batch normalization☆18Updated 7 years ago
- Cheap distillation for convolutional neural networks.☆33Updated 6 years ago
- PyTorch Implementation of CVPR'19 - On the Intrinsic Dimensionality of Image Representation☆23Updated 5 years ago
- ☆22Updated 6 years ago
- Applied Sparse regularization (L1), Weight decay regularization (L2), ElasticNet, GroupLasso and GroupSparseLasso to Neuronal Network.☆38Updated 2 years ago