yashkant / padam-tensorflow
Reproducing the paper "PADAM: Closing The Generalization Gap of Adaptive Gradient Methods In Training Deep Neural Networks" for the ICLR 2019 Reproducibility Challenge
☆51Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for padam-tensorflow
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Updated last year
- Mathematical consequences of orthogonal weights initialization and regularization in deep learning. Experiments with gain-adjusted orthog…☆17Updated 5 years ago
- ☆24Updated 4 years ago
- This repository has moved to: https://github.com/tkipf/c-swm☆27Updated 4 years ago
- A Random Matrix Approach to Extreme Learning Machine☆14Updated 6 years ago
- Code base for SRSGD.☆28Updated 4 years ago
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".☆12Updated 7 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Updated 6 years ago
- Code for our paper: "Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers".☆21Updated 2 years ago
- SparseMax activation function implementation (ICML 2016) (PyTorch)☆26Updated 6 years ago
- A TensorFlow implementation of the paper 'Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks'☆30Updated 6 months ago
- A pytorch implementation of Information Bottleneck GAN☆28Updated 5 years ago
- Official PyTorch implementation of the paper : ProbAct: A Probabilistic Activation Function for Deep Neural Networks.☆13Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 2 months ago
- ☆29Updated 3 years ago
- ☆26Updated 5 years ago
- Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks☆13Updated 4 years ago
- Implementation of Kronecker Attention in Pytorch☆17Updated 4 years ago
- Meta-SGD Algorithms Implementation☆21Updated 4 months ago
- Unofficial pytorch implementation of ReZero in ResNet☆23Updated 4 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆44Updated 4 years ago
- Code for Stochastic Hyperparameter Optimization through Hypernetworks☆23Updated 6 years ago
- Anonymous Repository for ICML LRG Workshop: On Explainability Techniques for Graph Convolutional Networks☆23Updated 5 years ago
- Reproducible code for Augmentation paper☆18Updated 5 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Updated 4 years ago
- rich posterior approximations and anomaly detection☆20Updated 5 years ago
- Pytorch implementation for "Particle Flow Bayes' Rule"☆13Updated 5 years ago
- ReGAN: Sequence GAN using RE[INFORCE|LAX|BAR] based PG estimators☆41Updated 6 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated last year
- Deep Reinforcement Active learning - Master Thesis☆18Updated last year