vinbhaskara / adams
Exploiting Uncertainty of Loss Landscape for Stochastic Optimization
☆15Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for adams
- "Learning Rate Dropout" in PyTorch☆34Updated 4 years ago
- Unofficial pytorch implementation of ReZero in ResNet☆23Updated 4 years ago
- ☆23Updated 5 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago
- Implementation of Kronecker Attention in Pytorch☆17Updated 4 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated last year
- ☆28Updated 4 years ago
- ☆13Updated 6 years ago
- Simple experiment of Apex (A PyTorch Extension)☆47Updated 5 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Updated 3 years ago
- A pytorch implementation of Information Bottleneck GAN☆28Updated 5 years ago
- Implementation of AlphaZero in PyTorch.☆10Updated 5 years ago
- ☆34Updated 5 years ago
- High-Level Training, Data Augmentation, and Utilities for Pytorch☆13Updated 5 years ago
- Implementation of the Budgeted Super Networks☆26Updated 5 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Updated 6 years ago
- ☆13Updated 7 years ago
- Advanced optimizer with Gradient-Centralization☆21Updated 4 years ago
- ☆13Updated 6 years ago
- PyTorch C++ Extension Example☆15Updated 6 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 10 months ago
- Lambda Networks implemented in PyTorch☆13Updated 3 years ago
- Simple implementation of the LSUV initialization in PyTorch☆58Updated 9 months ago
- An implementation of shampoo☆74Updated 6 years ago
- ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting☆13Updated 5 years ago
- Code for replication of the paper "GANs beyond divergence minimization"☆21Updated 5 years ago
- Odds and Ends and Things I've implemented.☆78Updated 5 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 3 years ago
- A disciplined approach to neural network parameters - Reviewing the approach for setting Hyper parameters by Leslie Smith☆11Updated 6 years ago