facebookresearch / qhoptim
Implementations of quasi-hyperbolic optimization algorithms.
☆102 Updated 4 years ago
Related projects
Alternatives and complementary repositories for qhoptim
- Implementation of the "Variational Dropout and the Local Reparameterization Trick" paper in PyTorch ☆50 Updated 7 years ago
- PyTorch implementation of Variational Dropout Sparsifies Deep Neural Networks ☆84 Updated 2 years ago
- Code for MSID, a Multi-Scale Intrinsic Distance for comparing generative models, studying neural networks, and more! ☆50 Updated 5 years ago
- Loss Patterns of Neural Networks ☆82 Updated 3 years ago
- Custom CUDA kernel for {2, 3}d relative attention with a PyTorch wrapper ☆43 Updated 4 years ago
- PyTorch Implementations of Dropout Variants ☆87 Updated 6 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth" ☆62 Updated 3 months ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆38 Updated 6 years ago
- Code for "Aggregated Momentum: Stability Through Passive Damping", Lucas et al. 2018 ☆34 Updated 6 years ago
- Simple implementation of the LSUV initialization in PyTorch ☆58 Updated 10 months ago
- An implementation of Shampoo ☆74 Updated 6 years ago
- [NeurIPS'19] [PyTorch] Adaptive Regularization in NN ☆67 Updated 5 years ago
- MTAdam: Automatic Balancing of Multiple Training Loss Terms ☆36 Updated 4 years ago
- An implementation of MixMatch with PyTorch ☆36 Updated 3 years ago
- Pretrained TorchVision models on CIFAR10 dataset (with weights) ☆24 Updated 4 years ago
- Utilities for PyTorch ☆89 Updated 2 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088 ☆53 Updated 5 years ago
- Variance Networks: When Expectation Does Not Meet Your Expectations, ICLR 2019 ☆39 Updated 4 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning ☆33 Updated 4 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks ☆54 Updated 2 years ago
- Tensorboard parser ☆22 Updated 2 weeks ago
- A discrete sequential VAE ☆38 Updated 4 years ago
- "Learning Rate Dropout" in PyTorch ☆34 Updated 4 years ago
- The Deep Weight Prior, ICLR 2019 ☆44 Updated 3 years ago
- A PyTorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆44 Updated 4 years ago
- This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F… ☆36 Updated 4 years ago
- Supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs. ☆47 Updated 5 years ago