Rainymood / Gradient-Descent-The-Ultimate-Optimizer
Paper and code for Gradient Descent: The Ultimate Optimizer
☆24 Updated 2 years ago
Alternatives and similar repositories for Gradient-Descent-The-Ultimate-Optimizer
Users who are interested in Gradient-Descent-The-Ultimate-Optimizer are comparing it to the libraries listed below.
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube ☆47 Updated 6 years ago
- A lightweight library for tensorflow 2.0 ☆66 Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search" ☆40 Updated last year
- Statistical adaptive stochastic optimization methods ☆32 Updated 5 years ago
- ☆45 Updated 5 years ago
- 👩 Pytorch and Jax code for the Madam optimiser. ☆52 Updated 4 years ago
- [NeurIPS'19] [PyTorch] Adaptive Regularization in NN ☆68 Updated 5 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses" ☆14 Updated 5 years ago
- A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley") ☆40 Updated 5 years ago
- Official code for ICML 2020 paper "Variational Bayesian Quantization" ☆24 Updated 2 years ago
- The Singular Values of Convolutional Layers ☆72 Updated 7 years ago
- Easing non-convex optimization with neural networks. ☆23 Updated 7 years ago
- ☆25 Updated last year
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network ☆63 Updated 4 years ago
- NeurIPS 2019 Paper Implementation ☆12 Updated 2 years ago
- Uncertainty Autoencoders, AISTATS 2019 ☆56 Updated 6 years ago
- Tools for working with Long Short-Term Memory (LSTM) networks and sequences in Pytorch ☆36 Updated 4 years ago
- A GPT, made only of MLPs, in Jax ☆58 Updated 4 years ago
- Pretrained TorchVision models on CIFAR10 dataset (with weights) ☆24 Updated 5 years ago
- Official repository for our ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology ☆36 Updated 4 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited ☆37 Updated 2 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow. ☆23 Updated 5 years ago
- TBA ☆76 Updated 6 years ago
- Code base for SRSGD. ☆29 Updated 5 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch ☆25 Updated 4 years ago
- A discrete sequential VAE ☆40 Updated 5 years ago
- Ancestral Gumbel-Top-k Sampling ☆25 Updated 5 years ago
- Implementation of Kronecker Attention in Pytorch ☆19 Updated 5 years ago
- Automatic and Simultaneous Adjustment of Learning Rate and Momentum for Stochastic Gradient Descent ☆46 Updated 5 years ago
- ☆12 Updated 5 years ago