Adaptive gradient descent without descent
☆52Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for adaptive_GD
Users that are interested in adaptive_GD are comparing it to the libraries listed below.
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Oct 20, 2023Updated 2 years ago
- Companion codes for the paper "Decentralized Frank-Wolfe Algorithm for Convex and Non-convex Problems", accepted by IEEE TAC☆12Oct 3, 2017Updated 8 years ago
- Benchmarking optimization methods on convex problems.☆34Aug 8, 2025Updated 8 months ago
- ☆12Aug 28, 2023Updated 2 years ago
- Code for the paper "Understanding the Role of Momentum in Stochastic Gradient Methods"☆14Oct 27, 2019Updated 6 years ago
- Anderson accelerated Douglas-Rachford splitting☆30Dec 11, 2020Updated 5 years ago
- Nonconvex Regularized Robust Regression via I-LAMM Algorithm☆12May 9, 2022Updated 3 years ago
- Code for Non-convex Learning via Replica Exchange Stochastic Gradient MCMC, ICML 2020.☆26Dec 3, 2020Updated 5 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆19May 11, 2019Updated 6 years ago
- ☆20Aug 30, 2023Updated 2 years ago
- ☆10Jun 16, 2020Updated 5 years ago
- Source code for Stochastic Conjugate Gradient Algorithm with Variance Reduction☆10Nov 22, 2017Updated 8 years ago
- ☆37Feb 4, 2022Updated 4 years ago
- Optimization using Stochastic quasi-Newton methods☆42Feb 3, 2017Updated 9 years ago
- MATLAB implementations of a variety of machine learning/signal processing algorithms.☆11Aug 24, 2016Updated 9 years ago
- ☆11Apr 8, 2016Updated 10 years ago
- This repo contains the code used for NeurIPS 2019 paper "Asymmetric Valleys: Beyond Sharp and Flat Local Minima".☆14Oct 25, 2019Updated 6 years ago
- L4: Practical loss-based stepsize adaptation for PyTorch☆18May 7, 2021Updated 4 years ago
- Material for the course of "Mathematics of Transformer"☆21Aug 3, 2025Updated 8 months ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Jan 11, 2019Updated 7 years ago
- Realistic renewable energy scenarios for stochastic grid optimization problems☆22Aug 17, 2022Updated 3 years ago
- ☆34Jan 25, 2024Updated 2 years ago
- Fast algorithms for sparse principal component analysis☆17Aug 5, 2016Updated 9 years ago
- SBL matlab code☆26Nov 20, 2019Updated 6 years ago
- Code for ICML 2019 paper on "Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations"☆19Jan 2, 2021Updated 5 years ago
- Matrix Iteratively Reweighted Least Squares for low-rank matrix completion and estimation☆12Dec 30, 2020Updated 5 years ago
- ☆10Jul 6, 2021Updated 4 years ago
- Variational Factorization Machines☆17Dec 20, 2016Updated 9 years ago
- Matlab/Octave toolbox for nonconvex optimization☆51Jun 23, 2016Updated 9 years ago
- FastAST - A fast primal-dual interior point method for line spectral estimation via atomic norm soft thresholding.☆30Jan 27, 2023Updated 3 years ago
- Scrapes novels published on the web and converts them to Aozora Bunko format text☆19Feb 9, 2017Updated 9 years ago
- [ICNC 2020] Hybrid Beamformer Codebook Design and Ordering for Compressive mmWave Channel Estimation☆10Jul 26, 2019Updated 6 years ago
- AdaX: Adaptive Gradient Descent with Exponential Long Term Memory☆34May 8, 2020Updated 5 years ago
- Nonlinear SVGD for Learning Diversified Mixture Models☆13Jan 23, 2019Updated 7 years ago
- The performance of turbo equalizers in both ISI channel and multipath fading channel is evaluated☆11Nov 24, 2020Updated 5 years ago
- ☆11Jul 27, 2018Updated 7 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Nov 21, 2021Updated 4 years ago
- A collection of stochastic proximal gradient methods for composite non-convex problems.☆24Jun 17, 2020Updated 5 years ago
- ☆28Sep 3, 2019Updated 6 years ago