201419 / Optimizer-PyTorch
A package of optimizers implemented with PyTorch.
☆66Updated 5 years ago
Alternatives and similar repositories for Optimizer-PyTorch
Users who are interested in Optimizer-PyTorch are comparing it to the libraries listed below.
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Updated 3 years ago
- ☆84Updated 4 years ago
- "Learning to learn by gradient descent by gradient descent" in PyTorch: a simple re-implementation.☆60Updated 5 years ago
- Pytorch Implementation of Neural Architecture Optimization☆113Updated 4 years ago
- [ICML 2020] Efficient Continuous Pareto Exploration in Multi-Task Learning☆146Updated 4 years ago
- ☆154Updated 5 years ago
- Comprehensive and precise reviews of advanced AI subfields.☆56Updated 4 years ago
- Pytorch implementation of TRP☆45Updated 4 years ago
- A collection of papers from NeurIPS, a top international conference in artificial intelligence and machine learning☆101Updated 5 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆412Updated last year
- dlADMM: Deep Learning Optimization via Alternating Direction Method of Multipliers☆163Updated 2 years ago
- ☆26Updated 4 years ago
- Teaches a student network from the knowledge obtained via training of a larger teacher network☆158Updated 7 years ago
- [ICML 2018] "Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions"☆152Updated 3 years ago
- [CVPR 2020] MTL-NAS: Task-Agnostic Neural Architecture Search towards General-Purpose Multi-Task Learning☆94Updated 2 years ago
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆150Updated 2 years ago
- Pytorch Implementation of the Stacked Capsule Autoencoders☆107Updated 4 years ago
- AdaX: Adaptive Gradient Descent with Exponential Long Term Memory☆34Updated 5 years ago
- PyTorch implementation of [1412.6553] and [1511.06530] tensor decomposition methods for convolutional layers.☆287Updated 3 years ago
- ☆40Updated last year
- Implementation and experiments for AdamW on Pytorch☆94Updated 5 years ago
- Multi-Task Learning Framework on PyTorch. State-of-the-art methods are implemented to effectively train models on multiple tasks.☆149Updated 6 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆337Updated 6 years ago
- Learning Latent Permutations with Gumbel-Sinkhorn Networks: implementation with PyTorch☆79Updated 2 years ago
- Unofficial implementation of Switching from Adam to SGD optimization in PyTorch.☆66Updated 2 years ago
- PyTorch implementation for GAL.☆56Updated 5 years ago
- ☆46Updated 5 years ago
- Learning Sparse Neural Networks through L0 regularization☆244Updated 5 years ago
- CP and Tucker decomposition for Convolutional Neural Networks☆86Updated 7 years ago
- This is my demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆180Updated 3 years ago