zoq / Awesome-OptimizerLinks
Collect optimizer related papers, data, repositories
☆91Updated 7 months ago
Alternatives and similar repositories for Awesome-Optimizer
Users that are interested in Awesome-Optimizer are comparing it to the libraries listed below
Sorting:
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch☆79Updated last year
- Distributed K-FAC preconditioner for PyTorch☆87Updated this week
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆110Updated 2 years ago
- Neural Tangent Kernel Papers☆114Updated 5 months ago
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆63Updated last year
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Updated last year
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆22Updated 7 months ago
- Sampling with gradient-based Markov Chain Monte Carlo approaches☆103Updated last year
- ☆53Updated 8 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 9 months ago
- ☆68Updated 6 months ago
- 😎 A curated list of tensor decomposition resources for model compression.☆69Updated this week
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆59Updated 3 months ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆41Updated 6 years ago
- ☆16Updated 9 months ago
- Code for visualizing the loss landscape of neural nets☆10Updated 4 years ago
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)☆68Updated 10 months ago
- Lightning-like training API for JAX with Flax☆41Updated 6 months ago
- Code for the book "The Elements of Differentiable Programming".☆88Updated this week
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Updated last year
- Benchmarking optimization methods on convex problems.☆32Updated last year
- ☆32Updated 8 months ago
- ☆229Updated 4 months ago
- Repo for the paper "Landscape Surrogate Learning Decision Losses for Mathematical Optimization Under Partial Information"☆36Updated last year
- This repository contains PyTorch implementations of various random feature maps for dot product kernels.☆21Updated 11 months ago
- Sparsity support for PyTorch☆35Updated 3 months ago
- ☆37Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆37Updated 2 years ago
- Deep Learning & Information Bottleneck☆60Updated last year
- ☆9Updated 2 years ago