zoq / Awesome-Optimizer
Collect optimizer related papers, data, repositories
☆88Updated 3 months ago
Alternatives and similar repositories for Awesome-Optimizer:
Users that are interested in Awesome-Optimizer are comparing it to the libraries listed below
- Distributed K-FAC Preconditioner for PyTorch☆85Updated this week
- ☆65Updated 3 months ago
- Neural Tangent Kernel Papers☆106Updated 2 months ago
- summer school materials☆44Updated last year
- ☆35Updated 3 months ago
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆108Updated last year
- Sampling with gradient-based Markov Chain Monte Carlo approaches☆97Updated 10 months ago
- Lightning-like training API for JAX with Flax☆38Updated 3 months ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch☆77Updated 9 months ago
- Parameter-Free Optimizers for Pytorch☆121Updated 10 months ago
- ☆164Updated 3 months ago
- Sparsity support for PyTorch☆35Updated last month
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆59Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆51Updated this week
- ☆134Updated this week
- Benchmarking optimization methods on convex problems.☆31Updated last year
- ☆16Updated 5 months ago
- ☆52Updated 5 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆52Updated 4 months ago
- Repo for the paper "Landscape Surrogate Learning Decision Losses for Mathematical Optimization Under Partial Information"☆36Updated last year
- Parallelizing non-linear sequential models over the sequence length☆51Updated last month
- ☆13Updated 3 years ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆61Updated 5 months ago
- Official code for our NeurIPS 2024 paper "einspace: Searching for Neural Architectures from Fundamental Operations"☆27Updated 4 months ago
- ☆30Updated 5 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆30Updated 4 months ago
- Information-Theoretic Diffusion: A brand new diffusion model / density estimator based on information theory.☆32Updated 7 months ago
- Deep Learning & Information Bottleneck☆58Updated last year
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Updated last year