kach / gradient-descent-the-ultimate-optimizerLinks
Code for our NeurIPS 2022 paper
☆369Updated 2 years ago
Alternatives and similar repositories for gradient-descent-the-ultimate-optimizer
Users that are interested in gradient-descent-the-ultimate-optimizer are comparing it to the libraries listed below
Sorting:
- ☆785Updated last month
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- Named tensors with first-class dimensions for PyTorch☆331Updated 2 years ago
- This library would form a permanent home for reusable components for deep probabilistic programming. The library would form and harness a…☆309Updated 4 months ago
- A library to inspect and extract intermediate layers of PyTorch models.☆475Updated 3 years ago
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆484Updated 3 years ago
- Laplace approximations for Deep Learning.☆521Updated 6 months ago
- Implementation of https://srush.github.io/annotated-s4☆504Updated 4 months ago
- Constrained optimization toolkit for PyTorch☆699Updated 2 months ago
- Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"☆493Updated 2 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.☆595Updated 9 months ago
- D-Adaptation for SGD, Adam and AdaGrad☆526Updated 9 months ago
- TorchOpt is an efficient library for differentiable optimization built upon PyTorch.☆615Updated 3 weeks ago
- Unofficial JAX implementations of deep learning research papers☆158Updated 3 years ago
- A general-purpose, deep learning-first library for constrained optimization in PyTorch☆145Updated 4 months ago
- Gaussian-Bernoulli Restricted Boltzmann Machines☆105Updated 2 years ago
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…☆271Updated this week
- Compositional Linear Algebra☆489Updated 2 months ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.☆260Updated 7 months ago
- Optimal transport tools implemented with the JAX framework, to solve large scale matching problems of any flavor.☆662Updated last week
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization☆344Updated last year
- A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training☆492Updated last year
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory☆438Updated last year
- Easy Hypernetworks in Pytorch and Jax☆105Updated 2 years ago
- Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow☆205Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆400Updated this week
- Tensors, for human consumption☆1,316Updated this week
- ☆311Updated 7 months ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆281Updated 2 years ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆252Updated 3 years ago