davda54 / ada-hessian
Easy-to-use AdaHessian optimizer (PyTorch)
☆77 Updated 4 years ago
Alternatives and similar repositories for ada-hessian:
Users who are interested in ada-hessian are comparing it to the libraries listed below.
- 🎩 PyTorch and JAX code for the Madam optimiser. ☆51 Updated 4 years ago
- 🧀 PyTorch code for the Fromage optimiser. ☆123 Updated 7 months ago
- ☆98 Updated 3 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆106 Updated 2 years ago
- Structured matrices for compressing neural networks ☆66 Updated last year
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆209 Updated 9 months ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆145 Updated last year
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning ☆270 Updated last year
- Collection of the latest, greatest, deep learning optimizers (for PyTorch) - CNN, NLP suitable ☆211 Updated 3 years ago
- ☆47 Updated 4 years ago
- ☆33 Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆72 Updated 6 months ago
- Efficient Householder Transformation in PyTorch ☆63 Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers" ☆180 Updated 3 years ago
- Very deep VAEs in JAX/Flax ☆46 Updated 3 years ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆182 Updated 2 months ago
- Codebase for Learning Invariances in Neural Networks ☆93 Updated 2 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch ☆79 Updated 3 years ago
- ☆36 Updated 3 years ago
- Code base for SRSGD. ☆28 Updated 4 years ago
- Codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks" ☆50 Updated last year
- PyTorch implementation of L2L execution algorithm ☆107 Updated 2 years ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite ☆103 Updated last year
- ☆49 Updated 4 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522) ☆59 Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch ☆95 Updated 3 years ago
- A PyTorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆44 Updated 5 years ago
- ἀνατομή is a PyTorch library to analyze representation of neural networks ☆62 Updated last year
- Distributed K-FAC Preconditioner for PyTorch ☆85 Updated this week
- A collection of optimizers, some arcane, others well known, for Flax. ☆29 Updated 3 years ago