TheMody / No-learning-rates-needed-Introducing-SALSA-Stable-Armijo-Line-Search-Adaptation
SaLSa Optimizer implementation (No learning rates needed)
☆29Updated last week
Alternatives and similar repositories for No-learning-rates-needed-Introducing-SALSA-Stable-Armijo-Line-Search-Adaptation:
Users that are interested in No-learning-rates-needed-Introducing-SALSA-Stable-Armijo-Line-Search-Adaptation are comparing it to the libraries listed below
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆98Updated 3 months ago
- A HuggingFace compatible Small Language Model trainer.☆74Updated 2 months ago
- ☆31Updated 11 months ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆61Updated 11 months ago
- Generate graph/data embeddings multiple ways☆50Updated last week
- FlashRNN - Fast RNN Kernels with I/O Awareness☆82Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 10 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆86Updated 2 weeks ago
- ☆46Updated 5 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 6 months ago
- This is the official repo for Gradient Agreement Filtering (GAF).☆23Updated 2 months ago