TorchJD / torchjd
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
☆227Updated last week
Alternatives and similar repositories for torchjd:
Users that are interested in torchjd are comparing it to the libraries listed below
- ☆175Updated 4 months ago
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆425Updated 4 months ago
- Efficient optimizers☆189Updated this week
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆115Updated 2 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆230Updated last month
- 🧱 Modula software package☆188Updated 3 weeks ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆180Updated 7 months ago
- optimizer & lr scheduler & loss function collections in PyTorch☆289Updated this week
- ☆150Updated 8 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆367Updated 2 weeks ago
- When it comes to optimizers, it's always better to be safe than sorry☆220Updated 3 weeks ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 9 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆82Updated 2 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆91Updated 3 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆89Updated 3 weeks ago
- Parameter-Free Optimizers for Pytorch☆123Updated last year
- ☆289Updated 3 months ago
- Implementation of the proposed minGRU in Pytorch☆286Updated last month
- D-Adaptation for SGD, Adam and AdaGrad☆520Updated 3 months ago
- Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow☆196Updated 8 months ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆173Updated this week
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆92Updated last year
- Implementation of https://srush.github.io/annotated-s4☆490Updated 2 years ago
- Modern Fixed Point Systems using Pytorch☆89Updated last year
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆229Updated 3 months ago
- Easy Hypernetworks in Pytorch and Jax☆100Updated 2 years ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆555Updated 9 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆375Updated last week
- Unofficial JAX implementations of deep learning research papers☆155Updated 2 years ago
- Running Jax in PyTorch Lightning☆94Updated 4 months ago