TorchJD / torchjdLinks
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning).
☆264Updated this week
Alternatives and similar repositories for torchjd
Users that are interested in torchjd are comparing it to the libraries listed below
Sorting:
- ☆206Updated 8 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆275Updated 3 weeks ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆119Updated 5 months ago
- optimizer & lr scheduler & loss function collections in PyTorch☆327Updated last week
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆429Updated 7 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆184Updated 10 months ago
- Efficient optimizers☆253Updated this week
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆392Updated 4 months ago
- Code for our NeurIPS 2022 paper☆369Updated 2 years ago
- For optimization algorithm research and development.☆524Updated this week
- Implementation of the proposed minGRU in Pytorch☆300Updated 4 months ago
- ☆298Updated 7 months ago
- D-Adaptation for SGD, Adam and AdaGrad☆524Updated 6 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆337Updated 3 weeks ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆179Updated last week
- ☆115Updated last month
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆100Updated 7 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆100Updated last year
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- 🧱 Modula software package☆216Updated last week
- Laplace approximations for Deep Learning.☆515Updated 3 months ago
- Official code for our NeurIPS 2024 paper "einspace: Searching for Neural Architectures from Fundamental Operations"☆28Updated 9 months ago
- ☆150Updated 11 months ago
- Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow☆202Updated 11 months ago
- A repository for log-time feedforward networks☆223Updated last year
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆289Updated 2 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆94Updated 2 weeks ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆389Updated this week
- Getting crystal-like representations with harmonic loss☆192Updated 4 months ago