lixilinx / psgd_torchLinks
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)
☆190Updated last month
Alternatives and similar repositories for psgd_torch
Users that are interested in psgd_torch are comparing it to the libraries listed below
Sorting:
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆191Updated last year
- ☆246Updated last year
- A library for unit scaling in PyTorch☆133Updated 7 months ago
- 🧱 Modula software package☆322Updated 5 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- JMP is a Mixed Precision library for JAX.☆211Updated last year
- LoRA for arbitrary JAX models and functions☆144Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Updated last year
- ☆62Updated last year
- ☆234Updated 11 months ago
- ☆60Updated 3 years ago
- Accelerated First Order Parallel Associative Scan☆196Updated last month
- A functional training loops library for JAX☆88Updated last year
- ☆40Updated 2 years ago
- ☆291Updated last year
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62Updated 4 years ago
- A simple library for scaling up JAX programs☆145Updated 3 months ago
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Updated 2 years ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆406Updated this week
- Efficient optimizers☆281Updated last month
- ☆18Updated last year
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆37Updated 3 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆75Updated 7 months ago
- Experiment of using Tangent to autodiff triton☆82Updated 2 years ago
- Named tensors with first-class dimensions for PyTorch☆332Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago
- A Python package of computer vision models for the Equinox ecosystem.☆110Updated last year
- Unofficial JAX implementations of deep learning research papers☆161Updated 3 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆119Updated 3 years ago
- JAX Synergistic Memory Inspector☆184Updated last year