lixilinx / psgd_torchLinks
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)
☆188Updated this week
Alternatives and similar repositories for psgd_torch
Users that are interested in psgd_torch are comparing it to the libraries listed below
Sorting:
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆190Updated last year
- ☆225Updated last year
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- 🧱 Modula software package☆309Updated 3 months ago
- ☆234Updated 9 months ago
- ☆62Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆97Updated last year
- A library for unit scaling in PyTorch☆132Updated 4 months ago
- ☆60Updated 3 years ago
- ☆40Updated last year
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆63Updated 4 years ago
- LoRA for arbitrary JAX models and functions☆143Updated last year
- JMP is a Mixed Precision library for JAX.☆211Updated 10 months ago
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 3 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆74Updated 5 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆402Updated this week
- ☆285Updated last year
- Efficient optimizers☆275Updated 3 weeks ago
- minGPT in JAX☆48Updated 3 years ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆62Updated 2 years ago
- JAX Synergistic Memory Inspector☆183Updated last year
- Experiment of using Tangent to autodiff triton☆80Updated last year
- supporting pytorch FSDP for optimizers☆84Updated last year
- A simple library for scaling up JAX programs☆144Updated last month
- Run PyTorch in JAX. 🤝☆309Updated last month
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago
- Accelerated First Order Parallel Associative Scan☆192Updated last year
- A Python package of computer vision models for the Equinox ecosystem.☆110Updated last year
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆131Updated 3 years ago