lixilinx / psgd_torch
PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioners, low-rank approximation preconditioner, and more)
☆179 · Updated last month
Alternatives and similar repositories for psgd_torch
Users who are interested in psgd_torch are comparing it to the libraries listed below
- 🧱 Modula software package ☆204 · Updated 3 months ago
- ☆197 · Updated 7 months ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆188 · Updated 7 months ago
- Parameter-Free Optimizers for PyTorch ☆130 · Updated last year
- ☆230 · Updated 5 months ago
- JMP is a Mixed Precision library for JAX. ☆206 · Updated 5 months ago
- ☆40 · Updated last year
- LoRA for arbitrary JAX models and functions ☆140 · Updated last year
- ☆60 · Updated 3 years ago
- A library for unit scaling in PyTorch ☆125 · Updated 7 months ago
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization) ☆14 · Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆94 · Updated 7 months ago
- ☆17 · Updated 10 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆71 · Updated 2 weeks ago
- minGPT in JAX ☆48 · Updated 3 years ago
- Supporting PyTorch FSDP for optimizers ☆82 · Updated 7 months ago
- ☆53 · Updated 9 months ago
- A simple library for scaling up JAX programs ☆139 · Updated 8 months ago
- A functional training loops library for JAX ☆88 · Updated last year
- Named tensors with first-class dimensions for PyTorch ☆332 · Updated 2 years ago
- Open source code for EigenGame. ☆30 · Updated 2 years ago
- ☆273 · Updated last year
- Run PyTorch in JAX. ☆256 · Updated last week
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 · Updated 2 years ago
- Efficient optimizers ☆232 · Updated last week
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆112 · Updated 3 years ago
- JAX Synergistic Memory Inspector ☆175 · Updated 11 months ago
- Experiment of using Tangent to autodiff triton ☆79 · Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆84 · Updated last year
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆127 · Updated 2 years ago