lixilinx / psgd_torch
PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioners, low-rank approximation preconditioner, and more)
☆175 · Updated last week
Alternatives and similar repositories for psgd_torch
Users who are interested in psgd_torch are comparing it to the libraries listed below.
- ☆185 · Updated 6 months ago
- 🧱 Modula software package ☆194 · Updated 2 months ago
- Efficient optimizers ☆206 · Updated last week
- JMP is a Mixed Precision library for JAX. ☆199 · Updated 4 months ago
- Experiment of using Tangent to autodiff Triton ☆79 · Updated last year
- A simple library for scaling up JAX programs ☆137 · Updated 7 months ago
- LoRA for arbitrary JAX models and functions ☆136 · Updated last year
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆187 · Updated 6 months ago
- ☆53 · Updated 8 months ago
- A library for unit scaling in PyTorch ☆125 · Updated 6 months ago
- Implementation of PSGD optimizer in JAX ☆33 · Updated 5 months ago
- A functional training loops library for JAX ☆88 · Updated last year
- ☆60 · Updated 3 years ago
- Supporting PyTorch FSDP for optimizers ☆79 · Updated 5 months ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆75 · Updated 10 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆70 · Updated this week
- ☆228 · Updated 3 months ago
- ☆17 · Updated 9 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆110 · Updated 3 years ago
- ☆267 · Updated 10 months ago
- seqax = sequence modeling + JAX ☆155 · Updated last month
- Named tensors with first-class dimensions for PyTorch ☆331 · Updated last year
- Automatically take good care of your preemptible TPUs ☆36 · Updated 2 years ago
- Accelerated First Order Parallel Associative Scan ☆181 · Updated 9 months ago
- JAX/Flax rewrite of Karpathy's nanoGPT ☆57 · Updated 2 years ago
- Neural Networks for JAX ☆84 · Updated 8 months ago
- ☆32 · Updated 8 months ago
- JAX Synergistic Memory Inspector ☆173 · Updated 10 months ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆94 · Updated 6 months ago
- ☆36 · Updated last year