lixilinx / psgd_torch
PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioners, low-rank approximation preconditioner, and more)
☆168 Updated 2 months ago
Alternatives and similar repositories for psgd_torch:
Users who are interested in psgd_torch are comparing it to the libraries listed below.
- ☆161 Updated 3 months ago
- Implementation of PSGD optimizer in JAX ☆28 Updated 2 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆107 Updated 2 years ago
- 🧱 Modula software package ☆151 Updated this week
- ☆212 Updated 7 months ago
- Efficient optimizers ☆177 Updated last week
- LoRA for arbitrary JAX models and functions ☆135 Updated last year
- ☆52 Updated 5 months ago
- supporting pytorch FSDP for optimizers ☆77 Updated 2 months ago
- minGPT in JAX ☆47 Updated 3 years ago
- Running Jax in PyTorch Lightning ☆86 Updated 2 months ago
- JMP is a Mixed Precision library for JAX. ☆191 Updated last month
- Experiment of using Tangent to autodiff triton ☆76 Updated last year
- ☆59 Updated 2 years ago
- A functional training loops library for JAX ☆86 Updated last year
- Named tensors with first-class dimensions for PyTorch ☆321 Updated last year
- ☆219 Updated 2 weeks ago
- Automatically take good care of your preemptible TPUs ☆36 Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT ☆56 Updated 2 years ago
- A simple library for scaling up JAX programs ☆133 Updated 4 months ago
- JAX Synergistic Memory Inspector ☆168 Updated 7 months ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆183 Updated 2 months ago
- Neural Networks for JAX ☆83 Updated 5 months ago
- Pytorch-like dataloaders for JAX. ☆75 Updated 4 months ago
- A port of muP to JAX/Haiku ☆25 Updated 2 years ago
- ☆75 Updated 7 months ago
- Lightning-like training API for JAX with Flax ☆38 Updated 2 months ago
- seqax = sequence modeling + JAX ☆145 Updated this week
- ☆112 Updated 3 weeks ago
- Flow-matching algorithms in JAX ☆85 Updated 6 months ago