ZQZCalin / trainitLinks
☆13Updated last week
Alternatives and similar repositories for trainit
Users that are interested in trainit are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated 2 weeks ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆23Updated 11 months ago
- PyTorch linear operators for curvature matrices (Hessian, Fisher/GGN, KFAC, ...)☆57Updated this week
- Sketched linear operations for PyTorch☆96Updated last week
- Parameter-Free Optimizers for Pytorch☆131Updated last year
- 🧱 Modula software package☆299Updated 2 months ago
- Pytorch-like dataloaders for JAX.☆93Updated 5 months ago
- ☆45Updated last week
- Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and…☆36Updated 2 weeks ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- Minimal, lightweight JAX implementations of popular models.☆117Updated this week
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆190Updated 10 months ago
- ☆220Updated 10 months ago
- LoRA for arbitrary JAX models and functions☆141Updated last year
- supporting pytorch FSDP for optimizers☆83Updated 10 months ago
- Implementation of PSGD optimizer in JAX☆35Updated 10 months ago
- IVON optimizer for neural networks based on variational learning.☆72Updated 11 months ago
- ☆58Updated last year
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆14Updated 11 months ago
- ☆120Updated 4 months ago
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆63Updated 2 years ago
- A simple library for scaling up JAX programs☆144Updated last year
- ☆17Updated last year
- ☆13Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37Updated last year
- Lightning-like training API for JAX with Flax☆44Updated 10 months ago
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 months ago
- ☆17Updated 2 years ago
- ☆71Updated 10 months ago
- A library for unit scaling in PyTorch☆132Updated 3 months ago