ZQZCalin / trainitLinks
☆13Updated 2 weeks ago
Alternatives and similar repositories for trainit
Users that are interested in trainit are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆183Updated 2 weeks ago
- ☆40Updated last month
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆23Updated 11 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- ☆17Updated last year
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆63Updated 2 years ago
- Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and…☆33Updated last month
- ☆120Updated 4 months ago
- ☆58Updated last year
- If it quacks like a tensor...☆59Updated 10 months ago
- Pytorch-like dataloaders for JAX.☆93Updated 4 months ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆190Updated 10 months ago
- ☆39Updated last year
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆14Updated 11 months ago
- 🧱 Modula software package☆282Updated last month
- A simple library for scaling up JAX programs☆143Updated 11 months ago
- ☆216Updated 10 months ago
- ☆19Updated last year
- ☆33Updated last year
- IVON optimizer for neural networks based on variational learning.☆72Updated 11 months ago
- Minimal but scalable implementation of large language models in JAX☆35Updated last month
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆57Updated last month
- Minimal, lightweight JAX implementations of popular models.☆110Updated this week
- Code for the paper "Function-Space Learning Rates"☆23Updated 4 months ago
- Implementation of PSGD optimizer in JAX☆33Updated 9 months ago
- LoRA for arbitrary JAX models and functions☆141Updated last year
- Sketched linear operations for PyTorch☆71Updated this week
- ☆67Updated 10 months ago
- A Python package of computer vision models for the Equinox ecosystem.☆108Updated last year