ZQZCalin / trainit
☆11 Updated 4 months ago
Alternatives and similar repositories for trainit
Users who are interested in trainit are comparing it to the libraries listed below
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆179 Updated this week
- ☆37 Updated 3 weeks ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705) ☆22 Updated 10 months ago
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule ☆63 Updated 2 years ago
- Pytorch-like dataloaders for JAX. ☆94 Updated 3 months ago
- 🧱 Modula software package ☆231 Updated 2 weeks ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆189 Updated 8 months ago
- Parameter-Free Optimizers for Pytorch ☆130 Updated last year
- ☆207 Updated 9 months ago
- ☆115 Updated 2 months ago
- Unofficial implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ☆56 Updated last month
- ☆57 Updated 11 months ago
- IVON optimizer for neural networks based on variational learning. ☆71 Updated 9 months ago
- ☆70 Updated 8 months ago
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 Updated last year
- Minimal, lightweight JAX implementations of popular models. ☆96 Updated last week
- Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and… ☆33 Updated last week
- A simple library for scaling up JAX programs ☆143 Updated 10 months ago
- ☆40 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆36 Updated last year
- Agustinus' very opinionated publication-ready plotting library ☆69 Updated 4 months ago
- Sketched matrix decompositions for PyTorch ☆70 Updated 3 weeks ago
- PyTorch linear operators for curvature matrices (Hessian, Fisher/GGN, KFAC, ...) ☆41 Updated last week
- Minimal yet performant LLM examples in pure JAX ☆150 Updated last week
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative… ☆17 Updated 2 years ago
- Flow-matching algorithms in JAX ☆104 Updated last year
- ☆19 Updated last year
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023 ☆19 Updated 10 months ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496) ☆14 Updated 10 months ago
- A general-purpose, deep learning-first library for constrained optimization in PyTorch ☆137 Updated 2 months ago