ZQZCalin / trainitLinks
☆13Updated last month
Alternatives and similar repositories for trainit
Users that are interested in trainit are comparing it to the libraries listed below
Sorting:
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆23Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated last month
- PyTorch linear operators for curvature matrices (Hessian, Fisher/GGN, KFAC, ...)☆59Updated last week
- ☆47Updated last month
- IVON optimizer for neural networks based on variational learning.☆72Updated last year
- Sketched linear operations for PyTorch☆97Updated last month
- Amortized Probabilistic Conditioning for Optimization, Simulation and Inference (Chang et al., AISTATS 2025)☆21Updated 5 months ago
- Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and…☆39Updated 3 weeks ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆190Updated 11 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆63Updated 2 years ago
- Pytorch-like dataloaders for JAX.☆97Updated 6 months ago
- ☆72Updated 11 months ago
- Sampling with gradient-based Markov Chain Monte Carlo approaches☆108Updated last year
- Distributed K-FAC preconditioner for PyTorch☆91Updated this week
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- ☆16Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 3 years ago
- ☆61Updated last year
- ☆20Updated last year
- ☆224Updated 11 months ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆68Updated last year
- Lightning-like training API for JAX with Flax☆44Updated 11 months ago
- Agustinus' very opiniated publication-ready plotting library☆69Updated 6 months ago
- ☆17Updated last year
- ☆11Updated 4 years ago
- A general-purpose, deep learning-first library for constrained optimization in PyTorch☆145Updated 2 weeks ago
- 🧱 Modula software package☆307Updated 3 months ago
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…☆17Updated 2 years ago